DLAI Logo
AI is the new electricity and will transform and improve nearly all areas of human lives.

Welcome back!

We'd like to know you better so we can create more relevant courses. What do you do for work?

DLAI Logo
  • Explore Courses
  • Community
    • Forum
    • Events
    • Ambassadors
    • Ambassador Spotlight
  • My Learnings
DLAI Logo

Adhithya Akella, congratulations on completing Reinforcement Fine-Tuning LLMs With GRPO!

Reinforcement Fine-Tuning LLMs With GRPO
Short Course

Reinforcement Fine-Tuning LLMs With GRPO

Improve LLM reasoning with reinforcement fine-tuning and reward functions.

Evaluation and MonitoringFine-TuningGenAI ApplicationsLLMOpsLLM ServingMachine LearningPrompt EngineeringSupervised LearningTransformers
  • Predibase
Predibase
100% Completed
View Course
  • Introduction
    Video
    ・
    3 mins
  • Introduction to reinforcement learning
    Video
    ・
    7 mins
  • Benefits of reinforcement finetuning
    Video
    ・
    4 mins
  • Can a large language model master Wordle
    Video with Code Example
    ・
    10 mins
  • Reward functions
    Video with Code Example
    ・
    10 mins
  • Reward functions with LLM as a judge
    Video with Code Example
    ・
    12 mins
  • Reward hacking
    Video with Code Example
    ・
    7 mins
  • Calculating loss in GRPO
    Video with Code Example
    ・
    18 mins
  • Putting it all together: Training Wordle
    Video with Code Example
    ・
    8 mins
  • Conclusion
    Video
    ・
    1 min
  • Appendix – Tips, Help, and Download
    Code Example
    ・
    1 min