DLAI Logo
AI is the new electricity and will transform and improve nearly all areas of human lives.

Welcome back!

We'd like to know you better so we can create more relevant courses. What do you do for work?

DLAI Logo
  • Explore Courses
  • Community
    • Forum
    • Events
    • Ambassadors
    • Ambassador Spotlight
  • My Learnings
  • daily streak fire

    You've achieved today's streak!

    Complete one lesson every day to keep the streak going.

    Su

    Mo

    Tu

    We

    Th

    Fr

    Sa

    free pass got

    You earned a Free Pass!

    Free Passes help protect your daily streak. Complete more lessons to earn up to 3 Free Passes.

    Free PassFree PassFree Pass
Reinforcement Fine-Tuning LLMs With GRPO
  • Introduction
    Video
    ・
    3 mins
  • Introduction to reinforcement learning
    Video
    ・
    7 mins
  • Benefits of reinforcement finetuning
    Video
    ・
    4 mins
  • Can a large language model master Wordle
    Video with Code Example
    ・
    10 mins
  • Reward functions
    Video with Code Example
    ・
    10 mins
  • Reward functions with LLM as a judge
    Video with Code Example
    ・
    12 mins
  • Reward hacking
    Video with Code Example
    ・
    7 mins
  • Calculating loss in GRPO
    Video with Code Example
    ・
    18 mins
  • Putting it all together: Training Wordle
    Video with Code Example
    ・
    8 mins
  • Conclusion
    Video
    ・
    1 min
  • Appendix – Tips, Help, and Download
    Code Example
    ・
    1 min
  • Course Feedback
  • Community
  • 0%