DLAI Logo
AI is the new electricity and will transform and improve nearly all areas of human lives.

Welcome back!

We'd like to know you better so we can create more relevant courses. What do you do for work?

DLAI Logo
  • Explore Courses
  • Community
    • Forum
    • Events
    • Ambassadors
    • Ambassador Spotlight
  • My Learnings
  • daily streak fire

    You've achieved today's streak!

    Complete one lesson every day to keep the streak going.

    Su

    Mo

    Tu

    We

    Th

    Fr

    Sa

    free pass got

    You earned a Free Pass!

    Free Passes help protect your daily streak. Complete more lessons to earn up to 3 Free Passes.

    Free PassFree PassFree Pass
Congratulations on making it to the end of this short course. In the short course, you tried your hands at different variants of linear quantization methods and implemented them from scratch using PyTorch. You also built a quantizer to quantize any model in eight-bit precision. Finally, you learned about some important challenges when it comes to quantization, such as weights packing. And you implemented the packing and unpacking algorithm by hand. We encourage you to explore other quantization methods available on the Hugging Face transformers. And we hope this course will give you all the tools you need to get started with quantizing any model. And if you find this course helpful, maybe you can even share it with your friends.
course detail
Next Lesson
Quantization in Depth
  • Introduction
    Video
    ・
    4 mins
  • Overview
    Video
    ・
    3 mins
  • Quantize and De-quantize a Tensor
    Video with Code Example
    ・
    11 mins
  • Get the Scale and Zero Point
    Video with Code Example
    ・
    12 mins
  • Symmetric vs Asymmetric Mode
    Video with Code Example
    ・
    7 mins
  • Finer Granularity for more Precision
    Video with Code Example
    ・
    2 mins
  • Per Channel Quantization
    Video with Code Example
    ・
    11 mins
  • Per Group Quantization
    Video with Code Example
    ・
    7 mins
  • Quantizing Weights & Activations for Inference
    Video with Code Example
    ・
    3 mins
  • Custom Build an 8-Bit Quantizer
    Video with Code Example
    ・
    13 mins
  • Replace PyTorch layers with Quantized Layers
    Video with Code Example
    ・
    5 mins
  • Quantize any Open Source PyTorch Model
    Video with Code Example
    ・
    8 mins
  • Load your Quantized Weights from HuggingFace Hub
    Video with Code Example
    ・
    7 mins
  • Weights Packing
    Video
    ・
    5 mins
  • Packing 2-bit Weights
    Video with Code Example
    ・
    8 mins
  • Unpacking 2-Bit Weights
    Video with Code Example
    ・
    8 mins
  • Beyond Linear Quantization
    Video
    ・
    7 mins
  • Conclusion
    Video
    ・
    1 min
  • Course Feedback
  • Community
  • 0%