Short Course: Quantization Fundamentals with Hugging Face
Learn how to quantize any open-source model. Learn to compress models with the Hugging Face Transformers library and the Quanto library.
Generative Models, Compression and Quantization, MultiModal, Transformers, Hugging Face

Short Course: Efficiently Serving LLMs
Understand how LLMs predict the next token and how techniques like KV caching can speed up text generation. Write code to serve LLM applications efficiently to multiple users.
Fine-Tuning, Generative Models, LLMOps, LLM Serving, Transformers, Predibase

Short Course: Open Source Models with Hugging Face
Learn how to easily build AI applications using open-source models and Hugging Face tools. Find and filter open-source models on Hugging Face Hub.
Chatbots, Generative Models, MultiModal, NLP, Prompt Engineering, Transformers, Hugging Face

Short Course: Prompt Engineering with Llama 2 & 3
Learn best practices for prompting and selecting among Meta Llama 2 & 3 models. Interact with Meta Llama 2 Chat, Code Llama, and Llama Guard models.
AI Safety, GenAI Applications, Generative Models, Prompt Engineering, Transformers, Meta

Short Course: Reinforcement Learning From Human Feedback
Get an introduction to tuning and evaluating LLMs using Reinforcement Learning from Human Feedback (RLHF) and fine-tune the Llama 2 model.
Fine-Tuning, Generative Models, LLMOps, Transformers, Google Cloud