LLM Model Quantization: An Overview
A General Introduction and Overview of LLM Model Quantization Techniques and Practices
2.83 (3 reviews)

28
students
1 hour
content
Jan 2024
last update
$19.99
regular price
What you will learn
Understand the fundamental principles of model quantization and its critical role in optimizing Large Language Models (LLMs) for diverse applications.
Explore and differentiate between various types of model quantization methods, including post-training quantization, quantization-aware training.
Gain proficiency in implementing model quantization using major frameworks like TensorFlow, PyTorch, ONNX, and NVIDIA TensorRT.
Develop skills to effectively evaluate the performance and quality of quantized LLMs using standard metrics and real-world testing scenarios.
Screenshots




5660131
udemy ID
11/14/2023
course created date
11/15/2023
course indexed date
Bot
course submited by