LLM Model Quantization: An Overview

A General Introduction and Overview of LLM Model Quantization Techniques and Practices
2.83 (3 reviews)
Udemy
platform
English
language
Other
category
instructor
LLM Model Quantization: An Overview
28
students
1 hour
content
Jan 2024
last update
$19.99
regular price

What you will learn

Understand the fundamental principles of model quantization and its critical role in optimizing Large Language Models (LLMs) for diverse applications.

Explore and differentiate between various types of model quantization methods, including post-training quantization, quantization-aware training.

Gain proficiency in implementing model quantization using major frameworks like TensorFlow, PyTorch, ONNX, and NVIDIA TensorRT.

Develop skills to effectively evaluate the performance and quality of quantized LLMs using standard metrics and real-world testing scenarios.

Screenshots

LLM Model Quantization: An Overview - Screenshot_01LLM Model Quantization: An Overview - Screenshot_02LLM Model Quantization: An Overview - Screenshot_03LLM Model Quantization: An Overview - Screenshot_04
5660131
udemy ID
11/14/2023
course created date
11/15/2023
course indexed date
Bot
course submited by