LLM Model Quantization: An Overview
A General Introduction and Overview of LLM Model Quantization Techniques and Practices

What you will learn
Understand the fundamental principles of model quantization and its critical role in optimizing Large Language Models (LLMs) for diverse applications.
Explore and differentiate between various types of model quantization methods, including post-training quantization, quantization-aware training, and dynamic quantization.
Gain proficiency in implementing model quantization using major frameworks like TensorFlow, PyTorch, ONNX, and NVIDIA TensorRT.
Develop skills to effectively evaluate the performance and quality of quantized LLMs using standard metrics and real-world testing scenarios.
Why take this course?
Course Description:
Embark on a comprehensive journey through the intricacies of LLM Model Quantization with our expert-led course. This engaging curriculum is designed for anyone captivated by the realms of machine learning, natural language processing, and the optimization of AI models across diverse platforms.
What You'll Learn:
- Understanding Quantization: Grasp the core concepts behind model quantization, its importance in optimizing LLMs, and the advantages it brings to various applications.
- Diverse Quantization Methods: Delve into post-training quantization, quantization-aware training, and dynamic quantization to understand their differences and when to apply each.
- Practical Frameworks: Become proficient with cutting-edge frameworks like PyTorch, TensorFlow, ONNX, and NVIDIA TensorRT, learning how each can be leveraged for effective model quantization.
- Performance Evaluation: Learn to accurately assess the impact of quantization on model performance and quality in real-world scenarios.
- Deployment Mastery: Gain the knowledge to successfully deploy quantized LLMs on both edge devices and cloud platforms, navigating the trade-offs, benefits, and challenges along the way.
Course Structure:
Lecture 1: Introduction to Model Quantization
- Overview of model quantization
- Significance in LLMs
- Basic concepts and key benefits
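To ground the basic concepts before the lectures dive deeper, here is a minimal, framework-agnostic sketch of int8 affine quantization in plain Python: a float range is mapped onto 8-bit integers via a scale and a zero point, and dequantizing recovers an approximation of the original values. The function names are illustrative, not from any particular library.

```python
# Minimal sketch of int8 affine (asymmetric) quantization, the core idea
# behind most LLM quantization schemes. Pure Python, framework-agnostic.

def quantize(values, num_bits=8):
    """Map floats to unsigned ints via a scale and a zero point."""
    qmin, qmax = 0, 2 ** num_bits - 1            # 0..255 for 8 bits
    lo, hi = min(values), max(values)
    scale = (hi - lo) / (qmax - qmin) or 1.0     # guard against a constant tensor
    zero_point = round(qmin - lo / scale)        # the int that represents 0.0
    q = [max(qmin, min(qmax, round(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point

def dequantize(q, scale, zero_point):
    """Recover approximate floats from the quantized ints."""
    return [(qi - zero_point) * scale for qi in q]

weights = [-1.2, 0.0, 0.5, 2.3]
q, scale, zp = quantize(weights)
recovered = dequantize(q, scale, zp)
max_err = max(abs(w - r) for w, r in zip(weights, recovered))
# Round-trip error stays within one quantization step (the scale).
assert max_err <= scale
```

The memory saving is the whole point: each float32 weight (4 bytes) is stored as a single int8 byte, at the cost of the small rounding error bounded above.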
Lecture 2: Types and Methods of Model Quantization
- Post-training quantization
- Quantization-aware training
- Dynamic quantization
- A comparative analysis of each method
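To make one point of that comparison concrete: in dynamic quantization, weights are quantized once offline, while activation scales are computed from each incoming input at runtime. The following pure-Python sketch illustrates this under simplified assumptions (per-tensor symmetric scales, a single linear layer); the helper names are hypothetical and do not come from any framework.

```python
# Sketch: dynamic quantization of a linear layer (y = x @ W).
# Weights are quantized once, offline; the activation scale is computed
# from the actual input at inference time (hence "dynamic").

def symmetric_scale(values, num_bits=8):
    """Per-tensor symmetric scale: maps [-max|v|, max|v|] onto the int range."""
    qmax = 2 ** (num_bits - 1) - 1          # 127 for int8
    return max(abs(v) for v in values) / qmax or 1.0

def quantize_sym(values, scale):
    return [round(v / scale) for v in values]

def dynamic_linear(x, w_q, w_scale):
    """Quantize x on the fly, accumulate in integers, rescale to float."""
    x_scale = symmetric_scale(x)            # computed per call -> "dynamic"
    x_q = quantize_sym(x, x_scale)
    acc = [sum(xi * wi for xi, wi in zip(x_q, col)) for col in w_q]
    return [a * x_scale * w_scale for a in acc]

# Offline step: quantize the weights once (two output columns here).
w_cols = [[0.1, -0.3, 0.2], [0.4, 0.0, -0.1]]
w_scale = symmetric_scale([v for col in w_cols for v in col])
w_q = [quantize_sym(col, w_scale) for col in w_cols]

# Online step: each input gets its own activation scale.
y = dynamic_linear([1.0, -2.0, 0.5], w_q, w_scale)
```

Static post-training quantization would instead fix the activation scale ahead of time from calibration data, and quantization-aware training would simulate this rounding during training so the model learns to compensate for it.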
Lecture 3: Frameworks for Model Quantization
- PyTorch's quantization tools
- TensorFlow and TensorFlow Lite
- ONNX quantization capabilities
- The role of NVIDIA TensorRT in quantization
Lecture 4: Evaluating Quantized Models
- Understanding performance metrics like accuracy, latency, and throughput
- Quality metrics such as perplexity, BLEU, ROUGE scores
- Exploring human evaluation and auto-evaluation techniques
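Of the quality metrics above, perplexity is the most common for quantized LLMs: it is the exponential of the average negative log-likelihood the model assigns to the correct next tokens, so a well-quantized model should show only a small increase over the original. A minimal sketch, with made-up per-token probabilities purely for illustration:

```python
import math

# Sketch: perplexity as a quality metric for quantized LLMs.
# Lower is better; 1.0 would mean the model predicted every token perfectly.

def perplexity(token_probs):
    """token_probs: model probability assigned to each actual next token."""
    nll = -sum(math.log(p) for p in token_probs) / len(token_probs)
    return math.exp(nll)

# Illustrative (made-up) probabilities for the same text under two models:
fp32_probs = [0.40, 0.25, 0.60, 0.10]   # original model
int8_probs = [0.38, 0.24, 0.58, 0.09]   # hypothetical quantized model

# The quantized model's perplexity is only slightly higher here, which is
# the kind of small regression a good quantization scheme aims for.
gap = perplexity(int8_probs) - perplexity(fp32_probs)
```

Accuracy-style metrics like BLEU and ROUGE complement this by comparing generated text against references, while latency and throughput capture the speed side of the trade-off.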
Lecture 5: Deploying Quantized Models
- Strategies for deploying on edge devices
- Cloud platform deployment with OpenAI and Azure OpenAI
- Addressing the trade-offs, benefits, and challenges in model deployment
Who Should Take This Course?
- AI & Machine Learning Enthusiasts: If you're passionate about AI and its potential, this course will expand your knowledge.
- Data Scientists & Engineers: Elevate your technical skills with advanced quantization techniques.
- Students in Computer Science: Get a head start on your career with practical insights into LLM model optimization.
- Professionals in AI & NLP Industries: Stay ahead of the curve by mastering the latest trends in model quantization and deployment.
Join us to transform your understanding of Large Language Models and model quantization, opening up a world of possibilities for practical AI applications!