Quantization-in-Depth
Public
Dive into advanced quantization techniques. Learn to implement and customize linear quantization functions, measure quantization error, and compress model weights using PyTorch for efficient and accessible AI models.
2-bit-weights, 8-bit-compression, advanced-quantization, ai-optimization, asymmetric-quantization, linear-quantization, machine-learning, model-compression, per-channel-granularity, per-group-granularity
Created: 2024-05-14T21:24:17
Updated: 2024-06-27T02:20:00
https://www.deeplearning.ai/short-courses/quantization-in-depth/
Stars: 4
Stars Increase: 0