Quantization-Fundamentals-with-Hugging-Face
PublicLearn linear quantization techniques using the Quanto library and downcasting methods with the Transformers library to compress and optimize generative AI models effectively.
compressiondowncastinggenerative-aihugging-facelinear-quantizationmodel-compressionmodel-deploymentmodel-optimizationoptimizequantization
Creat:2024-04-17T21:27:58
Update:2025-01-05T01:40:18
https://www.deeplearning.ai/short-courses/quantization-fundamentals-with-hugging-face/
3
Stars
0
Stars Increase