Advanced-AI-Hardware-Software-Co-Design
This course provides a hands-on introduction to extreme model quantization, hardware-aware optimization, and on-device deployment for generative AI models. You'll explore advanced techniques to reduce model size, accelerate inference, and deploy compact LLMs on edge devices like Android smartphones.