AIBase
Home
AI NEWS
AI Tools
AI Models
MCP
AI Services
AI Compute
AI Tutorial
EN

AI News

View More

Llama3 Compressed Version! Nvidia Releases Small Language Model Llama-3.1-Minitron4B with Only 400 Million Parameters

Nvidia's research team has successfully launched Llama-3.1-Minitron4B using model pruning and distillation techniques. This is a compressed version of the Llama3 model, aimed at implementing artificial intelligence on devices. The model reduces the parameter count of the original 8B model through deep and width pruning techniques while maintaining performance close to larger models. Despite a significant reduction in training data (by 40 times), the model achieved a 16% performance improvement on the MMLU benchmark.

17.6k 3 hours ago
Llama3 Compressed Version! Nvidia Releases Small Language Model Llama-3.1-Minitron4B with Only 400 Million Parameters
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2025AIBase
Business CooperationSite Map