AIBase
Home
AI NEWS
AI Tools
AI Models
MCP
AI Services
AI Compute
AI Tutorial
EN

Models

View More

DeepSeek R1 Distill Qwen 32B Quantized.w8a8

neuralmagic

D

INT8 quantized version of DeepSeek-R1-Distill-Qwen-32B, reducing VRAM usage and improving computational efficiency through weight and activation quantization.

Natural Language ProcessingTransformersTransformers
neuralmagic
2.3k
9

DeepSeek R1 Distill Qwen 14B Quantized.w8a8

neuralmagic

D

The quantized version of DeepSeek-R1-Distill-Qwen-14B, optimized with INT8 quantization for weights and activations, reducing GPU memory requirements and improving computational efficiency.

Natural Language ProcessingTransformersTransformers
neuralmagic
765
2
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2026AIBase
Business CooperationSite Map