DeepSeek releases V3.2-exp model, pioneering sparse attention mechanism significantly reduces AI inference costs
DeepSeek releases the experimental model V3.2-exp, which adopts an innovative 'sparse attention' mechanism to significantly reduce the cost of long context inference. The model is now available on Hugging Face and GitHub. The core is the 'lightning indexer' and optimized attention mechanisms to improve processing efficiency. This breakthrough technology is expected to promote the development of AI in the field of long text processing.