SqueezeLLM
Public
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
efficient-inference, large-language-models, llama, llm, localllm, model-compression, natural-language-processing, post-training-quantization, quantization, small-models
Created: 2023-06-12T11:48:17
Updated: 2025-03-12T01:53:40
https://arxiv.org/abs/2306.07629
Stars: 697
Stars Increase: 0