SqueezeLLM
Public[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
efficient-inferencelarge-language-modelsllamallmlocalllmmodel-compressionnatural-language-processingpost-training-quantizationquantizationsmall-models
Erstellungszeit:2023-06-12T11:48:17
Aktualisierungszeit:2025-03-12T01:53:40
https://arxiv.org/abs/2306.07629
698
Stars
1
Stars Increase