WhisperQuantization
PublicWhisperCPP INT8, INT4, INT5, INT6 quantization effect on model latency and WER experiment
Creat:2025-03-09T09:35:58
Update:2025-03-21T08:06:05
https://doi.org/10.48550/arXiv.2503.09905
3
Stars
0
Stars Increase
WhisperCPP INT8, INT4, INT5, INT6 quantization effect on model latency and WER experiment