Nanoflow
Public
A throughput-oriented high-performance serving framework for LLMs
Created: 2024-08-19T14:39:19
Updated: 2025-02-28T16:05:08
https://arxiv.org/abs/2408.12757
Stars: 865
Stars Increase: 3