dash-infer
PublicDashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including CUDA, x86 and ARMv9.
Creat:2024-04-01T16:19:50
Update:2025-03-26T16:25:32
https://dashinfer.readthedocs.io/
262
Stars
0
Stars Increase