DeepSpeed-MII
PublicMII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
Creat:2022-03-24T06:30:45
Update:2025-03-26T04:04:16
2.0K
Stars
1
Stars Increase
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.