An efficient AI inference platform designed for data centers.
MiniMax
Input (per 1M tokens)
Output (per 1M tokens)
Context length