2025-03-17 14:51:56.AIbase.16.3k
Lenovo ThinkSystem WA7785a G3 Server Sets Record! Achieves 6708 tokens/s throughput with 671B DeepSeek Large Language Model on a single machine!
The Lenovo ThinkSystem WA7785a G3 server has achieved a record-breaking throughput of 6708 tokens per second while running the 671B parameter DeepSeek large language model. This demonstrates the exceptional performance capabilities of the system for demanding AI workloads.