Domestic AI has once again made a major breakthrough in the field of code generation. Kwaipilot team under Kuaishou has recently officially open-sourced the KAT-Dev-72B-Exp model, an experimental large language model with 72 billion parameters. It topped the open-source code model ranking list due to its outstanding performance in software engineering benchmark tests, marking a milestone progress in the field of programming assistants for domestic AI.

The KAT-Dev-72B-Exp achieved impressive results in the authoritative SWE-Bench Verified benchmark test, achieving a 74.6% accuracy rate using the strict SWE-agent scaffold evaluation criteria. This achievement not only surpassed all previous open-source models but was also seen as a landmark performance that rivals top closed-source models. As an authoritative evaluation system in the field of software engineering, SWE-Bench focuses on real code repository repair and optimization tasks. The high score of this model proves its stability and practical value in handling complex programming scenarios far beyond expectations.

Industry analysts believe that this achievement stems from the model's end-to-end capabilities in actual development tasks, covering multiple dimensions such as code completion, bug fixing, and system-level refactoring. Compared to traditional code generation tools, this model places more emphasis on the complete software engineering process, helping developers transition from merely writing code to building systems.

image.png

The core advantage of KAT-Dev-72B-Exp lies in its innovative design of the training architecture. The team introduced a large-scale reinforcement learning mechanism combined with a new training engine, achieving shared prefix trajectory and entropy shaping advantage strategy, effectively solving the exploration collapse problem in reinforcement learning training.

The shared prefix trajectory technology reuses the prefix path of frequently used code sequences, allowing the model to efficiently accumulate experience and reduce redundant computation, significantly improving training stability. The entropy shaping advantage draws on principles from information theory, dynamically balancing exploration and exploitation, preventing the model from getting stuck in local optima, ensuring strong generalization ability in complex tasks.

This innovation not only significantly reduced training costs but also provided a valuable experimental platform for subsequent model iterations. As a preview version of the KAT-Coder series, this model is open to the research community and is now available on the Hugging Face platform, encouraging global developers to download, use, and provide feedback.

Kuaishou's initiative has injected new vitality into the global open-source AI ecosystem. The release of KAT-Dev-72B-Exp not only lowers the threshold for using high-end code AI but also provides free and efficient toolchains for small and medium-sized development teams. The Kuaishou StreamLake platform has launched an online trial service for KAT Coder, offering daily free access, allowing more users to experience its powerful performance instantly.

As domestic models continue to break through in parameter scale and task adaptability, AI programming assistants are evolving from auxiliary tools into core productivity. The Kwaipilot team of Kuaishou stated that they will continue to promote the commercialization of the KAT series and provide stronger technical support for the global developer community.

In the context of increasingly fierce competition between China and the United States in AI, the emergence of KAT-Dev-72B-Exp demonstrates the strength of domestic open-source. This breakthrough reminds the industry that technological innovation is not only about scale expansion, but also about accurately solving practical pain points. The model is now available for download in the official Hugging Face repository, and interested developers can go there to experience this new benchmark in the field of code generation.

Address: https://huggingface.co/Kwaipilot/KAT-Dev-72B-Exp