Recently, StepFun officially launched a new deep research intelligent agent model -

Different from common Web Agents on the market that are mainly optimized for short questions,
In order for AI to achieve the research level of human experts,
Currently, the model has achieved a compliance rate of 61.42% in Scale AI's research evaluation metrics, performing well enough to rival the deep research systems of OpenAI and Google. In StepFun's own ADR-Bench Chinese benchmark test, this 32B model even surpassed some larger-scale open-source models, demonstrating high practical value and cost advantages.
Paper: https://arxiv.org/pdf/2512.20491
Key Points:
🧠 Single Agent Architecture:
internalizes planning, searching, verifying, and writing as atomic capabilities of a single model, without needing to call multiple external agents, significantly improving efficiency and reducing costs.Step-DeepResearch 📚 Deep Research Orientation: Unlike simple question-and-answer retrieval, this model supports a context length of up to 128k, enabling it to retrieve information from over 20 million papers and authoritative indexes, generating rigorous structured reports.
🏆 Strong Performance: It performs well in multiple deep research evaluations, achieving professional research standards comparable to large-parameter closed-source models with its 32B size.

