AI model architectures are undergoing a profound transformation. Diffusion language models, with their parallel generation and efficient inference, are drawing growing industry attention. On October 9th, the AI research lab Radical Numerics released RND1-Base, the largest open-source diffusion language model to date: 30B parameters, of which 3B are active, built on a sparse mixture-of-experts (MoE) architecture. The model not only performs well on benchmarks but also ships with full weights, training recipes, and inference code, aiming to accelerate post-training and inference research on diffusion language models.
RND1-Base starts from the autoregressive base model Qwen3-30B-A3B and is converted to the diffusion paradigm through simple continued pre-training. The conversion replaces causal attention with a bidirectional masking scheme and applies layer-specific learning rates to preserve the knowledge already in the base model, while large-batch training of up to 8M tokens keeps optimization stable; pre-training then continues for 500B tokens. This approach avoids the cost of training a diffusion model from scratch and reflects Radical Numerics' emphasis on reusing existing models.
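To make the conversion recipe concrete, here is a minimal, hypothetical sketch of what such continued pre-training could look like: a bidirectional (non-causal) transformer trained to recover randomly masked tokens, with layer-specific learning rates so that lower layers, which carry most of the base model's knowledge, move more slowly. The toy model, schedule, and hyperparameters below are illustrative assumptions, not RND1's actual code or settings.

```python
# Minimal sketch (not the official recipe): continued pre-training that nudges an
# autoregressive backbone toward a masked-diffusion objective. Model size, learning
# rates, and the masking schedule are illustrative placeholders, not RND1's values.
import torch
import torch.nn as nn

VOCAB, MASK_ID, D_MODEL, N_LAYERS = 32_000, 0, 512, 8

class ToyBidirectionalLM(nn.Module):
    """Stand-in for an AR backbone whose causal attention mask has been dropped."""
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, D_MODEL)
        block = nn.TransformerEncoderLayer(D_MODEL, nhead=8, batch_first=True)
        self.layers = nn.TransformerEncoder(block, num_layers=N_LAYERS)  # no causal mask: bidirectional
        self.head = nn.Linear(D_MODEL, VOCAB)

    def forward(self, ids):
        return self.head(self.layers(self.embed(ids)))

model = ToyBidirectionalLM()

# Layer-specific learning rates: earlier layers move less, preserving AR knowledge.
groups = []
for i, block in enumerate(model.layers.layers):
    lr = 1e-4 * (0.5 + 0.5 * i / (N_LAYERS - 1))          # hypothetical depth-dependent schedule
    groups.append({"params": block.parameters(), "lr": lr})
groups.append({"params": list(model.embed.parameters()) + list(model.head.parameters()), "lr": 1e-4})
opt = torch.optim.AdamW(groups)

# One masked-denoising step: corrupt a random fraction of tokens, then train the
# model to recover them from bidirectional context (instead of next-token prediction).
ids = torch.randint(1, VOCAB, (4, 128))                    # a dummy batch of token ids
mask_ratio = torch.rand(4, 1).clamp(min=0.15)              # per-sequence noise level
mask = torch.rand(4, 128) < mask_ratio
corrupted = ids.masked_fill(mask, MASK_ID)
logits = model(corrupted)
loss = nn.functional.cross_entropy(logits[mask], ids[mask])
loss.backward()
opt.step()
opt.zero_grad()
```

The key idea is that only the objective changes (masked denoising instead of next-token prediction), while the weights and most of the optimization setup are carried over from the autoregressive base.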
Unlike traditional autoregressive language models that generate tokens one at a time, RND1 treats text generation as a process akin to image denoising: it starts from a fully noised (masked) sequence and refines the whole sequence in parallel, using bidirectional attention. This increases the flexibility and controllability of generation and can significantly reduce inference latency, making the approach particularly attractive for complex reasoning and code generation tasks.
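The generation loop itself can be sketched in a few lines. The sampler below is a generic masked-diffusion decoder, not RND1's released sampler: it starts from an all-mask sequence and, over a fixed number of parallel refinement steps, commits the most confident predictions while leaving the rest masked for later steps.

```python
# Illustrative masked-diffusion decoder (a generic sketch, not RND1's released sampler):
# begin with an all-[MASK] sequence and refine it in parallel over a few steps,
# committing the highest-confidence predictions at each step.
import torch

@torch.no_grad()
def diffusion_decode(model, seq_len=64, steps=8, mask_id=0):
    ids = torch.full((1, seq_len), mask_id, dtype=torch.long)    # pure "noise": everything masked
    done = torch.zeros(1, seq_len, dtype=torch.bool)
    for step in range(steps):
        if done.all():
            break
        logits = model(ids)                                      # one bidirectional pass over the whole sequence
        conf, pred = logits.softmax(-1).max(-1)                  # per-position confidence and argmax token
        conf = conf.masked_fill(done, -1.0)                      # never re-select committed positions
        remaining = seq_len - int(done.sum())
        k = max(1, remaining // (steps - step))                  # commit an equal share each step
        top = conf.topk(k, dim=-1).indices
        ids.scatter_(1, top, pred.gather(1, top))
        done.scatter_(1, top, True)
    return ids

# Usage with the toy model from the earlier sketch (any (B, L) -> (B, L, vocab) model works):
# out = diffusion_decode(model, seq_len=32, steps=4, mask_id=MASK_ID)
```

Because every step scores the entire sequence at once, the number of forward passes is tied to the number of refinement steps rather than the sequence length, which is where the latency advantage over token-by-token decoding comes from.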
In general benchmark tests, RND1-Base performs strongly, surpassing earlier open-source diffusion language models such as Dream-7B and LLaDA-8B. Reported results include 57.2% on MMLU (multi-task language understanding), 72.1% on GSM8K (mathematical reasoning), and 51.3% on MBPP (code generation). These benchmarks span reasoning, STEM, and programming, suggesting that the conversion preserves much of the autoregressive base's capability while gaining the benefits of the diffusion architecture.
RND1's sparse mixture-of-experts design activates only 3B of its 30B parameters per token, improving computational efficiency and making large-scale deployment more practical. The model has not yet gone through post-training and can occasionally repeat itself under greedy sampling, but the open-source code integrates FlashInfer and SGLang backends to support fast inference iteration.
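For readers unfamiliar with sparse MoE, the sketch below shows the routing idea behind "activates only 3B of 30B parameters": each token is sent to a small top-k subset of expert feed-forward networks, so per-token compute scales with the active experts rather than the full parameter count. The expert count, top-k value, and dimensions are invented for illustration and are not RND1's configuration.

```python
# Hedged sketch of sparse mixture-of-experts routing, to illustrate why only a small
# fraction of total parameters is active per token. Expert count, top-k, and
# dimensions are made up for illustration, not RND1's actual configuration.
import torch
import torch.nn as nn

class SparseMoE(nn.Module):
    def __init__(self, d_model=512, n_experts=16, top_k=2):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)              # scores each token against each expert
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, 4 * d_model), nn.GELU(), nn.Linear(4 * d_model, d_model))
            for _ in range(n_experts)
        ])
        self.top_k = top_k

    def forward(self, x):                                        # x: (tokens, d_model)
        weights, idx = self.router(x).topk(self.top_k, dim=-1)   # pick the top-k experts per token
        weights = weights.softmax(dim=-1)                        # normalize over the chosen experts only
        out = torch.zeros_like(x)
        for slot in range(self.top_k):
            for e, expert in enumerate(self.experts):
                hit = idx[:, slot] == e                          # tokens routed to expert e in this slot
                if hit.any():
                    out[hit] += weights[hit, slot:slot + 1] * expert(x[hit])
        return out                                               # each token only ever touches top_k experts

moe = SparseMoE()
print(moe(torch.randn(10, 512)).shape)                           # torch.Size([10, 512])
```

In a full model, a block like this replaces the dense feed-forward sublayer in each transformer layer; RND1 inherits its MoE layout from the Qwen3-30B-A3B base it was converted from.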
Radical Numerics positions itself as a next-generation AI lab focused on building a recursive self-improvement engine. RND1 is a product of that vision: through an automated AI research platform, models help optimize the next generation of models. The team includes researchers and engineers from top institutions such as DeepMind, Meta, Liquid, and Stanford, with the goal of enabling AI to design AI autonomously and accelerating scientific and industrial discovery.
The purpose of open-sourcing RND1 is to encourage the community to explore diffusion language models for inference optimization and post-training. Diffusion models in the language domain are moving from experiments toward practical use, with particular advantages in parallel generation of long sequences. Industry observers expect the release to spur more experiments in converting autoregressive models into diffusion models, filling a gap in the open-source ecosystem for efficient generative models.
Although RND1 leads in scale and performance, the generalization ability and memory overhead of diffusion models still leave room for improvement. Combining the approach with multi-objective fine-tuning or hybrid architectures is expected to further unlock its potential. Radical Numerics is also hiring, welcoming AI professionals to join this cutting-edge exploration.
This breakthrough marks an important turning point for diffusion language models, transitioning from theoretical exploration to engineering practice. By open-sourcing such a large-scale diffusion model, Radical Numerics not only provides the research community with valuable tools, but also opens new possibilities for AI self-improvement and recursive optimization. As more researchers get involved in this field, diffusion language models may become a key direction for the next generation of AI architectures.