Amid the intensifying global AI chip race, startup Positron has officially unveiled its new AI inference chip, Asimov. The company claims that the chip, which is deeply optimized for large language model (LLM) inference, will deliver five times the energy efficiency (tokens per watt) and cost efficiency (tokens per dollar) of NVIDIA's next-generation Rubin architecture. The bold claim immediately drew widespread attention across the industry.
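Both headline metrics are simple ratios. A minimal sketch of the arithmetic, using made-up numbers (Positron has not published figures at this level of detail), shows what a fivefold advantage would mean in practice:

```python
# Hypothetical illustration of the two efficiency metrics behind the claim.
# All numbers are invented for the arithmetic only.

def tokens_per_watt(tokens_per_second: float, power_watts: float) -> float:
    """Energy efficiency: sustained throughput normalized by power draw."""
    return tokens_per_second / power_watts

def tokens_per_dollar(total_tokens: float, total_cost_usd: float) -> float:
    """Cost efficiency: tokens served per dollar of hardware/operating cost."""
    return total_tokens / total_cost_usd

# Assumed baseline (a stand-in for a Rubin-class accelerator):
baseline = tokens_per_watt(tokens_per_second=10_000, power_watts=1_000)

# A 5x claim means the same throughput at one fifth the power,
# or equivalently 5x the throughput at the same power:
claimed = tokens_per_watt(tokens_per_second=10_000, power_watts=200)

print(claimed / baseline)  # -> 5.0
```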
Positron's core idea is to redefine the traditional GPU architecture by subtraction. The Asimov chip drops the complex control circuitry found in conventional compute cards in favor of a leaner architecture built purely around tensor processing, minimizing energy lost outside the computation itself. This design not only lets Asimov draw less power when running models of the same scale, it also significantly reduces manufacturing and packaging costs. Given the strict power budgets in today's data centers, the Positron team argues, this kind of extreme energy efficiency will become a decisive factor for enterprises deploying AI services.
Impressive as Asimov's theoretical figures are, challenging NVIDIA's market position is no easy task. Positron is currently building a supporting compiler and development ecosystem so that developers can migrate existing PyTorch or TensorFlow models with minimal friction. The chip is planned on an advanced process node and is hardware-optimized for today's mainstream Transformer architecture, targeting high throughput and low latency on trillion-parameter models.
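Positron has not published details of its toolchain, but one plausible integration path on the PyTorch side is the custom backend hook in `torch.compile`, which hands the captured computation graph to a vendor compiler. The sketch below uses that real PyTorch mechanism; `asimov_backend` and everything inside it are hypothetical stand-ins, and the sketch simply falls back to eager execution so it runs anywhere:

```python
import torch
import torch.nn as nn

def asimov_backend(gm: torch.fx.GraphModule, example_inputs):
    """Hypothetical compiler entry point. torch.compile passes the traced
    FX graph here; a real vendor backend would lower it to the
    accelerator's instruction set. We just return eager execution."""
    return gm.forward

# Any Transformer-style module works as a demo workload.
model = nn.TransformerEncoderLayer(d_model=64, nhead=4, batch_first=True)
compiled = torch.compile(model, backend=asimov_backend)

x = torch.randn(8, 16, 64)  # (batch, seq, d_model)
print(compiled(x).shape)    # -> torch.Size([8, 16, 64])
```

Whether Positron ships exactly this kind of backend is unknown; the point is that PyTorch's compiler stack already provides a standard seam where third-party silicon can plug in, which is what "seamless migration" would likely depend on.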
AIbase believes Positron's entry reflects the AI chip field's broader shift from "general-purpose compute" to "specialized inference." If Asimov delivers on its fivefold efficiency claim, it could reshape the cost structure of the large-model inference market.
Key Points:
🚀 Energy Efficiency Challenge: Asimov claims five times the tokens per watt and tokens per dollar of NVIDIA's upcoming Rubin architecture, staking its position on extreme cost efficiency.
🏗️ Architectural Simplification: By discarding the redundant circuitry of general-purpose compute, the chip uses a specialized architecture focused on tensor computation, sharply cutting energy loss and hardware cost during inference.
🌐 Targeting Large-Scale Inference: The hardware is deeply optimized for the Transformer architecture, aiming at the power bottlenecks and high operating costs of deploying trillion-parameter models.


