Groq is an AI chip startup founded by former Google employees. The company has introduced an AI accelerator chip called the LPU (Language Processing Unit), which significantly speeds up inference and text generation for large models, reaching speeds up to 10 times those of GPUs. The gain comes primarily from high-speed on-chip SRAM and an architecture designed to minimize memory access. Users can run various large models, such as Llama and Mixtral, on the LPU. The LPU could further optimize large-model performance and may be used to improve the responsiveness of applications such as voice assistants and AI writing tools.
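The reasoning behind the speedup can be made concrete with a back-of-envelope calculation: autoregressive generation must stream the model's weights from memory for each token, so throughput is roughly memory bandwidth divided by model size. The sketch below uses illustrative numbers only (the 7B-parameter model size and the bandwidth figures are assumptions for the estimate, not vendor specifications):

```python
# Rough estimate of why LLM token generation is memory-bandwidth bound,
# and why faster on-chip memory helps. All figures are illustrative
# assumptions, not measured numbers for any specific chip.

def tokens_per_second(model_params: float, bytes_per_param: float,
                      bandwidth_bytes_per_s: float) -> float:
    """Each generated token streams all weights once, so throughput
    is approximately bandwidth / model size in bytes."""
    model_bytes = model_params * bytes_per_param
    return bandwidth_bytes_per_s / model_bytes

# Assumed figures: a 7B-parameter model in fp16 (2 bytes per parameter),
# GPU off-chip memory at ~2 TB/s vs. hypothetical on-chip SRAM at ~80 TB/s.
gpu_tps = tokens_per_second(7e9, 2, 2e12)
sram_tps = tokens_per_second(7e9, 2, 80e12)

print(f"Off-chip memory estimate: ~{gpu_tps:.0f} tokens/s")
print(f"On-chip SRAM estimate:    ~{sram_tps:.0f} tokens/s")
```

Under these assumed numbers, the bandwidth ratio translates directly into the throughput ratio, which is why an architecture that keeps weights in fast SRAM and avoids off-chip memory trips can generate tokens much faster than one bottlenecked on external memory.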
As large-model generation speed becomes a bottleneck, a team of former Google employees develops a new chip, the LPU

新硅
This article is from AIbase Daily