Red Hat, a global leader in open-source solutions, recently announced the launch of llm-d, a new open-source project designed to address the pressing need for large-scale generative AI inference. The project brings together industry heavyweights CoreWeave, Google Cloud, IBM Research, and NVIDIA as founding contributors, with the aim of meeting the most demanding production requirements for large language model inference in the cloud.

Inference Era Arrives, Challenges Mount

According to a recent Gartner prediction, "By 2028, more than 80% of data center workload accelerators will be exclusively deployed for inference, rather than training purposes, as the market matures." This trend underscores the strategic importance of inference technology.

However, as inference models become increasingly complex and larger in scale, the rapid rise in resource demands is limiting the feasibility of centralized inference. Excessive costs and prolonged delays could become critical bottlenecks to AI innovation, urgently requiring new technological solutions.


llm-d: Revolutionary Breakthroughs in Unified Platforms

Red Hat and its partners are tackling this challenge head-on with the llm-d project, which aims to integrate advanced inference capabilities into existing enterprise IT infrastructure. This unified platform is intended to let IT teams deploy innovative technologies that maximize efficiency while meeting the diverse service needs of critical business workloads, significantly reducing the total cost of ownership for high-performance AI accelerators.

The core value of this solution lies in breaking the limitations of traditional inference deployment, offering enterprises a more flexible, efficient, and economical choice for AI inference.
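In practical terms, llm-d builds on vLLM, which serves models through an OpenAI-compatible HTTP API, so client code does not need to change when inference moves onto an llm-d cluster. The sketch below illustrates the request shape such a client would send; the endpoint URL and model name are hypothetical placeholders, not values from the project itself.

```python
import json

# Hypothetical values for illustration only; the real endpoint and model
# name depend on how llm-d is deployed in a given cluster.
LLM_D_ENDPOINT = "http://llm-d-gateway.example.internal/v1/completions"
MODEL_NAME = "example-org/example-llm"


def build_completion_request(prompt: str, max_tokens: int = 128) -> dict:
    """Build an OpenAI-style completion request body, the API shape
    exposed by vLLM-based inference servers."""
    return {
        "model": MODEL_NAME,
        "prompt": prompt,
        "max_tokens": max_tokens,
        "temperature": 0.7,
    }


payload = build_completion_request("Summarize the llm-d announcement.")
# A client would POST this JSON body to LLM_D_ENDPOINT.
print(json.dumps(payload, indent=2))
```

Because the API surface stays the same, the scheduling and disaggregation work happens behind the gateway, invisible to applications.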

Strong Industry Alliance Support

The llm-d project has garnered strong support from a powerful alliance comprising generative AI model providers, AI accelerator pioneers, and major AI cloud platforms. In addition to the four founding contributors, important enterprises such as AMD, Cisco, Hugging Face, Intel, Lambda, and Mistral AI have also joined as partners, demonstrating the depth of collaboration across the industry in building the future of large-scale LLM services.

Industry Leaders Respond Positively

Mark Lohmeyer, Vice President and General Manager of Google Cloud AI and Compute Infrastructure, emphasized: "Efficient AI inference is crucial in enabling enterprises to deploy AI at scale and create value for users. As we enter the era of inference, Google Cloud is proud to be a founding contributor to the llm-d project, building on our tradition of open-source contributions."

Ujval Kapasi, Vice President of Engineering AI Frameworks at NVIDIA, stated: "The llm-d project is a significant addition to the open-source AI ecosystem, reflecting NVIDIA's commitment to collaborating to drive generative AI innovation. Scalable, high-performance inference is key to the next wave of generative AI and agentic AI. We are working with Red Hat and other supporting partners to accelerate the development of llm-d using NVIDIA innovations like NIXL."

Open Source Driving Industrial Transformation

The launch of the llm-d project marks a new phase in the field of AI inference. By leveraging the open-source model to gather industry wisdom, this project not only aims to address current challenges in cost and performance for large-scale inference but also lays a solid foundation for the sustainable development of the entire AI ecosystem.

With more companies and developers getting involved, llm-d has the potential to become a significant force in driving the standardization and popularization of AI inference technology, fully preparing for the upcoming era of inference.