AIbase | 2025-07-03 11:05:44
Exploring the Compatibility of LLMs with Reinforcement Learning: Shanghai Jiao Tong University Reveals Differences Between Llama and Qwen, Introducing OctoThinker
Large language models (LLMs) have made significant progress on complex reasoning tasks by combining task prompts with large-scale reinforcement learning (RL). Models such as DeepSeek-R1-Zero, which apply RL directly to a base model, have demonstrated strong reasoning capabilities. However, this success has proven difficult to replicate across different base model families, particularly the Llama series. This raises a core question: what factors cause different base models to behave so inconsistently during reinforcement learning? How does reinforcement learning perform in