ReMax Algorithm Enhances RLHF Efficiency for Large Models, Resolving RTX 4090 Limitations

站长之家

Published inAI News · 1 min read · Oct 20, 2023

Translated Data: The ReMax algorithm is designed for RLHF tasks, featuring observation-based characteristics and greedy reward generation to reduce computational overhead, making it more efficient compared to PPO. Studies show that it decreases GPU memory usage and enhances training speed. Addressing the GPU demand issue for large models, ReMax offers a potentially universal solution.

ReMax RLHF large models GPU computational resources

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Beijing Registers 15 New Generative AI Services

Beijing's internet regulator now requires registration for AI apps using licensed large models, allowing legal operation. As of April 3, 2026, 15 new services have been registered, enhancing public convenience.....

Apr 3, 2026

590

Voice instantly becomes family and friend's voice! Guangqi Honda P7 starts OTA update: AI large model officially arrives in the car

GAC Honda P7 receives OTA update to Zhidao Interconnect 4.2.2, integrating AI models for enhanced cabin interaction, including new 'Voice Cloning' feature and automated travel planning, marking Honda's entry into AI-driven electric vehicles in China.....

Apr 3, 2026

190

Powered by the Apache 2.0 License! Google Gemma 4 is Now Open Source: 31B Parameters Performance Approaches Leading Large Models

Google DeepMind releases Gemma4, a new open-source model with generational performance leap, now under Apache 2.0 license for commercial use. Four variants cater to mobile to workstation needs.....

Apr 3, 2026

1.2k

Xiaomi MiMo Large Model First Token Package: Starting from 39 Yuan

Xiaomi launches MiMo's first Token Plan, offering paid subscriptions for developers and AI enthusiasts with four tiers: Lite (39 yuan/month), Standard (99 yuan/month), Pro (329 yuan/month), and Max, marking its AI ecosystem's entry into a paid era.....

Apr 3, 2026

290

Xiaomi MiMo Large Model Launches Token Subscription Plan: Four Tiers Cover All Modalities Starting at 39 Yuan per Month

Xiaomi launched its first Token Plan subscription package for the MiMo large model at the end of March, offering four tiers ranging from 39 to 659 yuan per month. It uses a unified Credit point system to achieve transparent billing for multi-model and multi-modal calls, covering core models and supporting text, images, audio, etc., marking the start of large-scale delivery in Xiaomi's AI commercialization.

Apr 3, 2026

570

Tencent Cloud Launches Agent Memory Service to Address the Issue of Ephemeral Memory in Large Models

Tencent Cloud launches 'TencentDB Agent Memory' service, offering AI agents long-term memory via a four-tier progressive system that converts fragmented dialogues into structured facts, contextual cognition, and personalized profiles, enhancing answer accuracy.....

Apr 3, 2026

220

Google Plans to Build a 933 MW Natural Gas Power Plant to Support the Operation of Its Large AI Data Centers

Google plans a 933 MW natural gas plant in Texas to power AI data centers, raising concerns over its zero-carbon pledge.....

Apr 3, 2026

240

Microsoft Accelerates Development of In-House AI Models, Aiming to Lead the Industry in Text, Image, and Audio Processing

Microsoft is intensively developing its own AI models, aiming to achieve world-leading capabilities in text, image, and audio processing by 2027 to compete with companies like OpenAI and reduce external dependencies.....

Apr 3, 2026

150

Google Officially Launches Gemma4 Open-Source Large Model: Available in Four Specifications, 31B Version Ranks Third in Global Open-Source List

Google's open-source model Gemma4 enhances 'parameter efficiency', setting new standards for AI workflows. It includes 2.3B/4.5B efficient and 26B/31B high-performance versions, based on Gemini3, all supporting multimodal input, with some enabling real-time voice understanding.....

Apr 3, 2026

240

Qwen 3.6 Officially Released: 1 Million Long Context, Competing with Claude Code

Alibaba released the new generation large language model Qwen3.6-Plus, which is hailed as the strongest domestic programming model at present. Compared to the 3.5 version, its performance has been significantly improved, ranking first among domestic models in multiple programming evaluations, and its overall capabilities are close to the international benchmark Claude series. The model demonstrates a high level of autonomy in front-end development, complex repository tasks, and other areas.

Apr 2, 2026

1.5k

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services​

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

ReMax Algorithm Enhances RLHF Efficiency for Large Models, Resolving RTX 4090 Limitations

站长之家

This article is from AIbase Daily

AI News Recommendations

Beijing Registers 15 New Generative AI Services

Voice instantly becomes family and friend's voice! Guangqi Honda P7 starts OTA update: AI large model officially arrives in the car

Powered by the Apache 2.0 License! Google Gemma 4 is Now Open Source: 31B Parameters Performance Approaches Leading Large Models

Xiaomi MiMo Large Model First Token Package: Starting from 39 Yuan

Xiaomi MiMo Large Model Launches Token Subscription Plan: Four Tiers Cover All Modalities Starting at 39 Yuan per Month

Tencent Cloud Launches Agent Memory Service to Address the Issue of Ephemeral Memory in Large Models

Google Plans to Build a 933 MW Natural Gas Power Plant to Support the Operation of Its Large AI Data Centers

Microsoft Accelerates Development of In-House AI Models, Aiming to Lead the Industry in Text, Image, and Audio Processing

Google Officially Launches Gemma4 Open-Source Large Model: Available in Four Specifications, 31B Version Ranks Third in Global Open-Source List

Qwen 3.6 Officially Released: 1 Million Long Context, Competing with Claude Code

AI News Recommendations

Beijing Registers 15 New Generative AI Services

Voice instantly becomes family and friend's voice! Guangqi Honda P7 starts OTA update: AI large model officially arrives in the car

Powered by the Apache 2.0 License! Google Gemma 4 is Now Open Source: 31B Parameters Performance Approaches Leading Large Models

Xiaomi MiMo Large Model First Token Package: Starting from 39 Yuan

Xiaomi MiMo Large Model Launches Token Subscription Plan: Four Tiers Cover All Modalities Starting at 39 Yuan per Month

Tencent Cloud Launches Agent Memory Service to Address the Issue of Ephemeral Memory in Large Models

Google Plans to Build a 933 MW Natural Gas Power Plant to Support the Operation of Its Large AI Data Centers

Microsoft Accelerates Development of In-House AI Models, Aiming to Lead the Industry in Text, Image, and Audio Processing

Google Officially Launches Gemma4 Open-Source Large Model: Available in Four Specifications, 31B Version Ranks Third in Global Open-Source List

Qwen 3.6 Officially Released: 1 Million Long Context, Competing with Claude Code

GEO Services