Apple Launches New AI Training Method, Significantly Improving Model Performance by Replacing Manual Scoring with Task Lists
Apple introduces RLCF, a reinforcement learning method using task lists instead of human ratings, enhancing LLMs' ability to execute complex instructions, contrasting with RLHF's reliance on simple evaluations.....