The Battle for AI Training Data Intensifies: Datacurve Raises $15 Million, Uses Bounty Hunter Model to Secure High-Quality Data

AIbase基地

Published in AI News · 4 min read · Oct 10, 2025

As AI companies mature, competition for high-quality training data has become one of the industry's fiercest battlegrounds. It has given rise to companies such as Mercor and Surge and, most notably, Scale AI, founded by Alexandr Wang. Now that Wang has taken charge of Meta's AI business, many investors see an opening and are willing to fund companies with compelling new strategies for collecting training data.

Datacurve, a Y Combinator graduate, is one such company. It focuses on supplying high-quality data for software development. On Thursday, the company announced a $15 million Series A round led by Mark Goldberg of Chemistry, with participation from employees at DeepMind, Vercel, Anthropic, and OpenAI. It had previously closed a $2.7 million seed round whose investors included Balaji Srinivasan, the former CTO of Coinbase.


Datacurve uses a "bounty hunter" system to attract skilled software engineers to produce the datasets that are hardest to source. The company pays for these contributions and has distributed more than $1 million in bounties so far.

However, co-founder Serena Ge said that money is not the biggest motivator. For high-value work such as software development, pay for data work will always be far lower than in conventional employment, so the company's most important advantage is a positive user experience.

Ge said the company treats this as a consumer product rather than a data annotation operation. The team has spent a lot of time thinking about how to optimize the platform so that it attracts and engages the people it wants to bring on.

This matters all the more as the demands of post-training data grow more complex. Early models were trained on simple datasets, whereas today's AI products rely on complex reinforcement learning environments that must be built through specific, strategic data collection. As those environments grow more sophisticated, data requirements become stricter in both quantity and quality, which may give high-quality data collection companies like Datacurve an advantage.

As an early-stage company, Datacurve currently focuses on the field of software engineering, but Ge said this model can also apply to fields such as finance, marketing, and even medicine.

Ge explained that the company is building infrastructure for post-training data collection that can attract and retain highly skilled people in their respective fields.

This article is from AIbase Daily
