AIToday

XDOF, a startup building robot training data infrastructure, has raised $70 million(約110億円) and is working with 20 customers including frontier AI labs—addressing a critical bottleneck that the largest AI companies prefer to outsource rather than build themselves.

TechCrunch AI19h ago3 min read
XDOF, a startup building robot training data infrastructure, has raised $70 million(約110億円) and is working with 20 customers including frontier AI labs—addressing a critical bottleneck that the largest AI companies prefer to outsource rather than build themselves.

Summaries like this, in your inbox every morning.

Sign up free →

3 Key Points

  1. 1

    What happened: XDOF emerged from stealth with $70 million(約110億円) in funding from Thrive Capital, Spark Capital, a16z, Lux, and WndrCo. The company, founded by Philipp Wu, Fred Shentu, and Nemo Jin, is building data pipelines, collection tools, and annotation systems for robot learning. It is already working with 20 customers, including several frontier AI labs, and has partnered with UC Berkeley's AI Research lab to release a dataset called ABC, which includes 130,000 trajectories of robot manipulation data, 300 hours of simulation, and 100 hours of evaluations.

  2. 2

    Why it matters: Unlike large language models trained on publicly available text, robots require high-fidelity physical interaction data that barely exists today. Frontier AI labs recognize they need this data but face a practical choice: build massive in-house data collection operations (requiring hundreds of thousands of square feet of warehouse space, hundreds of robots, and trained operators worldwide) or outsource to specialized infrastructure firms. XDOF is betting that labs will choose outsourcing, positioning data pipelines as the next critical bottleneck in AI development.

  3. 3

    What to watch: XDOF plans to work across three tiers of data collection—teleoperation on actual deployment robots, general teleoperated robot data via low-cost systems like GELLO, and egocentric data from wearable sensors worn by humans. The company intends to hire and train teleoperators and data operators globally, making execution on labor-intensive operations a key determinant of whether it can serve the scale frontier labs require.

Discussion

No comments yet. Be the first to share your thoughts!

Log in to join the discussion

Related Articles

Stay ahead with AI news

Get curated AI news from 200+ sources delivered daily to your inbox. Free to use.

Get Started Free

Free · takes 30 seconds · unsubscribe anytime

5 minutes a day. The AI essentials.

200+ sources · Email / LINE / Slack

Get it free →