Origin Lab AI

Origin Lab raises $8 million to glean AI training data from game worlds

Join the must-attend GamesBeat flagship event. This summer in Los Angeles, GamesBeat Summit brings together top leaders, CEOs, and dealmakers on May 18–19 to spark connections and close major deals. Don’t miss where gaming and business converge. To celebrate one year of going independent, enjoy a limited-time buy one, get one free offer—ending soon while supplies last. Secure your spot now before tickets sell out.

Origin Lab, a technology platform turning licensed game worlds into structured training data for world models and multimodal AI, has announced an $8M seed round led by Lightspeed Venture Partners.

Origin Lab’s seed round is intended to accelerate development of the company’s software, capture, enrichment, QA, search, and delivery systems, while expanding its applied research work in world understanding, dataset intelligence, and interactive simulation.

As AI evolves, frontier models need data that reflects how the world actually works: motion, physics, spatial structure, action, environment state, and cause and effect. Much of that data already exists inside video games, and Origin Lab is building a professional platform to connect game publishers to the AI labs that need legally accessed, structured, researcher-ready data.

Origin Lab works directly with video game publishers to license game-world content at the source, capture it through proprietary pipelines, enrich it with structured metadata, and package datasets to buyer specification. The company has secured exclusive partnerships with more than 20 game publishers representing more than 50 titles and is under contract with a leading frontier AI lab.

Unlike scraped video, Origin Lab’s datasets are rights-cleared, source-controlled, and designed for model training from the start. Its capture and enrichment systems can pair high-fidelity video with structured metadata across gameplay, scene composition, camera movement, player inputs, environment state, and other signals that help AI systems learn not just what a world looks like, but how it behaves.

Origin AI world
Origin Lab turns game worlds into training for AI. Image credit: Origin Lab

“Frontier AI is moving from understanding language to understanding worlds,” said Faraz Fatemi, a partner at Lightspeed Venture Partners, in a press release. “That shift requires a different class of data: licensed, structured, multimodal, and grounded in interactive environments. Origin Lab is building the missing platform between the game industry and the AI labs training the next generation of world models.”

The seed financing will be used to expand Origin Lab’s capture and enrichment technology, deepen partnerships with video game publishers, and grow the engineering and research teams building tools for dataset creation, QA, search, annotation, packaging, and delivery.

“AI has outgrown the data it started with. At the same time, game studios spent decades building some of the richest interactive environments in the world, but had no professional way to bring that data to market,” said Origin Lab founder, co-chief executive officer, and chief commercial officer Anne-Margot Rodde in a press release.

Origin Lab works with an emerging category of data that the company calls Artificial World Intelligence: licensed, structured data and systems for AI models that must understand, simulate, and interact with complex environments. The company’s initial focus is video games, where game worlds, interactivity, physics, player behavior, and controllable capture can produce training signals that cannot be reproduced from web-scale scraping alone. Origin Lab’s applied research efforts extend the platform beyond data delivery toward the technical foundation for training and evaluating AI systems in complex interactive worlds.