Hugging Face expands its LeRobot platform with training data for self-driving machines

Last year, Hugging Face, the AI dev platform, launched LeRobot, a collection of open AI models, data sets, and tools to help build real-world robotics systems. On Tuesday, Hugging Face teamed up with AI startup Yaak to expand LeRobot with a training set for robots and cars that can navigate environments, like city streets, autonomously.
The new set, called Learning to Drive (L2D), is over a petabyte in size, and contains data from sensors that were installed on cars in German driving schools. L2D captures camera, GPS, and “vehicle dynamics” data from driving instructors and students navigating streets with construction zones, intersections, highways, and more.
There are a number of open self-driving training sets out there from companies including Alphabet’s Waymo and Comma AI. But many of these focus on perception tasks like object detection and tracking, which require high-quality annotations, according to L2D’s creators, making them difficult to scale.

In contrast, L2D is designed to support the development of “end-to-end” learning, its creators claim, which helps predict actions (e.g. when a pedestrian might cross the street) directly from sensor inputs (e.g. camera footage).
“The AI community can now build end-to-end self-driving models,” Yaak co-founder Harsimrat Sandhawalia and Remi Cadene, a member of the AI for robotics team at Hugging Face, wrote in a blog post. “L2D aims to be the largest open-source self-driving data set that empowers the AI community with unique and diverse ‘episodes’ for training end-to-end spatial intelligence.”
Hugging Face and Yaak plan to conduct real-world “closed-loop” testing of models trained using L2D and LeRobot this summer, deployed on a vehicle with a safety driver. The companies are calling on the AI community to submit models and tasks they’d like the models to be evaluated on, like navigating roundabouts and parking spaces.