Star Memory Technology, Incubated by Tsinghua University, Gets First

Star Memory Technology, Incubated by Tsinghua University, Gets First

Star Memory Technology, Incubated by Tsinghua University, Gets First

https://eu.36kr.com/en/p/3740899945136130

Publish Date: 2026-03-27 04:44:00

Source Domain: eu.36kr.com

Text by | Ren Qian

The global competition in the embodied data layer is heating up rapidly. NVIDIA Research released the EgoScale data and training framework in 2026, training the VLA model on ego – centric human operation videos. Using 20,854 hours of first – person human videos with action annotations, they observed a near – log – linear scaling law between data scale and validation loss. 1X collects first – person human and household behavior data, and through the Sunday project, it collects millions of hours of household scenario videos. Guanglun Intelligence adopts a hybrid approach of simulated synthetic data and human video data (EgoSuite), claiming to have cumulatively delivered over one million hours of data, and its valuation has soared to billions of US dollars.

Within just a few months, the industry’s focus has shifted from “who can collect more data” to “who can truly turn human – centric/ego – centric data into high – freedom, high – precision, low – cost, and trainable assets.”

Behind this is a clear shift in the data paradigm. In the past year, global leading players have almost simultaneously turned their attention to human – centric data: not just larger – scale third – person materials, nor just expensive and scarce real – machine teleoperation, but data that is closer to the real distribution of human operations. And among them, ego – centric data, with the first – person human perspective, real physical interaction, and multi – modal perception at its core, is rapidly becoming the most crucial data collection route.

The reason is that what robots ultimately need to learn is not just to understand the world, but to perform actions correctly in the real physical world. Third – person videos lack details of contact and control, simulations can’t fully cover the long – tail of real – world physics, and pure teleoperation data is expensive and scarce. What is truly scarce is data that is both real enough, detailed enough, and can be produced on a…

Source