Human Archive

Multimodal data provider for robotics and world modeling

Winter 2026 | Active | Industrials | Manufacturing and Robotics | Artificial Intelligence | Robotics | Data Labeling | San Francisco, CA, USA
We’re archiving the physical world for embodied intelligence by collecting and labeling aligned multimodal data. To build dexterous and perceptive robots that generalize robustly, we need massive amounts of real-world data across multiple modalities and environments. We have thought deeply about the fine line between biomimicry and its application to humanoid systems. Based on this research, we design and deploy custom hardware across residential and manufacturing settings. We then post-process the resulting data through internal QA, anonymization, and annotation pipelines to deliver diverse, high-fidelity datasets at scale to frontier labs developing robotics foundation models and to general-purpose robotics companies. We believe we are at a historic inflection point, with a unique opportunity to make a dent in humanity and reshape physical labor markets forever. That's why our team dropped out of Stanford and Berkeley and moved to Asia to collect the world’s largest annotated multimodal dataset.

Verdict

High Signal
Market Opportunity
Robotics training data is a massive and fast-growing market as humanoid robot companies (Figure, 1X, Physical Intelligence, etc.) race to build foundation models and desperately need diverse real-world multimodal datasets. The ICP is clear: frontier robotics labs and general-purpose robotics companies. Monetization path via data licensing/sales is well understood. TAM is easily $1B+ given the scale of capital flowing into embodied AI.
Medium Signal
Founder Signal
Four young founders from Berkeley/Stanford with limited real work experience. Rushil (Berkeley MET) had a PM internship at Coinbase and a prior acquired startup with $25k MRR, the most substantive signal on the team. Samay (Berkeley EECS, on leave) worked as an SDE at Amazon for 4 months and did ML work at Lightning AI. Raj is a Berkeley dropout whose primary listed experience is 9 years of farming and mango-selling. Shloke is a current Stanford ME/CS researcher. The team is young and light on direct robotics data industry experience, though technical backgrounds exist.
Medium Signal
Competition
Competitors include Scale AI (dominant data labeling player with robotics focus), Apptronik data initiatives, Physical Intelligence's internal data collection, and other robotics data startups. The differentiation claim — custom hardware deployment in residential and manufacturing settings in Asia for diversity — is plausible but unverified. No proprietary moat is demonstrated yet; Scale AI has massive advantages in infrastructure and enterprise relationships.
Medium Signal
Product
Website shows operational metrics (50,000+ contributor network, 125+ national partnerships, 1,000+ custom rigs), eight video tiles demonstrating multimodal capture (3D pose estimation, hand tracking, stereo depth, tactile sensing), and six industry verticals. Backed by YC, UC Berkeley, Stanford. No pricing, no named customers, no API docs.
Overall: B Tier

Human Archive has a strong market thesis in robotics training data with impressive operational scale indicators: 50,000+ contributor network, 125+ national partnerships, and 1,000+ custom rigs. It is backed by YC, UC Berkeley, and Stanford, and shows no low signals across any dimension. However, there are no named enterprise customers and no published pricing, and the robotics data market has unclear unit economics. The scale metrics are promising and put this above vaporware territory.

Active Founders

Rushil Agarwal
Founder

building multimodal real-world datasets for robotics | prev. UC Berkeley MET (IEOR + Business)

Samay Maini
Founder

Creating multimodal real-world datasets for robotics

Raj Patel
Founder

Archiving the structure of human interaction in the physical world. Berkeley dropout and previous farmer (sold mangoes & planted trees)

Shloke Patel
Founder

building in robotics

Human Archive
Tier: B Tier
Batch: Winter 2026
Team Size: 4
Status: Active
Location: San Francisco, CA, USA
Last Updated: 2 days ago