Humyn Labs expands data collection for physical AI with $20 mn push

Agnidev Bhattacharya
2 min read13 Apr 2026, 01:54 PM IST
logo
The company is expanding its voice data infrastructure to encompass 33 languages, dialects, accents, and code-switching patterns. (istockphoto)
Summary
The firm focuses on recording first-person human activity, visuals, and movements. This data collection occurs within commercial, agricultural, and residential environments. The datasets capture how humans physically interact with surroundings to train physical AI systems

Artificial intelligence firm Humyn Labs has committed $20 million to expand data collection operations across India, Southeast Asia, Latin America, and the West Asia, the company said in a press release on Monday.

The capital will finance the infrastructure required to train physical artificial intelligence systems (robots) and voice models. Co-founders Manish Agarwal and Ishank Gupta channeled the investment to organize and validate human intelligence, the release said.

Agarwal told Mint that Humyn, still in its early stages and not backed by any institutional or venture capital, is leveraging its revenue to invest $20 million into building high-quality datasets for physical AI, focusing on egocentric and conversational voice data. Egocentric in artificial intelligence refers to data from a first-person view captured by a human or agent while interacting with its surroundings.

Also Read | Mint Quick Edit | TCS’s results reveal resilience in the face of an AI threat

The firm focuses on source-first data collection, recording first-person human activity, visuals, and movements. This data collection occurs within commercial, agricultural, and residential environments. The datasets capture how humans navigate and physically interact with surroundings to train physical AI systems, Agarwal explained.

The company is expanding its voice data infrastructure to encompass 33 languages, dialects, accents, and code-switching patterns. This expansion addresses the utilization of voice for real-world commands and human-robot interactions.

Humyn Labs will establish robotics labs to construct simulation environments and world models. This unit integrates real-world data with training frameworks to deploy physical AI systems. In AI systems, a world model is internal representation of how the real world works. For robotics labs, these world models are typically used to let robots learn in simulation before actual deployment.

Also Read | TCS payouts to Tata Sons fall as acquisitions, AI push weigh on cash flows

Humyn AI utilizes a decentralized network across the global south to source data. The current pipeline is driven by roughly 15–18 customers. While Agarwal did not disclose any client names, he said that it targets top-tier labs where successful proof-of-concepts can quickly scale into $10–$15 million contracts.

The company’s growth is currently uneven. "While we have delivered approximately $2 million in revenue over the last few months, we are operating at an annualized run rate of around $4–5 million," Agarwal said.

Agarwal said that Humyn has a sales pipeline of around $45–50 million. "Based on current execution, we expect to reach a $100 million Annual Recurring Revenue (ARR) by the end of December 2026."

Also Read | The use of AI in combat has only thickened the fog of war

According to market research firm Fortune Business Insights, the global AI training dataset market size was valued at $3.59 billion in 2025, and is projected to grow from $4.44 billion in 2026 to $23.18 billion by 2034, with the India market projected be worth $190 million in 2026.

About the Author

Agnidev is a business journalist with over two years of reporting experience tracking the intersection of capital, policy, and corporate strategy in India.<br><br>He joined Mint in December 2025, after a stint at NDTV Profit (erstwhile BQ Prime). At Mint, Agnidev focuses on the high-stakes world of the Indian capital market, specialising in mergers and acquisitions, burgeoning IPOs, and the investment banking industry.<br><br>Backed by a rigorous, data-driven approach, Agnidev frequently breaks news on the valuation cycles, deal pipelines and listing strategies of India’s most prominent companies. His reportage offers deep dives into the operational health of market leaders across the corporate landscape, providing readers with a clear-eyed view of institutional growth.<br><br>He has reported on major issues like India's derivatives frenzy, IPO froth, the competitive quick commerce industry, the real-money gaming ban, and has broken investigative stories related to scandals such as IndusInd Bank's accounting manipulation and the Gensol-BluSmart fiasco.<br><br>As a reporter, he brings stories that ultimately affect your stock market investments, and tries to bring clarity and brevity in a field that is often filled with jargon and noise.

Catch all the Business News , Corporate news , Breaking News Events and Latest News Updates on Live Mint. Download The Mint News App to get Daily Market Updates.

More