Alibaba Cloud has led a $290 million investment in startup ShengShu to advance ‘general world models’ using video and physical data. The move signals a strategic pivot beyond text-based LLMs, targeting applications in robotics, autonomous driving, and digital-physical integration.
Alibaba Cloud has placed a $290 million bet on ShengShu, a three-year-old startup behind the AI video tool Vidu, according to an official announcement on Friday. The funding targets the development of a ‘general world model’, an AI architecture trained on multimodal inputs including vision, audio, and touch, rather than text alone.
ShengShu’s founder Zhu Jun stated the goal is to connect perception and action, enabling consistent modeling of real-world behavior. This contrasts with large language models like ChatGPT, which struggle to replicate physical dynamics. The startup aims to bridge digital environments (games, generated video) with physical systems (autonomous driving, robotics).
As LLMs approach diminishing returns in physical reasoning, world models offer a pathway for AI to simulate and predict real-world causality, critical for industrial automation and embodied AI. Impacted sectors include autonomous vehicles, industrial robotics, smart manufacturing, and simulation software.
For enterprises, this signals a shift in AI R&D spend toward multimodal, physically grounded systems, potentially reshaping competitive advantage in logistics, automation, and digital twins.
About GlobeNewsInfo
GlobeNewsInfo.com is a business news platform providing latest updates on global business developments, projects, and contract opportunities across diverse sectors and regions. The platform is designed to serve as a trusted source of information for companies, investors, and professionals worldwide.