1
Country: USA | Funding: $1.1B
Luma AI is a generative platform for creating high-quality videos. The company's flagship Ray3 model is designed specifically for storytelling. The company claims it can think and reason visually, taking into account physical laws and scene coherence. Its Draft Mode enables rapid exploration of new ideas. Ray3 offers realistic output, image-to-video conversion, keyframing and editing controls. Content generation is available through a web interface and a mobile app. Companies can use the Luma API to access image and video generation features through an easy-to-use endpoint. The company also enables the generation of 3D models from simple text or photographs. Luma AI conducts fundamental research into multimodal artificial general intelligence (AGI).
2
Country: UK
DeepMind was acquired by Google as a leading ML startup and now acts as its mostly independent AI division. Its initial goal was to create artificial general intelligence, which the developers hoped to achieve by training systems to play games (Atari, Go, chess, StarCraft) and control physical robots. DeepMind gradually took over Google's major AI projects, including the Gemini LLM and chatbot. DeepMind was one of the first to develop multimodal intelligence, including language, code, images, video and 3D world generation. Furthermore, the company continues to build specialized machine learning systems for various industries, including medicine (drug development) and energy (energy optimization for Google's data centers).
3
Country: UK | Funding: $1.3B
Wayve is developing end-to-end autonomous driving software, Wayve AI Driver, based on a proprietary AI model. This approach, which the company calls AV2.0, replaces the modular "sense-plan-act" architecture of traditional AV1.0 systems with a single neural network trained on diverse data to transform raw sensor inputs into safe driving outputs. AV2.0 learns driving skills from raw, unlabeled data using self-supervised learning, eliminating the need for costly and time-consuming curation of labeled datasets. Wayve also uses generative world models for training, creating rich and realistic synthetic scenarios. Wayve's technology does not require HD maps, allowing it to be easily scaled to new roads and cities. According to the company, Wayve AI Driver is sensor- and hardware-independent and compatible with any type of vehicle.
4
Country: USA | Funding: $230M
World Labs develops spatially intelligent AI world models that are capable of perceiving, generating, reasoning about and interacting with the 3D world. The company believes that spatial intelligence will unlock new forms of storytelling, creativity, design, modeling and immersive experiences in both virtual and physical worlds. Its first product, Marble, is based on generative 3D world models and enables anyone to create spatially consistent, highly accurate and robust 3D worlds using just a single image, video or text prompt. World Labs' founders include AI pioneer Fei-Fei Li and other world-renowned experts in machine learning, generative AI and computer vision.
5
Country: USA | Funding: $133.7M
General Intuition is a spinoff of Medal, a platform for uploading and sharing video game clips. The startup uses this vast collection of game videos to train foundation models and AI agents capable of understanding how objects and entities move through space and time. According to the company, its model can understand environments it hasn't been trained on and correctly predict actions within them. It does this solely based on visual input: the agents see only what the human player sees and navigate space using controller inputs. According to the company, this approach can be naturally transferred to physical systems such as robotic arms, drones and autonomous vehicles, which people often control with video game controllers.
6
Country: USA | Funding: $27M
Odyssey uses AI to create world models for film, gaming, and beyond.
7
Country: USA | Funding: $25M
Spline offers 3D design and collaboration tools to create and manipulate 3D models and environments.
8
Country: UK | Funding: $13M
SpAItial is developing Spatial Foundation Models (SFMs), an AI paradigm designed to generate and reason about the appearance and physics of real and imagined environments.
9
Country: Austria | Funding: €160K
BeViAI 3D transforms images into high-quality 3D models in minutes. Designed for eCommerce, gaming, and digital content creation, it removes the complexity of traditional 3D modeling. The AI-driven process delivers optimized, ready-to-use assets for product visualization, game development, and AR/VR applications.
10
Country: Austria
Atlas empowers the next generation of content creators to build virtual worlds in a fraction of the time with cutting-edge generative 3D AI.















