1
Country: USA | Funding: $781M
ElevenLabs develops voice AI models for content creators and publishers. It has created two platforms: the Agents Platform for improving customer engagement and Creative Platform for creating AI-powered voice content. The platform's capabilities include converting text into realistic speech in over 70 languages, configuring, deploying, and monitoring conversational agents, creating studio-quality tracks in any genre and style, transcribing any audio with the highest accuracy, and creating an exact copy of any voice. All platform capabilities are accessible via an API. ElevenLabs is used to create films, advertising, audiobooks and podcasts.
ElevenLabs develops voice AI models for content creators and publishers. It has created two platforms: the Agents Platform for improving customer engagement and Creative Platform for creating AI-powered voice content. The platform's capabilities include converting text into realistic speech in over 70 languages, configuring, deploying, and monitoring conversational agents, creating studio-quality tracks in any genre and style, transcribing any audio with the highest accuracy, and creating an exact copy of any voice. All platform capabilities are accessible via an API. ElevenLabs is used to create films, advertising, audiobooks and podcasts.
2
Country: USA | Funding: $5.5M
VoiceRun is a platform for developing voice-enabled AI agents. Unlike visual agent-builders, it allows to program voice agent behavior using code, which gives greater flexibility. In addition to creating code-based agents, VoiceRun also allows users to conduct A/B testing and instantly deploy solutions with a single click through the VoiceRun cloud telephony system and swap models instantly, backed by enterprise-grade security. The company is focused on enterprise developers, helping companies, for example, implement AI in their customer support services or assist tech companies in launching voice-based products. For example, the platform is used to create an AI concierge for restaurant reservations.
VoiceRun is a platform for developing voice-enabled AI agents. Unlike visual agent-builders, it allows to program voice agent behavior using code, which gives greater flexibility. In addition to creating code-based agents, VoiceRun also allows users to conduct A/B testing and instantly deploy solutions with a single click through the VoiceRun cloud telephony system and swap models instantly, backed by enterprise-grade security. The company is focused on enterprise developers, helping companies, for example, implement AI in their customer support services or assist tech companies in launching voice-based products. For example, the platform is used to create an AI concierge for restaurant reservations.
3
Country: France | Funding: $70M
Gradium creates audio-language AI models, designed to generate and recognize voices with ultra-low latency. The startup's goal is to create a voice API for developers that works in real time. Its voice cloning feature allows to create custom AI voices, including your own (you simply need to upload a 10-second audio sample for speech synthesis). The models have multilingual support: English, French, German, Spanish and more languages to be added in the future. Gradium competes with leading LLMs (OpenAI, Anthropic, Meta Llama, and Mistral) which are providing voice and speech models. However, Gradium cites that its advantages are hyper-realistic voice reproduction and accuracy.
Gradium creates audio-language AI models, designed to generate and recognize voices with ultra-low latency. The startup's goal is to create a voice API for developers that works in real time. Its voice cloning feature allows to create custom AI voices, including your own (you simply need to upload a 10-second audio sample for speech synthesis). The models have multilingual support: English, French, German, Spanish and more languages to be added in the future. Gradium competes with leading LLMs (OpenAI, Anthropic, Meta Llama, and Mistral) which are providing voice and speech models. However, Gradium cites that its advantages are hyper-realistic voice reproduction and accuracy.
4
Country: South Korea | Funding: $52M
Deepbrain AI develops an online video generator for various specialized applications. It features online video editing studio and a large library of video templates, avatars and voices. Its engine utilizes third-party models from Google, OpenAI and others bigtechs. The service's primary applications are creating realistic AI avatars from photos or video samples, creating product videos (you can simply paste a link to a product description in online store, and the service will generate a promotional video featuring live avatar), dubbing videos with translation into 150+ languages, automatic lip synching and voice cloning. The startup also offers an enterprise version with an API, security measures, collaboration capabilities, automation and dedicated support. Companies use the platform for training, marketing and creative projects.
Deepbrain AI develops an online video generator for various specialized applications. It features online video editing studio and a large library of video templates, avatars and voices. Its engine utilizes third-party models from Google, OpenAI and others bigtechs. The service's primary applications are creating realistic AI avatars from photos or video samples, creating product videos (you can simply paste a link to a product description in online store, and the service will generate a promotional video featuring live avatar), dubbing videos with translation into 150+ languages, automatic lip synching and voice cloning. The startup also offers an enterprise version with an API, security measures, collaboration capabilities, automation and dedicated support. Companies use the platform for training, marketing and creative projects.
5
Country: UK | Funding: $33.2M
Papercup translates videos using AI by generating voices that sound like the original speaker
Papercup translates videos using AI by generating voices that sound like the original speaker
6
Country: South Korea | Funding: $26.8M
Typecast's Online AI Voice Generator Puts Emotions in Text to Speech. It allows to power your video content with realistic AI voices and avatars.
Typecast's Online AI Voice Generator Puts Emotions in Text to Speech. It allows to power your video content with realistic AI voices and avatars.
7
Country: Israel | Funding: $20M
Deepdub provides an AI-based dubbing service for entertainment content.
Deepdub provides an AI-based dubbing service for entertainment content.
8
Country: Canada | Funding: $12M
Resemble AI creates custom voices using proprietary deep-learning models that can produce realistic speech synthesis.
Resemble AI creates custom voices using proprietary deep-learning models that can produce realistic speech synthesis.
9
Country: USA | Funding: $10M
WellSaid Labs has developed the art text-to-speech technology that creates life-like synthetic voice, from the voices of real people.
WellSaid Labs has developed the art text-to-speech technology that creates life-like synthetic voice, from the voices of real people.
10
Country: South Korea | Funding: $6.9M
LOVO is an artificial intelligence (AI) voice & synthetic speech tool developer
LOVO is an artificial intelligence (AI) voice & synthetic speech tool developer
11
Country: USA | Funding: $6M
Voice AI is a real-time speech-to-speech AI voice changing platform and UGC content platform.
Voice AI is a real-time speech-to-speech AI voice changing platform and UGC content platform.
12
Country: USA | Funding: $3.6M
Dubformer is an advanced AI-driven video dubbing solution designed for media and entertainment localization, offering high-quality dubbing and voice-over services with a blend of cutting-edge AI and human quality control.
Dubformer is an advanced AI-driven video dubbing solution designed for media and entertainment localization, offering high-quality dubbing and voice-over services with a blend of cutting-edge AI and human quality control.
13
Country: Ukraine | Funding: $2.5M
We build high fidelity voice cloning (voice conversion) systems using deep neural networks.
We build high fidelity voice cloning (voice conversion) systems using deep neural networks.
15
Country: USA
AppTek combines cutting-edge artificial intelligence research with meaningful and transformative real-world applications. Our team consists of world-leading scientists with an extensive list of patents, innovations and academic publications contributing to the advancement of neural network and machine learning science and technology. Based on our scientific research, our engineering team helps convert these advancements into commercially viable real-world applications that improve the daily life in areas including accessibility, commerce, trade, and communication across languages.
AppTek combines cutting-edge artificial intelligence research with meaningful and transformative real-world applications. Our team consists of world-leading scientists with an extensive list of patents, innovations and academic publications contributing to the advancement of neural network and machine learning science and technology. Based on our scientific research, our engineering team helps convert these advancements into commercially viable real-world applications that improve the daily life in areas including accessibility, commerce, trade, and communication across languages.




















