1
Country: USA | Funding: $47M
Nimble is developing a platform that uses AI agents to search the web in real time, verify and validate results, and structure the information into tables that can then be queried like a database. The platform integrates with enterprise data warehouses and data lakes offered by companies like Databricks and Snowflake. This means its AI agents can connect to a company's massive data set, using it to create context and shape the structure and display of search results. These integrations also allow Nimble's software to learn constraints - for example, how a search should be performed or which data sources to use. This is particularly useful for applications such as competitor analysis, pricing research, know-your-customer processes, brand monitoring, in-depth research, and financial analysis.
Nimble is developing a platform that uses AI agents to search the web in real time, verify and validate results, and structure the information into tables that can then be queried like a database. The platform integrates with enterprise data warehouses and data lakes offered by companies like Databricks and Snowflake. This means its AI agents can connect to a company's massive data set, using it to create context and shape the structure and display of search results. These integrations also allow Nimble's software to learn constraints - for example, how a search should be performed or which data sources to use. This is particularly useful for applications such as competitor analysis, pricing research, know-your-customer processes, brand monitoring, in-depth research, and financial analysis.
2
Country: USA | Funding: $255M
Fundamental develops AI models to extract useful insights from the massive volumes of structured data generated by large enterprises. It combines legacy predictive AI systems with more modern tools. Unlike traditional LLMs, Fundamental's large table model, called Nexus, is deterministic - meaning it produces the same answer every time it's asked a given question, and does not use the Transformer architecture. Because Transformer-based AI models can only process data within their context window, they often struggle to analyze extremely large datasets, for example, a spreadsheet with billions of rows. However, such huge structured datasets are common in large enterprises, creating significant opportunities for models capable of handling such scale.
Fundamental develops AI models to extract useful insights from the massive volumes of structured data generated by large enterprises. It combines legacy predictive AI systems with more modern tools. Unlike traditional LLMs, Fundamental's large table model, called Nexus, is deterministic - meaning it produces the same answer every time it's asked a given question, and does not use the Transformer architecture. Because Transformer-based AI models can only process data within their context window, they often struggle to analyze extremely large datasets, for example, a spreadsheet with billions of rows. However, such huge structured datasets are common in large enterprises, creating significant opportunities for models capable of handling such scale.
3
Country: USA | Funding: $25.8B
Databricks is developing cloud-based platform Databricks Data Intelligence for working with Big Data and machine learning. It brings together everything needed for the full data cycle: storage, processing, analysis, model creation, training and deployment. It includes MLflow - an open-source system for managing machine learning models, experiment tracking and automated deployment. The platform provides interactive notebooks (like Jupyter) for writing code in Python, SQL, R and Scala and running it on clusters. Databricks allows to develop your own models, AI agents and generative AI applications based on company's own data.
Databricks is developing cloud-based platform Databricks Data Intelligence for working with Big Data and machine learning. It brings together everything needed for the full data cycle: storage, processing, analysis, model creation, training and deployment. It includes MLflow - an open-source system for managing machine learning models, experiment tracking and automated deployment. The platform provides interactive notebooks (like Jupyter) for writing code in Python, SQL, R and Scala and running it on clusters. Databricks allows to develop your own models, AI agents and generative AI applications based on company's own data.
4
Country: USA | Funding: $1.2B
Dataminr is an artificial intelligence platform for real-time event and risk detection.
Dataminr is an artificial intelligence platform for real-time event and risk detection.
5
Country: France | Funding: $846.8M
Dataiku develops Data Science Studio, the tool that lets data scientists and analysts do machine learning on any (dirty) data.
Dataiku develops Data Science Studio, the tool that lets data scientists and analysts do machine learning on any (dirty) data.
6
Country: UK | Funding: $545M
Quantexa has built a machine learning platform branded “Contextual Decision Intelligence” (CDI) that analyses disparate data points to get better insight into nefarious activity
Quantexa has built a machine learning platform branded “Contextual Decision Intelligence” (CDI) that analyses disparate data points to get better insight into nefarious activity
7
Country: USA | Funding: $415.1M
WEKA is a global data platform company that delivers a cloud-native, software-based data platform for AI & next-generation workloads
WEKA is a global data platform company that delivers a cloud-native, software-based data platform for AI & next-generation workloads
8
Country: France | Funding: $399.3M
C3 IoT delivers a complete platform as a service for rapidly developing and operating big data, predictive analytics, AI, and IoT software applications. By leveraging telemetry, elastic cloud computing, analytics, and machine learning, C3 IoT brings the power of predictive insights to any business value chain.
C3 IoT delivers a complete platform as a service for rapidly developing and operating big data, predictive analytics, AI, and IoT software applications. By leveraging telemetry, elastic cloud computing, analytics, and machine learning, C3 IoT brings the power of predictive insights to any business value chain.
9
Country: USA | Funding: $381M
VAST Data offers a unified data platform that integrates storage, database, and compute capabilities into a single software platform.
VAST Data offers a unified data platform that integrates storage, database, and compute capabilities into a single software platform.
See also:
Top 10 Medical Big Data startups
Top 10 Medical Big Data startups
10
Country: USA | Funding: $328M
Clarify Health Solutions develops AI analytics software solutions that enable health systems to deliver more satisfying, better outcome and higher value care
Clarify Health Solutions develops AI analytics software solutions that enable health systems to deliver more satisfying, better outcome and higher value care
11
Country: USA | Funding: $237M
Primer builds machines that can read and write, automating the analysis of very large datasets. It trawls thousands of sources online, using natural language processing to read and try to make sense of the vast amount of “open source intelligence”.
Primer builds machines that can read and write, automating the analysis of very large datasets. It trawls thousands of sources online, using natural language processing to read and try to make sense of the vast amount of “open source intelligence”.
12
Country: Singapore | Funding: $201.5M
Atlan is an active metadata platform crafted for data teams that integrates metadata from diverse sources.
Atlan is an active metadata platform crafted for data teams that integrates metadata from diverse sources.
13
Country: USA | Funding: $192.6M
Gloat is an AI-based anonymous career development platform allowing users to both know their worth and get concrete offers in real time.
Gloat is an AI-based anonymous career development platform allowing users to both know their worth and get concrete offers in real time.
14
Country: Israel | Funding: $125.1M
Explorium offers a new breed of data science platform, fueled by Automated Data and Feature Discovery. By dynamically integrating a company’s internal data with thousands of external sources, the Explorium platform is able to extract the most relevant features and power superior machine learning models.
Explorium offers a new breed of data science platform, fueled by Automated Data and Feature Discovery. By dynamically integrating a company’s internal data with thousands of external sources, the Explorium platform is able to extract the most relevant features and power superior machine learning models.
15
Country: Japan | Funding: $104.4M
ABEJA develops AI business solutions utilizing deep learning.
ABEJA develops AI business solutions utilizing deep learning.
16
Country: Canada | Funding: $102.3M
Through the application of machine learning and artificial intelligence technologies, the MindBridge platform detects anomalous patterns of activities, unintentional errors and intentional financial misstatements. Using the MindBridge AI Auditor, organizations across multiple industries can minimize financial loss
Through the application of machine learning and artificial intelligence technologies, the MindBridge platform detects anomalous patterns of activities, unintentional errors and intentional financial misstatements. Using the MindBridge AI Auditor, organizations across multiple industries can minimize financial loss
17
Country: USA | Funding: $102M
Continual is a startup that aims to bring operational AI to the modern data warehouse-centric data stack
Continual is a startup that aims to bring operational AI to the modern data warehouse-centric data stack
18
Country: Israel | Funding: $99M
Navina develops an AI-driven platform that restructures patient data into intuitive patient portraits for better diagnoses and care.
Navina develops an AI-driven platform that restructures patient data into intuitive patient portraits for better diagnoses and care.
19
Country: USA | Funding: $84.8M
Samasource creates trusted platform for expert, ethical training data.
Samasource creates trusted platform for expert, ethical training data.
20
Country: USA | Funding: $84.8M
The training data platform trusted by the world's most ambitious organization to develop accurate machine learning models.
The training data platform trusted by the world's most ambitious organization to develop accurate machine learning models.
21
Country: USA | Funding: $77.9M
Qubole delivers a Self-Service Platform for Big Data Analytics built on Amazon, Microsoft, Google and Oracle Clouds.
Qubole delivers a Self-Service Platform for Big Data Analytics built on Amazon, Microsoft, Google and Oracle Clouds.
22
Country: USA | Funding: $73M
WisdomAI offers enterprise AI analytics chatbot that answers business-related questions based on structured and unstructured data (as promised - without hallucinations). The company uses a clever method to address the problem of hallucinations. It doesn't use LLM to write answers, instead LLM is used only to write the query code that will be sent to the data warehouse. Therefore, if the LLM is hallucinating, it will simply write a wrong query and the user will recognize this from the result. WisdomAI has developed its own logic, which it calls the "enterprise context layer" and examines customer data to understand it and generate query code.
WisdomAI offers enterprise AI analytics chatbot that answers business-related questions based on structured and unstructured data (as promised - without hallucinations). The company uses a clever method to address the problem of hallucinations. It doesn't use LLM to write answers, instead LLM is used only to write the query code that will be sent to the data warehouse. Therefore, if the LLM is hallucinating, it will simply write a wrong query and the user will recognize this from the result. WisdomAI has developed its own logic, which it calls the "enterprise context layer" and examines customer data to understand it and generate query code.
23
Country: USA | Funding: $69.2M
Tamr's patented software fuses the power of machine learning with your knowledge of your data to automate the rapid unification of data silos.
Tamr's patented software fuses the power of machine learning with your knowledge of your data to automate the rapid unification of data silos.
24
Country: Canada | Funding: $49.6M
Integrate.ai is building an AI powered platform for B2C enterprises that integrates with business processes to make customer interactions more natural and valuable.
Integrate.ai is building an AI powered platform for B2C enterprises that integrates with business processes to make customer interactions more natural and valuable.
25
Country: Israel | Funding: $49M
DataLoop develops data management and annotation platform that streamlines the process of creating quality machine learning and AI-ready datasets from unstructured data. Dataloop's dataset browser enables visualization of unstructured data at any scale, simplifying data exploration and improving decision making. It's data management capabilities support querying, versioning and curation of all types of unstructured data. The platform scales data to millions of individual elements of video, images, audio and other formats. Dataloop also simplifies the integration of human feedback into the AI development process. Platform use cases include active learning workflows, validating GenAI, building AI Agents, building RAG workflows, DataOps and LiDAR data annotation. Dataloop also provides a marketplace of pre-created AI workflows, allowing teams to jumpstart their development with hundreds of pre-built pipeline templates, great datasets and the latest models.
DataLoop develops data management and annotation platform that streamlines the process of creating quality machine learning and AI-ready datasets from unstructured data. Dataloop's dataset browser enables visualization of unstructured data at any scale, simplifying data exploration and improving decision making. It's data management capabilities support querying, versioning and curation of all types of unstructured data. The platform scales data to millions of individual elements of video, images, audio and other formats. Dataloop also simplifies the integration of human feedback into the AI development process. Platform use cases include active learning workflows, validating GenAI, building AI Agents, building RAG workflows, DataOps and LiDAR data annotation. Dataloop also provides a marketplace of pre-created AI workflows, allowing teams to jumpstart their development with hundreds of pre-built pipeline templates, great datasets and the latest models.
26
Country: USA | Funding: $45M
Tonic mimics your production data to create safe, useful, de-identified data for QA, testing, and development.
Tonic mimics your production data to create safe, useful, de-identified data for QA, testing, and development.
27
Country: USA | Funding: $44M
Coactive AI is a machine learning platform that unlocks analytics and insights from unstructured image and video data.
Coactive AI is a machine learning platform that unlocks analytics and insights from unstructured image and video data.
28
Country: Bulgaria | Funding: $40.6M
Noble.AI is a software company focused on building tools that accelerate discovery in R&D.
Noble.AI is a software company focused on building tools that accelerate discovery in R&D.
29
Country: India | Funding: $35M
Entropik is a Human Insights AI company that specializes in consumer and user research.
Entropik is a Human Insights AI company that specializes in consumer and user research.
30
Country: USA | Funding: $30M
Heartex offers a data labeling and annotations tool for building accurate and smart AI products.
Heartex offers a data labeling and annotations tool for building accurate and smart AI products.
31
Country: Croatia | Funding: $18.2M
Memgraph makes creating real-time streaming graph applications accessible to every developer.
Memgraph makes creating real-time streaming graph applications accessible to every developer.
32
Country: Israel | Funding: $16M
BeaconCure develops AI text Analytics for clinical data. It created a unique text analytics technology for specifically for clinical trial data with the support of a top pharmaceutical multinational as design partner.
BeaconCure develops AI text Analytics for clinical data. It created a unique text analytics technology for specifically for clinical trial data with the support of a top pharmaceutical multinational as design partner.
34
Country: Germany
MAPEGY analyzes the up-to-date global innovation data universe and delivers key insights in a range of intuitive formats, from graphical dashboards and search functions, to custom reports and newsletters
MAPEGY analyzes the up-to-date global innovation data universe and delivers key insights in a range of intuitive formats, from graphical dashboards and search functions, to custom reports and newsletters
35
Country: USA | Funding: $7.9M
Datasaur develops productive tools designed for data labeling needs. It enables users to assign multiple teammates to the same projects and use its specialized review tool to quickly identify where they disagree. Also, its built-in intelligence catches costly errors.
Datasaur develops productive tools designed for data labeling needs. It enables users to assign multiple teammates to the same projects and use its specialized review tool to quickly identify where they disagree. Also, its built-in intelligence catches costly errors.
36
Country: Russia | Funding: $7.7M
Rubbles offers big data analytics and tailor-made IT-solutions empowered by machine learning and artificial intelligence.
Rubbles offers big data analytics and tailor-made IT-solutions empowered by machine learning and artificial intelligence.
37
Country: USA | Funding: $1.3M
Datrics democratizes the creation of self-service analytics and machine learning solutions by providing an intuitive drag-n-drop interface.
Datrics democratizes the creation of self-service analytics and machine learning solutions by providing an intuitive drag-n-drop interface.
38
Country: Canada
Humanata is a big data analytics platform that can help you make data-driven decisions by analyzing vast amounts of data to identify patterns and trends. With its AI tools, you can measure the impact of your programs and services more effectively, demonstrating your work's effectiveness and attracting more funding.
Humanata is a big data analytics platform that can help you make data-driven decisions by analyzing vast amounts of data to identify patterns and trends. With its AI tools, you can measure the impact of your programs and services more effectively, demonstrating your work's effectiveness and attracting more funding.
39
Country: USA
Cogito is the industry leader in data labeling and annotation services to provide the training data sets for AI and machine learning model developments. All types of AI and ML services requires the training data for algorithms with next level of accuracy making AI possible into diverse fields like healthcare, gaming, agriculture, retail, automotive, robotics and security surveillance etc.
Cogito is the industry leader in data labeling and annotation services to provide the training data sets for AI and machine learning model developments. All types of AI and ML services requires the training data for algorithms with next level of accuracy making AI possible into diverse fields like healthcare, gaming, agriculture, retail, automotive, robotics and security surveillance etc.
40
Country: Ukraine
datuum.ai helps with data migration, M&A processes by recognizing data in sources, mapping it to the final database, and automatically ETL pipelines generation. It reduces 80% of Data Engineers’ time.
datuum.ai helps with data migration, M&A processes by recognizing data in sources, mapping it to the final database, and automatically ETL pipelines generation. It reduces 80% of Data Engineers’ time.
41
Country: India
TagX is an Industry-leading data annotation/labeling Company creating high-quality data assets for Artificial Intelligence leveraging AI and humans in the loop. By learning from the data we create AI solutions for industries to maximize profits and reduce downtimes.
TagX is an Industry-leading data annotation/labeling Company creating high-quality data assets for Artificial Intelligence leveraging AI and humans in the loop. By learning from the data we create AI solutions for industries to maximize profits and reduce downtimes.














































