Foundation Models Companies
Explore 43 Foundation Models companies in our AI directory. Leading companies include Apple AI, Amazon AI, IBM watsonx.
Apple AI
Cupertino, United States
Apple AI focuses on the research and development of on-device artificial intelligence, primarily through its MLX framework – an Apple silicon-optimized toolkit for developing and deploying large language models. Targeting AI researchers and developers, MLX enables private, efficient LLM experimentation and deployment directly on Apple hardware, bypassing reliance on cloud infrastructure. This positions Apple AI as a provider of both foundational ML technology and a secure, localized AI development environment.
Amazon AI
Seattle, United States
Amazon AI provides a comprehensive suite of cloud-based artificial intelligence services and infrastructure through AWS, including the development of foundation models like Titan and the SageMaker AI platform. Their core offering centers on enabling organizations to build, train, and deploy agentic AI applications at scale, leveraging specialized infrastructure and a robust data foundation. Targeting businesses across all industries, Amazon AI differentiates itself by offering end-to-end capabilities – from model development to data governance – with a focus on cost optimization and enterprise-grade security.
IBM watsonx
Armonk, United States
IBM watsonx is an enterprise AI platform offering foundation models – including open-source options – and governance tools designed to accelerate the deployment of generative AI. The platform focuses on enabling businesses to build, deploy, and monitor AI assistants and agents across hybrid cloud environments. watsonx targets large organizations seeking to integrate AI into core workflows while maintaining control, compliance, and data security.
xAI
San Francisco, United States
xAI is a US-based artificial intelligence company focused on developing and deploying large language models with a stated mission of advancing scientific discovery. Their primary product is Grok, a generative AI model accessible via API, web, and mobile platforms, offering enhanced speed, precision, and multilingual capabilities. xAI targets both individual users seeking an AI assistant and developers looking to integrate advanced LLM functionality into their applications, as demonstrated by their API offerings and recent partnership with the government of El Salvador.
Moonshot AI
Beijing, China
Moonshot AI is a Chinese artificial intelligence company specializing in the development of large language models. Their core product, Kimi, is a long-context LLM distinguished by its capacity to process exceptionally large input sequences – reportedly millions of tokens – enabling advanced applications in complex document analysis and information retrieval. This capability positions Moonshot AI to serve organizations requiring processing of extensive datasets, such as legal firms, research institutions, and financial analysts.
Cerebras Systems
Sunnyvale, United States
Cerebras Systems develops AI hardware, specifically the Wafer Scale Engine, a large-scale chip designed to accelerate deep learning workloads. Unlike traditional GPUs, Cerebras’ technology aims to significantly reduce the time and cost associated with training complex AI models. Their target market is organizations requiring high-performance computing for demanding AI applications, such as large language models and scientific computing.
Minimax
Shanghai, China
Minimax builds multimodal AI including text, voice, and video generation models for consumer and enterprise applications.
NotCo
Santiago, Chile
NotCo is a food technology company that leverages a proprietary AI platform, Giuseppe, to accelerate and optimize the development of plant-based food products. Their AI utilizes a high-fidelity data corpus and multi-objective optimization to rapidly explore formulation possibilities, addressing constraints related to cost, nutrition, sensory appeal, and manufacturability. NotCo targets food manufacturers seeking to innovate and scale plant-based alternatives more efficiently than traditional R&D methods allow.
VAST Data
New York, United States
VAST Data provides an AI Operating System that unifies storage, database, and compute resources into a single, orchestrated platform. Built on their Disaggregated, Autonomous, Storage Engine (DASE) architecture, the system is designed to eliminate data bottlenecks and deliver terabyte-per-second performance to large-scale GPU clusters. VAST Data targets enterprises deploying demanding, data-intensive AI and agentic computing workloads, offering a solution focused on scalability, performance, and reduced total cost of ownership.
Physical Intelligence
San Francisco, United States
Physical Intelligence is building foundation models for physical AI, enabling robots to learn general-purpose manipulation skills.
Weka
Campbell, United States
Weka provides a high-performance data platform, NeuralMesh, designed to accelerate AI and machine learning workloads. Their technology utilizes a distributed, parallel file system to extend GPU memory by up to 1000x and significantly reduce AI response times – demonstrated by a 20x reduction in time-to-first-token. Weka targets organizations building and scaling demanding agentic AI applications, offering a solution to maximize GPU utilization and improve the economics of AI infrastructure, with deployment options including Oracle Cloud.
Sky Mavis
Ho Chi Minh City, Vietnam
Sky Mavis is a Vietnam-based game development studio and blockchain technology company building play-to-earn games and supporting infrastructure. Their flagship product, Axie Infinity, utilizes non-fungible tokens (NFTs) and blockchain technology to create a player-owned digital ecosystem. Sky Mavis also develops Ronin, an Ethereum Virtual Machine (EVM) blockchain specifically designed to scale games with player-owned economies and reduce transaction fees.
Skild AI
Pittsburgh, United States
Skild AI develops scalable robot brain models that work across different robots and tasks. Raised $300M Series A at $1.5B valuation.
K Health
New York, United States
K Health is a U.S.-based healthcare company delivering virtual primary care services enhanced by an AI-powered diagnostic and intake tool. Their core technology leverages a large dataset of medical information to provide preliminary assessments and insights to clinicians, facilitating more efficient and data-driven patient consultations. K Health targets health systems and individuals seeking accessible, 24/7 primary care that integrates with existing in-person care networks to address access and continuity challenges.
Liquid AI
Cambridge, United States
Liquid AI develops efficient foundation models based on liquid neural networks, a new AI architecture from MIT research.
Covariant
Emeryville, United States
Covariant develops AI-powered robotic automation solutions for warehouse and logistics environments. Their core technology is the Robotics Foundation Model (RFM-1), a universal AI trained on a large dataset of warehouse robotics data, enabling robots to handle a wide variety of SKUs without specialized programming. Covariant targets large retailers and logistics providers seeking to improve operational flexibility and address labor challenges through adaptable, AI-driven automation.
Paige AI
New York, United States
Paige AI develops AI-powered tools for pathology, focusing on improving cancer diagnosis and biomarker discovery. Their core technology leverages large foundation models trained on vast pathology datasets to assist pathologists in detecting cancer, subtyping tumors, and identifying molecular markers directly from tissue samples. Paige targets pathology labs and healthcare providers, aiming to increase diagnostic accuracy and efficiency while alleviating workload pressures.
BioAge Labs
Richmond, United States
BioAge Labs is a US-based biotechnology company focused on developing therapeutics for cardiometabolic diseases by targeting the biological processes of aging. Their core technology is a human-first discovery platform that leverages decades of proprietary multi-omics data to identify and validate novel aging-related drug targets. Currently, BioAge is advancing BGE-102, a program targeting chronic inflammation, and a preclinical apelin receptor agonist with potential for significant weight loss, both aimed at addressing age-related metabolic dysfunction.
Writer
San Francisco, United States
Writer provides full-stack generative AI platform for enterprises with Palmyra LLMs, AI guardrails, and Knowledge Graph.
Poolside
Paris, France
Poolside is a French foundation model company developing AI agents specifically for software development and deployment within enterprise environments. Their core technology focuses on building and operationalizing large language models to enhance developer productivity and automate coding tasks. Unlike many AI providers, Poolside prioritizes data security and control by offering on-premise, VPC, or workstation deployment options, targeting organizations with stringent compliance and security requirements.
Supabase
San Francisco, United States
Supabase is a backend-as-a-service platform built on top of PostgreSQL, providing developers with a comprehensive suite of tools including authentication, real-time subscriptions, and serverless functions. A key differentiator is its integrated vector database capabilities, enabling efficient storage, indexing, and search of vector embeddings for AI and machine learning applications. Supabase targets developers building full-stack applications requiring a scalable database solution with native AI functionality, offering an open-source alternative to platforms like Firebase.
Neon
San Francisco, United States
Neon is a serverless Postgres database provider offering autoscaling and pay-as-you-go pricing for application development. Their platform uniquely integrates pgvector, enabling efficient storage and retrieval for AI-powered applications utilizing vector embeddings. Neon targets developers building scalable applications, particularly those leveraging AI and machine learning, by simplifying database operations and reducing infrastructure management overhead.
Merantix
Berlin, Germany
Merantix is a German AI venture studio that develops and launches independent AI-driven companies. They focus on applied AI research and development, building solutions across multiple industries rather than offering a single product. Merantix targets opportunities where AI can deliver significant impact, functioning as both a creator and investor in these new ventures.
Contextual AI
Mountain View, United States
Contextual AI builds RAG-native language models designed specifically for enterprise knowledge work and document understanding.
Weaviate
Amsterdam, Netherlands
Weaviate is an open-source vector database that enables developers to build AI-native applications with semantic search capabilities. The platform simplifies AI infrastructure by handling vector embeddings, ranking, and auto-scaling, allowing users to connect custom or pre-built machine learning models without complex data pipelines. Weaviate targets developers and enterprises seeking a scalable, secure, and vendor-agnostic solution for knowledge graphs, recommendation engines, and other AI-powered applications.
Arize AI
Berkeley, United States
Arize AI provides a unified platform for monitoring and evaluating Large Language Model (LLM) applications, from development through production deployment. Their technology focuses on LLM observability and agent evaluation, offering tools to analyze performance, identify issues, and close the feedback loop between development and real-world data. Arize AI targets organizations building and scaling AI applications, enabling data-driven iteration and improved reliability at scale.
Reka AI
San Francisco, United States
Reka AI builds multimodal foundation models that can understand text, images, video, and audio in a unified architecture.
Essential AI
San Francisco, United States
Essential AI, founded by former Google researchers, builds enterprise-focused foundation models optimized for business workflows.
Bioptimus
Paris, France
Bioptimus is building universal AI foundation models for biology. Their models understand biological systems to accelerate drug discovery and healthcare innovation.
Wombo
Toronto, Canada
Wombo develops consumer AI applications and has expanded into distributed computing infrastructure. Their w.ai platform allows users to passively earn income by securely contributing their device’s idle processing power to a decentralized network. This network supports AI model training and aims to democratize access to computational resources for the development of artificial intelligence, backed by NVIDIA.
Conjecture
London, United Kingdom
Conjecture is a UK-based AI research company developing a novel AI architecture focused on verifiable safety and control. Their core technology centers on building AI systems with inherent transparency and predictability, moving beyond traditional black-box approaches. This positions Conjecture to serve organizations and researchers prioritizing robust AI safety and alignment, particularly as advanced AI capabilities continue to develop.
Twelve Labs
San Francisco, United States
Twelve Labs is a US-based AI company specializing in comprehensive video understanding through its foundational model, Marengo. Marengo delivers industry-leading performance in semantic video search and analysis, exceeding the capabilities of existing cloud-based and open-source alternatives. The company targets enterprise clients requiring advanced video intelligence for applications like content discovery, asset management, and automated video workflows.
Neptune.ai
Warsaw, Poland
Neptune.ai provides experiment tracking and model registry software specializing in the monitoring of large-scale foundation models. Their platform uniquely focuses on visualizing and debugging per-layer metrics – including losses, gradients, and activations – at scale, enabling faster identification of training instabilities. Targeting AI research and infrastructure teams working with models exceeding billions of parameters, Neptune.ai offers both cloud and on-premise deployment options for comprehensive model monitoring.
Saidot
Helsinki, Finland
Saidot is a Finnish SaaS provider specializing in AI governance solutions. Their platform enables organizations to document, assess, and monitor AI systems to ensure responsible development and adherence to the forthcoming EU AI Act. Saidot targets businesses and public sector entities requiring demonstrable AI compliance and risk management capabilities within the European regulatory landscape.
Redwood Research
Berkeley, United States
Redwood Research is a US-based AI safety nonprofit focused on mitigating catastrophic risks from advanced AI systems. Their core research centers on “AI control” protocols – techniques for reliably monitoring and preventing subversion by potentially deceptive large language models, even when those models intentionally conceal misaligned intentions. They serve as a critical resource for both governments and leading AI developers like Google DeepMind and Anthropic, providing expertise and methodologies for assessing and mitigating AI safety risks.
Apollo Research
London, United Kingdom
Apollo Research is a UK-based AI safety and alignment company specializing in the detection of deceptive behavior in advanced AI systems, particularly large language model agents. They develop and implement novel AI model evaluations focused on “scheming” – covertly pursuing misaligned objectives – and provide technical expertise to governments and international organizations on AI governance and regulation. Their core offering is third-party evaluation of frontier AI models, alongside consultancy services for responsible AI development frameworks and policy guidance.
Center for AI Safety
San Francisco, United States
The Center for AI Safety is a US-based nonprofit focused on mitigating potentially catastrophic risks from advanced artificial intelligence. They conduct and fund research into AI safety, with a particular emphasis on identifying and addressing vulnerabilities in increasingly capable AI systems – offering resources like a dedicated compute cluster to support this work. Their primary audience includes AI researchers, policymakers, and stakeholders concerned with the responsible development and deployment of powerful AI technologies.
AI Fund
Palo Alto, United States
AI Fund is a venture studio that actively co-founds AI-driven companies alongside entrepreneurs and corporate partners. Leveraging over $370 million in backing and the expertise of Andrew Ng, they provide pre-seed funding, technical talent, and a company-building methodology to rapidly launch ventures. AI Fund focuses on applying AI to solve complex problems across industries like maritime, finance, and relationship management, partnering with those possessing deep domain expertise.
Intercom Fin
San Francisco, United States
Intercom Fin is a SaaS platform offering an AI agent for customer service automation. Utilizing a GPT-4 foundation and a proprietary “Fin Flywheel” continuous improvement loop, the platform is trained on a company’s specific knowledge base and policies to resolve complex customer queries across multiple channels. Intercom Fin targets businesses seeking to improve customer support efficiency and consistency through AI-powered automation and performance analytics.
Technology Innovation Institute
Abu Dhabi, United Arab Emirates
Technology Innovation Institute (TII) is a UAE-based research center focused on advanced technology development, with a core competency in artificial intelligence. TII is the creator of the open-source Falcon series of large language models, notable for their performance and accessibility. As part of the Abu Dhabi government’s research ecosystem, TII aims to advance scientific knowledge and deliver impactful technologies through both independent research and international collaboration.
Stats Perform
Chicago, United States
Stats Perform is a sports AI company that provides detailed, real-time data and analytics to the global sports industry. Their core product, Opta, is a comprehensive sports data engine powering 8 proprietary AI models used for predictive analytics and performance insights. Stats Perform serves professional sports teams, broadcasters, media outlets, and betting operators, enabling enhanced content creation, strategic decision-making, and improved fan engagement.
Ray
San Francisco, United States
Ray (Anyscale) provides a unified, open-source distributed computing framework designed to simplify the development and deployment of AI and machine learning applications. Their core technology enables developers to scale Python-based AI workloads – including training and serving for models like LLMs and computer vision applications – across diverse infrastructure, from CPUs to GPUs. Ray targets organizations facing challenges with scaling AI initiatives and optimizing resource utilization, offering a solution to overcome the “AI Complexity Wall” and accelerate time to production.
SDAIA
Riyadh, Saudi Arabia
The Saudi Data and AI Authority (SDAIA) is the governmental body responsible for developing and executing Saudi Arabia’s national AI strategy. SDAIA focuses on establishing a robust data and AI ecosystem through initiatives like national data platforms and AI-powered solutions for public sector applications. Its primary value proposition is to accelerate Saudi Arabia’s digital transformation and achieve the goals outlined in Vision 2030 by leveraging data and artificial intelligence.