Limbix (by BigHealth)
Evidence-based digital therapeutics for adolescent mental health and behavioral activation.

Clinical-grade NLP for extracting and linking medical concepts to standard ontologies with unsupervised learning.
MedCAT (Medical Concept Annotation Tool) represents the pinnacle of open-source clinical Natural Language Processing (NLP) architecture, specifically engineered for Named Entity Recognition and Linking (NER+L). Developed by the CogStack team at King's College London, it addresses the massive challenge of unstructured data in Electronic Health Records (EHRs). By 2026, MedCAT has matured into a foundational layer for clinical data science, utilizing a unique combination of shallow neural networks and unsupervised learning for concept extraction, followed by supervised online learning for context-aware disambiguation. Unlike traditional rule-based systems, MedCAT learns medical semantics directly from large-scale corpora, allowing it to map text to standardized ontologies such as SNOMED CT, ICD-10, and UMLS with high precision. Its architecture is built atop spaCy, ensuring high-throughput performance capable of processing millions of patient records. The framework's 'Learning while Annotating' capability allows clinical experts to refine models in real-time, drastically reducing the cold-start problem in medical entity extraction. As healthcare moves toward the OMOP Common Data Model (CDM), MedCAT serves as the critical translation engine between raw clinician notes and structured research databases.
Uses a context-aware neural network that learns to distinguish between identical terms with different meanings based on surrounding tokens.
Evidence-based digital therapeutics for adolescent mental health and behavioral activation.
Predictive clinical and operational intelligence to fight death and waste in healthcare.
Predictive medical data and clinical insights for streamlined enterprise underwriting.
The professional medical network for clinicians, providing HIPAA-compliant AI and telehealth solutions.
Verified feedback from the global deployment network.
Post queries, share implementation strategies, and help other users.
A modular system that adds secondary attributes to entities, such as Negated=True or Temporality=Past.
An interface for clinicians to provide active learning feedback, which updates the model weights in real-time.
Internal CDB structure can ingest any medical dictionary including SNOMED, ICD-10, UMLS, or custom local codes.
Optimized for parallel processing using spaCy's pipe architecture, enabling millions of documents to be processed across CPU clusters.
Structured outputs are pre-formatted to facilitate ingestion into the Observational Medical Outcomes Partnership Common Data Model.
Allows for the correction of false positives/negatives which are immediately utilized to update the CDB's probabilistic model.
Researchers need to find patients with specific symptoms that are only documented in free-text clinical notes, not in billing codes.
Registry Updated:2/7/2026
Hospital coding departments spend excessive time manually reading notes to assign ICD-10 codes for billing.
Identifying patients at risk of sepsis based on subtle text indicators before vital signs deteriorate.