Close Menu

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    What's Hot

    Mexico joins Venezuela earthquake rescue: June 26 mañanera

    June 26, 2026

    You can now use Riot’s anti-cheat on-demand

    June 26, 2026

    Guardians Select Cooper Ingle

    June 26, 2026
    Facebook X (Twitter) Instagram
    Select Language
    Facebook X (Twitter) Instagram
    NEWS ON CLICK
    Subscribe
    Friday, June 26
    • Home
      • United States
      • Canada
      • Spain
      • Mexico
    • Top Countries
      • Canada
      • Mexico
      • Spain
      • United States
    • Politics
    • Business
    • Entertainment
    • Fashion
    • Health
    • Science
    • Sports
    • Travel
    NEWS ON CLICK
    Home»Health & Fitness»US Health & Fitness»Have Healthcare Data Lakes Become “Data Swamps”?
    US Health & Fitness

    Have Healthcare Data Lakes Become “Data Swamps”?

    News DeskBy News DeskJune 26, 2026No Comments6 Mins Read
    Share Facebook Twitter Pinterest Copy Link LinkedIn Tumblr Email VKontakte Telegram
    Have Healthcare Data Lakes Become “Data Swamps”?
    Share
    Facebook Twitter Pinterest Email Copy Link

    Healthcare has spent the better part of a decade building data lakes. The original problem was straightforward and worth solving: clinical data lived in silos, providers could not see the full picture of a patient’s care, and aggregation across the ecosystem was a prerequisite for nearly everything that came next. 

    That foundational work has now largely succeeded. Health information exchanges, private interoperability platforms, and the expanding Qualified Health Information Network framework have made it possible to move enormous volumes of clinical data between organizations. In roughly a year, the national framework supporting this exchange has grown from 10 million records to nearly 500 million health records exchanged.

    Industry attention is shifting from collection to activation. Much of the most valuable clinical detail lives in formats a data lake struggles to reach: physician notes, imaging reports, and discharge summaries. A 2025 study in the Journal of Medical Internet Research that analyzed 1.8 million primary care records found that only 13% of the clinical concepts captured in free-text notes had matching counterparts in the structured data of those same records. Even structured records have meaningful gaps. For example, a 2022 study in the Journal of the American Medical Informatics Association found that only 59.4% of chronic conditions were consistently captured across encounter diagnoses and problem lists within a network of more than 500 community health centers. Activation of data, rather than aggregation alone, is the new differentiator.

    Yet many of these data repositories have quietly turned into what could be accurately described as “data swamps.” They are vast, deep, and nearly impossible to navigate without the right clinical lens applied on top. The real challenge facing health data platform companies and health system data teams is what to do with all of it.

    A fragmentation problem

    Industry conversations often default to framing this as a data quality problem. That framing understates the issue. Quality certainly matters. Source records routinely contain text that does not match accompanying codes, mappings that point to the wrong concepts, retired ICD-9 codes that were never updated, and placeholder values such as 9999 that masquerade as real clinical data. Many physician practices and smaller health systems lack the informaticists necessary to maintain a clean data master at the source.

    The deeper problem is fragmentation. Even when individual data points are accurate, they often arrive disconnected from the clinical context that gives them meaning. A laboratory result without its associated problem, a medication without its indication, or a diagnosis without its supporting evidence cannot answer the questions clinicians actually ask. Data quality work alone produces tidier data, which falls short of the ultimate goal of creating clinical understanding.

    Three capabilities missing from most data lakes

    Three capabilities consistently separate platforms that activate clinical data from those that merely store it.

    The first is data extraction. A significant share of clinically relevant information lives in PDFs, scanned documents, free-text notes, and discharge summaries. Without natural language processing (NLP) tools that can convert that narrative content into reliable, coded, structured data, the most clinically rich material in any patient record remains inaccessible to downstream analytics, reporting, and AI systems.

    The second is a clinical lens. Clinicians do not think in data tables. Rather, they think by problem, such as the status of any given patient’s heart failure, diabetes, or recent surgical recovery. Activating a data lake requires the ability to filter, organize, and present information by problem, by specialty, and by relevance to the decision at hand. That capability depends on a curated clinical knowledge graph that understands the relationships between diagnoses, symptoms, tests, treatments, and outcomes. Without such a layer, a longitudinal patient summary remains an undifferentiated wall of records.

    The third is scrubbing. Tools that resolve duplicates, reconcile conflicting records, normalize terminology, replace invalid or retired codes, and validate diagnoses against supporting evidence allow the underlying data foundation to be trusted. Without that layer, every downstream model, dashboard, or AI agent is operating on shifting ground.

    An activated data lake 

    Picture a clinician at the point of care opening a portal from inside the electronic health record. The portal displays everything known about the patient from external sources, organized longitudinally and filtered by the problem the clinician is treating that day. The clinician can drill down on a single specialty, screen for findings the chart may have missed, and selectively import only the data relevant to today’s encounter. The patient’s diabetes management is in clear view. The unrelated ankle sprain being treated by another provider stays in the background.

    While simple on the surface to the clinician user, that experience requires several layers underneath working together: a normalized clinical data foundation, NLP that converts narratives into structured concepts, mappings to standard terminologies and interoperability formats including SNOMED CT, LOINC, RxNorm, FHIR, and C-CDA, and a clinical knowledge graph that organizes concepts by clinical meaning. Each layer reinforces the others. Likewise, pulling any one of them out collapses the workflow.

    From aggregation to activation

    Building toward that kind of architecture is what separates a data lake from a data swamp. Health data platforms and health systems that invested heavily in aggregation are right to be proud of the foundation they laid. The next phase of competitive differentiation will favor those that turn accumulated data into clinical intelligence. That requires treating clinical context as a core architectural concern from the foundation up. Storage was the easy part. Meaning is the hard part, and meaning is what clinicians, patients, and AI systems all depend on.

    Health systems and platform vendors that take that step will quickly discover what their data was meant to do.

    Photo: Weiquan Lin, Getty Images


    David Lareau is Chief Executive Officer of Medicomp. Lareau joined Medicomp in 1995 and has responsibility for operations and product management, including customer relations and marketing. Prior to joining Medicomp, Lareau founded a company that installed management communication networks in large enterprises such as The World Bank, DuPont and Sinai Hospital in Baltimore. The Sinai Hospital project, one of the first PC-based LAN systems using email and groupware, was widely acknowledged as one of the largest and most successful implementations of this technology.

    Lareau’s work at Sinai led to the founding of a medical billing company that led, in turn, to his partnership with Medicomp. Realizing that the healthcare industry made less use of information technology than almost any other industry, particularly in the area of clinical care, Lareau immediately saw the potential for Medicomp’s powerful technologies and joined the company to help fulfill Peter Goltra’s vision.

    This post appears through the MedCity Influencers program. Anyone can publish their perspective on business and innovation in healthcare on MedCity News through MedCity Influencers. Click here to find out how.

    data data quality data silos health IT
    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Telegram Copy Link
    News Desk
    • Website

    News Desk is the dedicated editorial force behind News On Click. Comprised of experienced journalists, writers, and editors, our team is united by a shared passion for delivering high-quality, credible news to a global audience.

    Related Posts

    US Health & Fitness

    Documentation Tools Aren’t Just for Doctors: Inside Reid Health’s Deployment of Abridge’s Nurse Tech

    June 26, 2026
    US Health & Fitness

    Why Generative AI Isn’t Enough: The Case for Causal Reasoning in Medicine

    June 26, 2026
    US Health & Fitness

    The Hidden Cost of “Good Enough” in Healthcare Experiences

    June 26, 2026
    US Health & Fitness

    Why ‘Still Open’ Is the Wrong Way to Measure the Hospital Crisis

    June 26, 2026
    US Health & Fitness

    A Metabolic Disease Posing Severe Pancreatic Risks Gets Its First FDA-Approved Therapy

    June 26, 2026
    US Health & Fitness

    Upside Seeks to Solve the Housing Crisis Armed with $20M Raise

    June 25, 2026
    Add A Comment
    Leave A Reply Cancel Reply

    Don't Miss

    Mexico joins Venezuela earthquake rescue: June 26 mañanera

    News DeskJune 26, 20260

    Sheinbaum’s mañanera in 60 seconds 🚑 Venezuela earthquake response: Mexico has deployed 250 military personnel, two Foreign…

    You can now use Riot’s anti-cheat on-demand

    June 26, 2026

    Guardians Select Cooper Ingle

    June 26, 2026

    Will Extreme Heat Change Fashion’s Climate Calculus?

    June 26, 2026
    Tech news by Newsonclick.com
    Top Posts

    Norway is pillaging hearts and minds – Live Updates

    June 26, 2026

    New on Crave: June 2026

    May 27, 2026

    Berkeley grad has diploma denied until he puts away Palestinian flag

    May 27, 2026

    ‘RHORI’ Ashley Iaconetti Slams Kelsey’s Claims She ‘Ignores’ Cast

    May 27, 2026
    Stay In Touch
    • Facebook
    • Twitter
    • Pinterest
    • Instagram
    • YouTube
    • Vimeo

    Subscribe to Updates

    Get the latest creative news from SmartMag about art & design.

    Editors Picks

    Mexico joins Venezuela earthquake rescue: June 26 mañanera

    June 26, 2026

    You can now use Riot’s anti-cheat on-demand

    June 26, 2026

    Guardians Select Cooper Ingle

    June 26, 2026

    Will Extreme Heat Change Fashion’s Climate Calculus?

    June 26, 2026
    About Us

    NewsOnClick.com is your reliable source for timely and accurate news. We are committed to delivering unbiased reporting across politics, sports, entertainment, technology, and more. Our mission is to keep you informed with credible, fact-checked content you can trust.

    We're social. Connect with us:

    Facebook X (Twitter) Instagram Pinterest YouTube
    Latest Posts

    Mexico joins Venezuela earthquake rescue: June 26 mañanera

    June 26, 2026

    You can now use Riot’s anti-cheat on-demand

    June 26, 2026

    Guardians Select Cooper Ingle

    June 26, 2026

    Subscribe to Updates

    Get the latest creative news from FooBar about art, design and business.

    Facebook X (Twitter) Instagram Pinterest
    • About Us
    • Editorial Policy
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    • Advertise
    • Contact Us
    © 2026 Newsonclick.com || Designed & Powered by ❤️ Trustmomentum.com.

    Type above and press Enter to search. Press Esc to cancel.