Skip to content
FonteumThe Graph

The capability layer

APIREST + bulk accessMCP serverCallable by AI agentsFHIR R4 APIBulk exportAttestation & audit packReconciliationSource-vs-source diffsEntity graphSnapshotsPoint-in-time, bitemporal

By use case

Exclusion & sanctions screeningCredentialing & provider-data enrichmentAudit evidence & defensible programsProvider data for AI / RAGM&A & network diligence

By buyer

Compliance & riskDevelopers & AI teams

The differentiator

Coverage & sourcesThe catalogFreshnessMethodologyCare CompareFacility qualityBrowse all datasets →
Research

The dev on-ramp

DocsAPI referenceMCPQuickstartStatusChangelogSDKs & integrations
Pricing
Sign inTry the FHIR sandbox →Request access →

Platform

APIMCP serverFHIR R4 APIBulk exportAttestation & audit packReconciliationEntity graphSnapshots

Solutions

Exclusion & sanctions screeningCredentialing & provider-data enrichmentAudit evidence & defensible programsProvider data for AI / RAGM&A & network diligenceCompliance & riskDevelopers & AI teams

Data

Coverage & sourcesFreshnessMethodologyCare CompareBrowse all datasets →
Research

Developers

DocsAPI referenceMCPQuickstartStatusChangelogSDKs & integrations
Pricing
Sign inTry the FHIR sandbox →Request access →

GUIDE · HEALTHCARE DATA FOR AI / RAG

Provider data an agent can cite, not hallucinate.

Ground a model or agent in U.S. healthcare provider data over an MCP server, FHIR R4, or bulk export — every field carrying the federal source row it came from, so each answer points back to a record you can re-check. This guide maps the AI on-ramps and the data behind them.

Request accessProvider data for RAG →

The short answer

How do you use healthcare provider data in AI and RAG?

Retrieval-augmented generation is only as trustworthy as the records it retrieves. Ground a model in federal provider data — identity, enrollment, quality, and exclusion status — and every assertion can carry its source. Fonteum serves that data three ways an AI system can consume: an MCP server an agent calls directly, an API for retrieval, and bulk export for an index.

Each field arrives with its 14-tuple provenance, so a RAG answer can name the dataset, source agency, and snapshot date instead of asking the user to take it on faith.

AI on-ramps

The interfaces an LLM, agent, or RAG pipeline can call. Same provider graph, same provenance, three shapes.

Buyer

Provider data for AI / RAG

The path for AI teams grounding a model in citable federal provider data.

Open →
Buyer

Fonteum for AI agents

What an autonomous agent can resolve, screen, and cite without a human in the loop.

Open →
MCP

The MCP server

A Model Context Protocol server an agent calls directly — structured for LLM consumption.

Open →
API

The REST API

Resolve a provider and return every field with its provenance in one request.

Open →
Interop

FHIR R4 API

Five US Core provider resources with row-level provenance — for clinical-system grounding.

Open →
Egress

Bulk export for an index

Whole-dataset NDJSON and CSV to build a retrieval index over the full provider graph.

Open →

What makes it citable

The provenance and integrity layer that lets a generated answer point back to a real federal record.

Platform

The capability layer

How ingest, reconcile, attest, and serve fit into one pipeline.

Open →
Time-travel

Point-in-time snapshots

Replay what a record said on a past date — so an answer can be reconstructed exactly.

Open →
Evidence

Attestation & audit pack

A signed, chained record wrapping any answer an AI system produced.

Open →
Identity

The provider entity graph

How a provider is resolved across federal sources into one entity to retrieve against.

Open →
AEO

How Fonteum gets cited

Where the data is quoted, and the citation guidance behind it.

Open →
Agent card

The machine-readable agent card

The skills declaration multi-agent frameworks parse for autonomous discovery.

Open →

The data to retrieve

Catalog

Browse every federal dataset

The full catalog — each source page is a citable surface with a stat and a table.

Open →
Coverage

Source families & cadence

Every source an agent can ground in, with refresh cadence and coverage.

Open →
Research

Original studies as a source

Reproducible, numeric studies an AI engine can quote with method and date.

Open →
Use case

Agent-driven exclusion screening

Let an agent screen a party against the exclusion lists and return a dated record.

Open →

Key terms

Glossary

FHIR, defined

The HL7 standard behind the interop API.

Open →
Glossary

HL7, defined

The health-data interoperability standards body.

Open →
Glossary

Machine-readable file, defined

The structured public files an AI index ingests.

Open →
Glossary

What is an NPI number?

The identifier a provider answer resolves on.

Open →

Common questions

Answers in one line each

Why ground a healthcare AI in federal data instead of scraping?
Scraped or stitched provider data has no source chain, so a model can't tell a user where an answer came from or whether it is current. Federal records served with their dataset, source agency, and snapshot date let a RAG answer cite a real, re-checkable record — the difference between an assertion and evidence.
Can an AI agent call Fonteum directly?
Yes. The MCP server lets an agent resolve a provider, screen against exclusion lists, and pull provenance without a human in the loop. The same data is published as machine-readable dataset declarations and an agent card so multi-agent frameworks can discover the skills autonomously.
Is the data free for AI search engines to cite?
Reads of the public federal records carry no paywall, so AI search engines can retrieve and cite them. The numbers and tables render as visible HTML — not JS-only widgets — because retrieval ignores what it can't read. Authentication raises rate limits and unlocks bulk export and signed attestation.
How does a model show an answer is current?
Every field carries the snapshot date of the file it came from, and point-in-time snapshots let a record be replayed as of a past date. A grounded answer can state both when a fact was true and when it was known — so freshness is part of the citation, not an assumption.

Go deeper

The other pillar guides

  • GuideHealthcare provider data
  • GuideExclusion & sanctions screening
  • GuideProvider credentialing data
START HERE

Add federal-data citations to your agent.

Drop the MCP server into an agent or call the API, and ground every provider answer in a record that names its source. Request access for higher limits and bulk export.

Add Fonteum to your agent →Request pilot access

Built on the authoritative federal record

The primary sources, named on every page.

These are the federal agencies whose public datasets Fonteum ingests and attributes — the issuing authorities, not customers or partners. Every figure on the site links back to one of them.

  • CMS
  • HHS-OIG
  • HRSA
  • FDA
  • NLM
  • NUCC
  • Census
  • BLS
  • BEA

See the full source registry, with license and refresh cadence for each →

Reproducible by design

Every figure traces to its federal source.

14-tuple provenance

Every rendered fact ties to a source URL, dataset ID, snapshot date, row key, and SHA-256 — the full chain-of-custody record.

Reproducible SQL

Each study ships the exact query behind its figures, run against the cited federal snapshot. Re-run it yourself.

Daily reconciliation

Published counts are reconciled against the upstream federal datasets on a daily cadence, with drift logged.

Named medical review

Reviewed by Jennifer Montecillo, MD, medical reviewer. Non-practicing medical reviewer.

Read the full provenance and attestation methodology →

Two doors

Use the free API and open data

Query providers, facilities, sanctions, and quality scores — each field carrying its federal source. Self-serve, no call to start.

Explore the API →Browse the data catalog →

Talk to us

Managed pilots, enterprise terms, and audit-ready, signed attestation packages for compliance, risk, and research teams.

Talk to us →
Fonteum
Platform
Platform overviewAPIMCP serverFHIR R4 APIBulk exportAttestation & audit packReconciliationEntity graphSnapshots
Solutions
All solutionsExclusion & sanctions screeningCredentialing & enrichmentAudit evidenceProvider data for AI / RAGM&A & network diligenceCompliance & riskDevelopers & AI teams
Data & sources
Coverage & sourcesBrowse all datasetsState Medicaid exclusionsFreshnessMethodologyCare CompareSanctionsOwnershipStaffingDeficienciesSpecial Focus Facilities
Developers
Developer hubDocsAPI referenceQuickstartStatusChangelogSDKs & integrationsWebhooks
Research & guides
Research hubGuidesHealthcare provider dataExclusion & sanctions screeningProvider credentialing dataHealthcare data for AIHospital margin gapProvider access gapsGlossaryComparisonsCitationsWhy Fonteum
Company
AboutPressCustomersPricingContactEditorial policyCorrections
Trust & legal
TrustQualitySecurityPrivacy policyTerms of serviceMedical disclaimer

Reviewed by Jennifer Montecillo, MD, medical reviewer. Non-practicing medical reviewer.

© 2026 Fonteum LLC. All rights reserved.

·hello@fonteum.com

The U.S. healthcare graph AI can cite — every fact carries its source.

Request access→

The substrate, by the numbers

9.2Mgraph entitiesProviders, organizations, owners, and facilities
15.7Mlinked identifiersNPIs, CCNs, LEIs and more, resolved to entities
5Mgraph edgesSource-attested relationships between entities
44federal source familiesDistinct CMS, OIG, HRSA, FDA and peer datasets
35dataset pagesCitable, downloadable /data catalog pages
67reproducible studiesEach shipping the SQL behind its figures