Provider data for AI / RAG
The path for AI teams grounding a model in citable federal provider data.
Open →Retrieval-augmented generation is only as trustworthy as the records it retrieves. Ground a model in federal provider data — identity, enrollment, quality, and exclusion status — and every assertion can carry its source. Fonteum serves that data three ways an AI system can consume: an MCP server an agent calls directly, an API for retrieval, and bulk export for an index.
Each field arrives with its 14-tuple provenance, so a RAG answer can name the dataset, source agency, and snapshot date instead of asking the user to take it on faith.
The interfaces an LLM, agent, or RAG pipeline can call. Same provider graph, same provenance, three shapes.
The path for AI teams grounding a model in citable federal provider data.
Open →What an autonomous agent can resolve, screen, and cite without a human in the loop.
Open →A Model Context Protocol server an agent calls directly — structured for LLM consumption.
Open →Resolve a provider and return every field with its provenance in one request.
Open →Five US Core provider resources with row-level provenance — for clinical-system grounding.
Open →Whole-dataset NDJSON and CSV to build a retrieval index over the full provider graph.
Open →The provenance and integrity layer that lets a generated answer point back to a real federal record.
How ingest, reconcile, attest, and serve fit into one pipeline.
Open →Replay what a record said on a past date — so an answer can be reconstructed exactly.
Open →A signed, chained record wrapping any answer an AI system produced.
Open →How a provider is resolved across federal sources into one entity to retrieve against.
Open →Where the data is quoted, and the citation guidance behind it.
Open →The skills declaration multi-agent frameworks parse for autonomous discovery.
Open →The full catalog — each source page is a citable surface with a stat and a table.
Open →Every source an agent can ground in, with refresh cadence and coverage.
Open →Reproducible, numeric studies an AI engine can quote with method and date.
Open →Let an agent screen a party against the exclusion lists and return a dated record.
Open →The HL7 standard behind the interop API.
Open →The health-data interoperability standards body.
Open →The structured public files an AI index ingests.
Open →The identifier a provider answer resolves on.
Open →Drop the MCP server into an agent or call the API, and ground every provider answer in a record that names its source. Request access for higher limits and bulk export.
Built on the authoritative federal record
The primary sources, named on every page.
These are the federal agencies whose public datasets Fonteum ingests and attributes — the issuing authorities, not customers or partners. Every figure on the site links back to one of them.
See the full source registry, with license and refresh cadence for each →
Reproducible by design
Every figure traces to its federal source.
14-tuple provenance
Every rendered fact ties to a source URL, dataset ID, snapshot date, row key, and SHA-256 — the full chain-of-custody record.
Reproducible SQL
Each study ships the exact query behind its figures, run against the cited federal snapshot. Re-run it yourself.
Daily reconciliation
Published counts are reconciled against the upstream federal datasets on a daily cadence, with drift logged.
Named medical review
Reviewed by Jennifer Montecillo, MD, medical reviewer. Non-practicing medical reviewer.
Two doors
Use the free API and open data
Query providers, facilities, sanctions, and quality scores — each field carrying its federal source. Self-serve, no call to start.
Talk to us
Managed pilots, enterprise terms, and audit-ready, signed attestation packages for compliance, risk, and research teams.
The substrate, by the numbers