Documentation
Getting startedInstallationLoading data with AirflowQuerying the databaseFinding tables in pgAdminHow SageRx worksData dictionary
Show me the dataData sourcesSource dataStaging dataIntermediate dataMartsUse cases
Use casesExample queriesShow me the data
A quick peek at the data available using SageRx.
Flow of data
datasource
schema.
datasource
schema in the database is transformed into building blocks (using dbt) that are normalized and cleaned up a bit (maybe normalizing NDCs to NDC11 format, for instance). This is stored in the staging
schema.
staging
schema are not enough and require additional transformation (using dbt) to be useful. Or, multiple data sources need to be combined to have a full effect. These transformations end up in the intermediate
schema.
staging
and intermediate
schemas to add value, we create a table in the mart
schema. Marts are easy-to-use, powerful tables that unlock actionable insights from open drug data.
Available source data
If you’re curious what data sources we already have made easily accessible within SageRx, please review the table below.
Name | Short Description | Data Source |
---|---|---|
This is the full RxNorm monthly data release. Requires (free) UMLS license. | ||
RxTerms is a drug interface terminology derived from RxNorm for prescription writing or medication history recording (e.g. in e-prescribing systems, PHRs). | ||
Product listing data submitted for all finished drugs including prescription and over-the-counter drugs, approved and unapproved drugs and repackaged and relabeled drugs. | ||
Products removed / excluded from the NDC Directory for failure to respond to FDA’s requests for correction for various reasons. | ||
Product listing data submitted for all unfinished drugs, including active pharmaceutical ingredients, drugs for further processing and bulk drug substances for compounding. | ||
Drug products with therapeutic equivalence evaluations approved on the basis of safety and effectiveness by the Food and Drug Administration (FDA) under the Federal Food, Drug, and Cosmetic Act (the Act) and related patent and exclusivity information. | ||
Information about all FDA-licensed biological products regulated by CDER, including licensed biosimilar and interchangeable products, and their reference products, and FDA-licensed allergenic, cellular and gene therapy, hematologic, and vaccine products regulated by CBER. | ||
Identifies drug products approved on the basis of safety and effectiveness by the Food and Drug Administration (FDA) under the Federal Food, Drug, and Cosmetic Act (the Act) and related patent and exclusivity information. | ||
Formulary, pharmacy network, and pricing data for Medicare Prescription Drug Plans and Medicare Advantage (MA) Prescription Drug Plans (with the exception of employer and Program of All-Inclusive Care for the Elderly plans). | ||
FDA recalls. | ||
Also known as ATCPROD. ATC4 codes to product RXCUIs as mapped by NLM. | ||
Historical NDCs mapped to product RXCUIs with start and end dates and indirect RXCUI linkages. |
Available data marts
If you’re just interested in the end product, here are the currently available data marts. Keep checking back for more because we are continuously working on new marts.
Name | Short Description |
---|---|
Current NADAC pricing only. | |
Data roadmap
Here’s a list of data sources we’re targeting in the near future. If you’d like to request a data source, please Request new data source.
Name | Short Description | Data Source |
---|---|---|
Name | Short Description | Data Source |
---|---|---|
The Drug Supply Chain Security Act (DSCSA) requires wholesale distributors and third-party logistics providers to report licensure and other information to FDA annually. | ||
https://www.medicaid.gov/medicaid/prescription-drugs/pharmacy-pricing/index.html | ||
These files contain the Level II alphanumeric HCPCS procedure and modifier codes, their long and short descriptions, and applicable Medicare administrative, coverage and pricing data. | ||
Remittance Advice Remark Codes (RARCs) are used to provide additional explanation for an adjustment already described by a Claim Adjustment Reason Code (CARC) or to convey information about remittance processing. | ||
These codes describe why a claim or service line was paid differently than it was billed. | ||
The Medicare Advantage (MA) / Part D Contract and Enrollment Data section serves as a centralized repository for publicly available data on contracts and plans, enrollment numbers, service area data, and contact information for MA, Prescription Drug Plan (PDP), cost, PACE, and demonstration organizations. | ||
This is a list of NDC/NHRIC Labeler Codes which have been electronically submitted. | ||
Get the drug interactions for a specified drug. | ||
The drug establishments current registration site is a publication of currently registered establishments which manufacture, prepare, propagate, compound or process drugs that are distributed in the U.S. or offered for import to the U.S. | ||
SIDER contains information on marketed medicines and their recorded adverse drug reactions. | ||
DDInter is a comprehensive, professional, and open-access database specific to drug-drug interactions. | ||
The Global Unique Device Identification Database (GUDID) contains key device identification information submitted to the FDA about medical devices that have Unique Device Identifiers (UDI). | ||
National Correct Coding Initiative (NCCI) Medically Unlikely Edits (MUEs) are used by the Medicare Administrative Contractors (MACs), to reduce improper payments for Part B claims. | ||
Product data for drugs in the Medicaid drug rebate program. | ||
FDA’s Global Substance Registration System enables efficient and accurate exchange of information on substances through their Unique Ingredient Identifiers (UNIIs) which can be generated at any time in the regulatory life cycle. | ||
← Previous
Next →