FDA Database Access
Overview
Access comprehensive FDA regulatory data through openFDA, the FDA's initiative to provide open APIs for public datasets. Query information about drugs, medical devices, foods, animal/veterinary products, and substances using Python with standardized interfaces.
Key capabilities:
-
Query adverse events for drugs, devices, foods, and veterinary products
-
Access product labeling, approvals, and regulatory submissions
-
Monitor recalls and enforcement actions
-
Look up National Drug Codes (NDC) and substance identifiers (UNII)
-
Analyze device classifications and clearances (510k, PMA)
-
Track drug shortages and supply issues
-
Research chemical structures and substance relationships
When to Use This Skill
This skill should be used when working with:
-
Drug research: Safety profiles, adverse events, labeling, approvals, shortages
-
Medical device surveillance: Adverse events, recalls, 510(k) clearances, PMA approvals
-
Food safety: Recalls, allergen tracking, adverse events, dietary supplements
-
Veterinary medicine: Animal drug adverse events by species and breed
-
Chemical/substance data: UNII lookup, CAS number mapping, molecular structures
-
Regulatory analysis: Approval pathways, enforcement actions, compliance tracking
-
Pharmacovigilance: Post-market surveillance, safety signal detection
-
Scientific research: Drug interactions, comparative safety, epidemiological studies
Quick Start
- Basic Setup
from scripts.fda_query import FDAQuery
Initialize (API key optional but recommended)
fda = FDAQuery(api_key="YOUR_API_KEY")
Query drug adverse events
events = fda.query_drug_events("aspirin", limit=100)
Get drug labeling
label = fda.query_drug_label("Lipitor", brand=True)
Search device recalls
recalls = fda.query("device", "enforcement", search="classification:Class+I", limit=50)
- API Key Setup
While the API works without a key, registering provides higher rate limits:
-
Without key: 240 requests/min, 1,000/day
-
With key: 240 requests/min, 120,000/day
Register at: https://open.fda.gov/apis/authentication/
Set as environment variable:
export FDA_API_KEY="your_key_here"
- Running Examples
Run comprehensive examples
python scripts/fda_examples.py
This demonstrates:
- Drug safety profiles
- Device surveillance
- Food recall monitoring
- Substance lookup
- Comparative drug analysis
- Veterinary drug analysis
FDA Database Categories
Drugs
Access 6 drug-related endpoints covering the full drug lifecycle from approval to post-market surveillance.
Endpoints:
-
Adverse Events - Reports of side effects, errors, and therapeutic failures
-
Product Labeling - Prescribing information, warnings, indications
-
NDC Directory - National Drug Code product information
-
Enforcement Reports - Drug recalls and safety actions
-
Drugs@FDA - Historical approval data since 1939
-
Drug Shortages - Current and resolved supply issues
Common use cases:
Safety signal detection
fda.count_by_field("drug", "event", search="patient.drug.medicinalproduct:metformin", field="patient.reaction.reactionmeddrapt")
Get prescribing information
label = fda.query_drug_label("Keytruda", brand=True)
Check for recalls
recalls = fda.query_drug_recalls(drug_name="metformin")
Monitor shortages
shortages = fda.query("drug", "drugshortages", search="status:Currently+in+Shortage")
Reference: See references/drugs.md for detailed documentation
Devices
Access 9 device-related endpoints covering medical device safety, approvals, and registrations.
Endpoints:
-
Adverse Events - Device malfunctions, injuries, deaths
-
510(k) Clearances - Premarket notifications
-
Classification - Device categories and risk classes
-
Enforcement Reports - Device recalls
-
Recalls - Detailed recall information
-
PMA - Premarket approval data for Class III devices
-
Registrations & Listings - Manufacturing facility data
-
UDI - Unique Device Identification database
-
COVID-19 Serology - Antibody test performance data
Common use cases:
Monitor device safety
events = fda.query_device_events("pacemaker", limit=100)
Look up device classification
classification = fda.query_device_classification("DQY")
Find 510(k) clearances
clearances = fda.query_device_510k(applicant="Medtronic")
Search by UDI
device_info = fda.query("device", "udi", search="identifiers.id:00884838003019")
Reference: See references/devices.md for detailed documentation
Foods
Access 2 food-related endpoints for safety monitoring and recalls.
Endpoints:
-
Adverse Events - Food, dietary supplement, and cosmetic events
-
Enforcement Reports - Food product recalls
Common use cases:
Monitor allergen recalls
recalls = fda.query_food_recalls(reason="undeclared peanut")
Track dietary supplement events
events = fda.query_food_events( industry="Dietary Supplements")
Find contamination recalls
listeria = fda.query_food_recalls( reason="listeria", classification="I")
Reference: See references/foods.md for detailed documentation
Animal & Veterinary
Access veterinary drug adverse event data with species-specific information.
Endpoint:
- Adverse Events - Animal drug side effects by species, breed, and product
Common use cases:
Species-specific events
dog_events = fda.query_animal_events( species="Dog", drug_name="flea collar")
Breed predisposition analysis
breed_query = fda.query("animalandveterinary", "event", search="reaction.veddra_term_name:seizure+AND+" "animal.breed.breed_component:Labrador")
Reference: See references/animal_veterinary.md for detailed documentation
Substances & Other
Access molecular-level substance data with UNII codes, chemical structures, and relationships.
Endpoints:
-
Substance Data - UNII, CAS, chemical structures, relationships
-
NSDE - Historical substance data (legacy)
Common use cases:
UNII to CAS mapping
substance = fda.query_substance_by_unii("R16CO5Y76E")
Search by name
results = fda.query_substance_by_name("acetaminophen")
Get chemical structure
structure = fda.query("other", "substance", search="names.name:ibuprofen+AND+substanceClass:chemical")
Reference: See references/other.md for detailed documentation
Common Query Patterns
Pattern 1: Safety Profile Analysis
Create comprehensive safety profiles combining multiple data sources:
def drug_safety_profile(fda, drug_name): """Generate complete safety profile."""
# 1. Total adverse events
events = fda.query_drug_events(drug_name, limit=1)
total = events["meta"]["results"]["total"]
# 2. Most common reactions
reactions = fda.count_by_field(
"drug", "event",
search=f"patient.drug.medicinalproduct:*{drug_name}*",
field="patient.reaction.reactionmeddrapt",
exact=True
)
# 3. Serious events
serious = fda.query("drug", "event",
search=f"patient.drug.medicinalproduct:*{drug_name}*+AND+serious:1",
limit=1)
# 4. Recent recalls
recalls = fda.query_drug_recalls(drug_name=drug_name)
return {
"total_events": total,
"top_reactions": reactions["results"][:10],
"serious_events": serious["meta"]["results"]["total"],
"recalls": recalls["results"]
}
Pattern 2: Temporal Trend Analysis
Analyze trends over time using date ranges:
from datetime import datetime, timedelta
def get_monthly_trends(fda, drug_name, months=12): """Get monthly adverse event trends.""" trends = []
for i in range(months):
end = datetime.now() - timedelta(days=30*i)
start = end - timedelta(days=30)
date_range = f"[{start.strftime('%Y%m%d')}+TO+{end.strftime('%Y%m%d')}]"
search = f"patient.drug.medicinalproduct:*{drug_name}*+AND+receivedate:{date_range}"
result = fda.query("drug", "event", search=search, limit=1)
count = result["meta"]["results"]["total"] if "meta" in result else 0
trends.append({
"month": start.strftime("%Y-%m"),
"events": count
})
return trends
Pattern 3: Comparative Analysis
Compare multiple products side-by-side:
def compare_drugs(fda, drug_list): """Compare safety profiles of multiple drugs.""" comparison = {}
for drug in drug_list:
# Total events
events = fda.query_drug_events(drug, limit=1)
total = events["meta"]["results"]["total"] if "meta" in events else 0
# Serious events
serious = fda.query("drug", "event",
search=f"patient.drug.medicinalproduct:*{drug}*+AND+serious:1",
limit=1)
serious_count = serious["meta"]["results"]["total"] if "meta" in serious else 0
comparison[drug] = {
"total_events": total,
"serious_events": serious_count,
"serious_rate": (serious_count/total*100) if total > 0 else 0
}
return comparison
Pattern 4: Cross-Database Lookup
Link data across multiple endpoints:
def comprehensive_device_lookup(fda, device_name): """Look up device across all relevant databases."""
return {
"adverse_events": fda.query_device_events(device_name, limit=10),
"510k_clearances": fda.query_device_510k(device_name=device_name),
"recalls": fda.query("device", "enforcement",
search=f"product_description:*{device_name}*"),
"udi_info": fda.query("device", "udi",
search=f"brand_name:*{device_name}*")
}
Working with Results
Response Structure
All API responses follow this structure:
{ "meta": { "disclaimer": "...", "results": { "skip": 0, "limit": 100, "total": 15234 } }, "results": [ # Array of result objects ] }
Error Handling
Always handle potential errors:
result = fda.query_drug_events("aspirin", limit=10)
if "error" in result: print(f"Error: {result['error']}") elif "results" not in result or len(result["results"]) == 0: print("No results found") else: # Process results for event in result["results"]: # Handle event data pass
Pagination
For large result sets, use pagination:
Automatic pagination
all_results = fda.query_all( "drug", "event", search="patient.drug.medicinalproduct:aspirin", max_results=5000 )
Manual pagination
for skip in range(0, 1000, 100): batch = fda.query("drug", "event", search="...", limit=100, skip=skip) # Process batch
Best Practices
- Use Specific Searches
DO:
Specific field search
search="patient.drug.medicinalproduct:aspirin"
DON'T:
Overly broad wildcard
search="aspirin"
- Implement Rate Limiting
The FDAQuery class handles rate limiting automatically, but be aware of limits:
-
240 requests per minute
-
120,000 requests per day (with API key)
- Cache Frequently Accessed Data
The FDAQuery class includes built-in caching (enabled by default):
Caching is automatic
fda = FDAQuery(api_key=api_key, use_cache=True, cache_ttl=3600)
- Use Exact Matching for Counting
When counting/aggregating, use .exact suffix:
Count exact phrases
fda.count_by_field("drug", "event", search="...", field="patient.reaction.reactionmeddrapt", exact=True) # Adds .exact automatically
- Validate Input Data
Clean and validate search terms:
def clean_drug_name(name): """Clean drug name for query.""" return name.strip().replace('"', '\"')
drug_name = clean_drug_name(user_input)
API Reference
For detailed information about:
-
Authentication and rate limits → See references/api_basics.md
-
Drug databases → See references/drugs.md
-
Device databases → See references/devices.md
-
Food databases → See references/foods.md
-
Animal/veterinary databases → See references/animal_veterinary.md
-
Substance databases → See references/other.md
Scripts
scripts/fda_query.py
Main query module with FDAQuery class providing:
-
Unified interface to all FDA endpoints
-
Automatic rate limiting and caching
-
Error handling and retry logic
-
Common query patterns
scripts/fda_examples.py
Comprehensive examples demonstrating:
-
Drug safety profile analysis
-
Device surveillance monitoring
-
Food recall tracking
-
Substance lookup
-
Comparative drug analysis
-
Veterinary drug analysis
Run examples:
python scripts/fda_examples.py
Additional Resources
-
openFDA Homepage: https://open.fda.gov/
-
API Documentation: https://open.fda.gov/apis/
-
Interactive API Explorer: https://open.fda.gov/apis/try-the-api/
-
GitHub Repository: https://github.com/FDA/openfda
-
Terms of Service: https://open.fda.gov/terms/
Support and Troubleshooting
Common Issues
Issue: Rate limit exceeded
- Solution: Use API key, implement delays, or reduce request frequency
Issue: No results found
- Solution: Try broader search terms, check spelling, use wildcards
Issue: Invalid query syntax
- Solution: Review query syntax in references/api_basics.md
Issue: Missing fields in results
- Solution: Not all records contain all fields; always check field existence
Getting Help
-
GitHub Issues: https://github.com/FDA/openfda/issues
-
Email: open-fda@fda.hhs.gov