Biomni
Overview
Biomni is an open-source biomedical AI agent framework from Stanford's SNAP lab that autonomously executes complex research tasks across biomedical domains. Use this skill when working on multi-step biological reasoning tasks, analyzing biomedical data, or conducting research spanning genomics, drug discovery, molecular biology, and clinical analysis.
Core Capabilities
Biomni excels at:
-
Multi-step biological reasoning - Autonomous task decomposition and planning for complex biomedical queries
-
Code generation and execution - Dynamic analysis pipeline creation for data processing
-
Knowledge retrieval - Access to ~11GB of integrated biomedical databases and literature
-
Cross-domain problem solving - Unified interface for genomics, proteomics, drug discovery, and clinical tasks
When to Use This Skill
Use biomni for:
-
CRISPR screening - Design screens, prioritize genes, analyze knockout effects
-
Single-cell RNA-seq - Cell type annotation, differential expression, trajectory analysis
-
Drug discovery - ADMET prediction, target identification, compound optimization
-
GWAS analysis - Variant interpretation, causal gene identification, pathway enrichment
-
Clinical genomics - Rare disease diagnosis, variant pathogenicity, phenotype-genotype mapping
-
Lab protocols - Protocol optimization, literature synthesis, experimental design
Quick Start
Installation and Setup
Install Biomni and configure API keys for LLM providers:
uv pip install biomni --upgrade
Configure API keys (store in .env file or environment variables):
export ANTHROPIC_API_KEY="your-key-here"
Optional: OpenAI, Azure, Google, Groq, AWS Bedrock keys
Use scripts/setup_environment.py for interactive setup assistance.
Basic Usage Pattern
from biomni.agent import A1
Initialize agent with data path and LLM choice
agent = A1(path='./data', llm='claude-sonnet-4-20250514')
Execute biomedical task autonomously
agent.go("Your biomedical research question or task")
Save conversation history and results
agent.save_conversation_history("report.pdf")
Working with Biomni
- Agent Initialization
The A1 class is the primary interface for biomni:
from biomni.agent import A1 from biomni.config import default_config
Basic initialization
agent = A1( path='./data', # Path to data lake (~11GB downloaded on first use) llm='claude-sonnet-4-20250514' # LLM model selection )
Advanced configuration
default_config.llm = "gpt-4" default_config.timeout_seconds = 1200 default_config.max_iterations = 50
Supported LLM Providers:
-
Anthropic Claude (recommended): claude-sonnet-4-20250514 , claude-opus-4-20250514
-
OpenAI: gpt-4 , gpt-4-turbo
-
Azure OpenAI: via Azure configuration
-
Google Gemini: gemini-2.0-flash-exp
-
Groq: llama-3.3-70b-versatile
-
AWS Bedrock: Various models via Bedrock API
See references/llm_providers.md for detailed LLM configuration instructions.
- Task Execution Workflow
Biomni follows an autonomous agent workflow:
Step 1: Initialize agent
agent = A1(path='./data', llm='claude-sonnet-4-20250514')
Step 2: Execute task with natural language query
result = agent.go(""" Design a CRISPR screen to identify genes regulating autophagy in HEK293 cells. Prioritize genes based on essentiality and pathway relevance. """)
Step 3: Review generated code and analysis
Agent autonomously:
- Decomposes task into sub-steps
- Retrieves relevant biological knowledge
- Generates and executes analysis code
- Interprets results and provides insights
Step 4: Save results
agent.save_conversation_history("autophagy_screen_report.pdf")
- Common Task Patterns
CRISPR Screening Design
agent.go(""" Design a genome-wide CRISPR knockout screen for identifying genes affecting [phenotype] in [cell type]. Include:
- sgRNA library design
- Gene prioritization criteria
- Expected hit genes based on pathway analysis """)
Single-Cell RNA-seq Analysis
agent.go(""" Analyze this single-cell RNA-seq dataset:
- Perform quality control and filtering
- Identify cell populations via clustering
- Annotate cell types using marker genes
- Conduct differential expression between conditions File path: [path/to/data.h5ad] """)
Drug ADMET Prediction
agent.go(""" Predict ADMET properties for these drug candidates: [SMILES strings or compound IDs] Focus on:
- Absorption (Caco-2 permeability, HIA)
- Distribution (plasma protein binding, BBB penetration)
- Metabolism (CYP450 interaction)
- Excretion (clearance)
- Toxicity (hERG liability, hepatotoxicity) """)
GWAS Variant Interpretation
agent.go(""" Interpret GWAS results for [trait/disease]:
- Identify genome-wide significant variants
- Map variants to causal genes
- Perform pathway enrichment analysis
- Predict functional consequences Summary statistics file: [path/to/gwas_summary.txt] """)
See references/use_cases.md for comprehensive task examples across all biomedical domains.
- Data Integration
Biomni integrates ~11GB of biomedical knowledge sources:
-
Gene databases - Ensembl, NCBI Gene, UniProt
-
Protein structures - PDB, AlphaFold
-
Clinical datasets - ClinVar, OMIM, HPO
-
Literature indices - PubMed abstracts, biomedical ontologies
-
Pathway databases - KEGG, Reactome, GO
Data is automatically downloaded to the specified path on first use.
- MCP Server Integration
Extend biomni with external tools via Model Context Protocol:
MCP servers can provide:
- FDA drug databases
- Web search for literature
- Custom biomedical APIs
- Laboratory equipment interfaces
Configure MCP servers in .biomni/mcp_config.json
- Evaluation Framework
Benchmark agent performance on biomedical tasks:
from biomni.eval import BiomniEval1
evaluator = BiomniEval1()
Evaluate on specific task types
score = evaluator.evaluate( task_type='crispr_design', instance_id='test_001', answer=agent_output )
Access evaluation dataset
dataset = evaluator.load_dataset()
Best Practices
Task Formulation
-
Be specific - Include biological context, organism, cell type, conditions
-
Specify outputs - Clearly state desired analysis outputs and formats
-
Provide data paths - Include file paths for datasets to analyze
-
Set constraints - Mention time/computational limits if relevant
Security Considerations
⚠️ Important: Biomni executes LLM-generated code with full system privileges. For production use:
-
Run in isolated environments (Docker, VMs)
-
Avoid exposing sensitive credentials
-
Review generated code before execution in sensitive contexts
-
Use sandboxed execution environments when possible
Performance Optimization
-
Choose appropriate LLMs - Claude Sonnet 4 recommended for balance of speed/quality
-
Set reasonable timeouts - Adjust default_config.timeout_seconds for complex tasks
-
Monitor iterations - Track max_iterations to prevent runaway loops
-
Cache data - Reuse downloaded data lake across sessions
Result Documentation
Always save conversation history for reproducibility
agent.save_conversation_history("results/project_name_YYYYMMDD.pdf")
Include in reports:
- Original task description
- Generated analysis code
- Results and interpretations
- Data sources used
Resources
References
Detailed documentation available in the references/ directory:
-
api_reference.md
-
Complete API documentation for A1 class, configuration, and evaluation
-
llm_providers.md
-
LLM provider setup (Anthropic, OpenAI, Azure, Google, Groq, AWS)
-
use_cases.md
-
Comprehensive task examples for all biomedical domains
Scripts
Helper scripts in the scripts/ directory:
-
setup_environment.py
-
Interactive environment and API key configuration
-
generate_report.py
-
Enhanced PDF report generation with custom formatting
External Resources
-
Web Platform: https://biomni.stanford.edu
-
Paper: https://www.biorxiv.org/content/10.1101/2025.05.30.656746v1
-
Evaluation Dataset: https://huggingface.co/datasets/biomni/Eval1
Troubleshooting
Common Issues
Data download fails
Manually trigger data lake download
agent = A1(path='./data', llm='your-llm')
First .go() call will download data
API key errors
Verify environment variables
echo $ANTHROPIC_API_KEY
Or check .env file in working directory
Timeout on complex tasks
from biomni.config import default_config default_config.timeout_seconds = 3600 # 1 hour
Memory issues with large datasets
-
Use streaming for large files
-
Process data in chunks
-
Increase system memory allocation
Getting Help
For issues or questions:
-
GitHub Issues: https://github.com/snap-stanford/biomni/issues
-
Documentation: Check references/ files for detailed guidance
-
Community: Stanford SNAP lab and biomni contributors