Spatial Multi-Omics Analysis Pipeline
Comprehensive biological interpretation of spatial omics data. Transforms spatially variable genes (SVGs), domain annotations, and tissue context into actionable biological insights.
KEY PRINCIPLES:
- Report-first approach - Create report file FIRST, then populate progressively
- Domain-by-domain analysis - Characterize each spatial region independently before comparison
- Gene-list-centric - Analyze user-provided SVGs and marker genes with ToolUniverse databases
- Biological interpretation - Go beyond statistics to explain biological meaning of spatial patterns
- Disease focus - Emphasize disease mechanisms and therapeutic opportunities when disease context is provided
- Evidence grading - Grade all evidence as T1 (human/clinical) to T4 (computational)
- Multi-modal thinking - Integrate RNA, protein, and metabolite information when available
- Validation guidance - Suggest experimental validation approaches for key findings
- Source references - Every statement must cite tool/database source
- English-first queries - Always use English terms in tool calls
When to Use This Skill
Apply when users:
- Provide spatially variable genes from spatial transcriptomics experiments
- Ask about biological interpretation of spatial domains/clusters
- Need pathway enrichment of spatial gene expression data
- Want to understand cell-cell interactions from spatial data
- Ask about tumor microenvironment heterogeneity from spatial omics
- Need druggable targets in specific spatial regions
- Ask about tissue zonation patterns (liver, brain, kidney)
- Want to integrate spatial transcriptomics + proteomics data
NOT for: Single gene interpretation (use target-research), variant interpretation, drug safety, bulk RNA-seq, GWAS analysis.
Input Parameters
| Parameter | Required | Description | Example |
|---|---|---|---|
| svgs | Yes | Spatially variable genes | ['EGFR', 'CDH1', 'VIM', 'MYC', 'CD3E'] |
| tissue_type | Yes | Tissue/organ type | brain, liver, lung, breast |
| technology | No | Spatial omics platform | 10x Visium, MERFISH, DBiTplus |
| disease_context | No | Disease if applicable | breast cancer, Alzheimer disease |
| spatial_domains | No | Domain -> marker genes dict | {'Tumor core': ['MYC','EGFR']} |
| cell_types | No | Cell types from deconvolution | ['Epithelial', 'T cell'] |
| proteins | No | Proteins detected (multi-modal) | ['CD3', 'PD-L1', 'Ki67'] |
| metabolites | No | Metabolites (SpatialMETA) | ['glutamine', 'lactate'] |
Spatial Omics Integration Score (0-100)
Data Completeness (0-30): SVGs (5), Disease context (5), Spatial domains (5), Cell types (5), Multi-modal (5), Literature (5)
Biological Insight (0-40): Pathway enrichment FDR<0.05 (10), Cell-cell interactions (10), Disease mechanism (10), Druggable targets (10)
Evidence Quality (0-30): Cross-database validation 3+ DBs (10), Clinical validation (10), Literature support (10)
| Score | Tier | Interpretation |
|---|---|---|
| 80-100 | Excellent | Comprehensive characterization, strong insights, druggable targets |
| 60-79 | Good | Good pathway/interaction analysis, some therapeutic context |
| 40-59 | Moderate | Basic enrichment, limited domain comparison |
| 0-39 | Limited | Minimal data, gene-level annotation only |
Evidence Grading
| Tier | Criteria | Examples |
|---|---|---|
| [T1] | Direct human/clinical evidence | FDA-approved drug, validated biomarker |
| [T2] | Experimental evidence | Validated spatial pattern, known L-R pair |
| [T3] | Computational/database evidence | PPI prediction, pathway enrichment |
| [T4] | Annotation/prediction only | GO annotation, text-mined association |
Analysis Phases Overview
Phase 0: Input Processing & Disambiguation (ALWAYS FIRST)
Resolve tissue/disease identifiers, establish analysis context. Get MONDO/EFO IDs for disease queries.
- Tools:
OpenTargets_get_disease_id_description_by_name,OpenTargets_get_disease_description_by_efoId,HPA_search_genes_by_query
Phase 1: Gene Characterization
Resolve gene IDs, annotate functions, tissue specificity, subcellular localization.
- Tools:
MyGene_query_genes,UniProt_get_function_by_accession,HPA_get_subcellular_location,HPA_get_rna_expression_by_source,HPA_get_comprehensive_gene_details_by_ensembl_id,HPA_get_cancer_prognostics_by_gene,UniProtIDMap_gene_to_uniprot
Phase 2: Pathway & Functional Enrichment
Identify enriched pathways globally and per-domain. Filter FDR < 0.05.
- Tools:
STRING_functional_enrichment(PRIMARY),ReactomeAnalysis_pathway_enrichment,GO_get_annotations_for_gene,kegg_search_pathway,WikiPathways_search
Phase 3: Spatial Domain Characterization
Characterize each domain biologically, assign cell types from markers, compare domains.
- Tools: Phase 2 tools +
HPA_get_biological_processes_by_gene,HPA_get_protein_interactions_by_gene
Phase 4: Cell-Cell Interaction Inference
Predict communication from spatial patterns. Check ligand-receptor pairs across domains.
- Tools:
STRING_get_interaction_partners,STRING_get_protein_interactions,intact_search_interactions,Reactome_get_interactor,DGIdb_get_drug_gene_interactions
Phase 5: Disease & Therapeutic Context
Connect to disease mechanisms, identify druggable targets, find clinical trials.
- Tools:
OpenTargets_get_associated_targets_by_disease_efoId,OpenTargets_get_target_tractability_by_ensemblID,OpenTargets_get_associated_drugs_by_target_ensemblID,clinical_trials_search,DGIdb_get_gene_druggability,civic_search_genes
Phase 6: Multi-Modal Integration
Integrate protein/RNA/metabolite data. Compare spatial RNA with protein detection.
- Tools:
HPA_get_subcellular_location,HPA_get_rna_expression_in_specific_tissues,Reactome_map_uniprot_to_pathways,kegg_get_pathway_info
Phase 7: Immune Microenvironment (Cancer/Inflammation only)
Classify immune cells, check checkpoint expression, assess Hot vs Cold vs Excluded patterns.
- Tools:
STRING_functional_enrichment,OpenTargets_get_target_tractability_by_ensemblID,iedb_search_epitopes
Phase 8: Literature & Validation Context
Search published evidence, suggest validation experiments (smFISH, IHC, PLA).
- Tools:
PubMed_search_articles,openalex_literature_search
See phase-procedures.md for detailed workflows, decision logic, and tool parameter specifications per phase.
Report Structure
Create file: {tissue}_{disease}_spatial_omics_report.md
# Spatial Multi-Omics Analysis Report: {Tissue Type}
**Report Generated**: {date} | **Technology**: {platform}
**Tissue**: {tissue_type} | **Disease**: {disease or "Normal tissue"}
**Total SVGs**: {count} | **Spatial Domains**: {count}
**Spatial Omics Integration Score**: (calculated after analysis)
## Executive Summary
## 1. Tissue & Disease Context
## 2. Spatially Variable Gene Characterization
- 2.1 Gene ID Resolution
- 2.2 Tissue Expression Patterns
- 2.3 Subcellular Localization
- 2.4 Disease Associations
## 3. Pathway Enrichment Analysis
- 3.1 STRING, 3.2 Reactome, 3.3-3.5 GO (BP, MF, CC)
## 4. Spatial Domain Characterization (per-domain + comparison)
## 5. Cell-Cell Interaction Inference
- 5.1 PPI, 5.2 Ligand-Receptor, 5.3 Signaling Pathways
## 6. Disease & Therapeutic Context
- 6.1 Disease Gene Overlap, 6.2 Druggable Targets, 6.3 Drug Mechanisms, 6.4 Trials
## 7. Multi-Modal Integration (if data available)
## 8. Immune Microenvironment (if relevant)
## 9. Literature & Validation Context
## Spatial Omics Integration Score (breakdown table)
## Completeness Checklist
## References (tools used, database versions)
See report-template.md for full template with table structures.
Completeness Checklist
- Gene ID resolution complete
- Tissue expression patterns analyzed (HPA)
- Subcellular localization checked (HPA)
- Pathway enrichment complete (STRING + Reactome)
- GO enrichment complete (BP + MF + CC)
- Spatial domains characterized individually
- Domain comparison performed
- PPI analyzed (STRING)
- Ligand-receptor pairs identified
- Disease associations checked (OpenTargets)
- Druggable targets identified
- Multi-modal integration performed (if data available)
- Immune microenvironment characterized (if relevant)
- Literature search completed
- Validation recommendations provided
- Integration Score calculated
- Executive summary written
- All sections have source citations
Common Use Cases
- Cancer Spatial Heterogeneity: Visium with tumor/stroma/immune domains -> pathways, immune infiltration, druggable targets, checkpoints
- Brain Tissue Zonation: MERFISH with neuronal subtypes -> synaptic signaling, receptors, hippocampal zonation
- Liver Metabolic Zonation: Periportal vs pericentral -> CYP450, Wnt gradient, drug metabolism enzymes
- Tumor-Immune Interface: DBiTplus RNA+protein -> checkpoint L-R pairs, immune exclusion, multi-modal concordance
- Developmental Patterns: Morphogen gradients (Wnt, BMP, FGF, SHH), TF patterns, cell fate genes
- Disease Progression: Disease gradient -> inflammatory response, neuronal loss, therapeutic windows
Reference Files
- phase-procedures.md - Detailed phase workflows, decision logic, tool usage per phase
- tool-reference.md - Tool parameter names, response formats, fallback strategies, limitations
- reference-data.md - Cell type markers, ligand-receptor pairs, immune checkpoint reference
- report-template.md - Full report template with all table structures
- test_spatial_omics.py - Test suite
Summary
Spatial Multi-Omics Analysis provides:
- Gene characterization (ID resolution, function, localization, tissue expression)
- Pathway & functional enrichment (STRING, Reactome, GO, KEGG)
- Spatial domain characterization (per-domain and cross-domain)
- Cell-cell interaction inference (PPI, ligand-receptor, signaling)
- Disease & therapeutic context (disease genes, druggable targets, trials)
- Multi-modal integration (RNA-protein concordance, metabolic pathways)
- Immune microenvironment (cell types, checkpoints, immunotherapy)
- Literature context & validation recommendations
Outputs: Markdown report with Spatial Omics Integration Score (0-100) Uses: 70+ ToolUniverse tools across 9 analysis phases Time: ~10-20 minutes depending on gene list size