Applications — From Discovery to Therapeutics¶
This directory (docs/applications/) contains domain-specific workflows and use cases for Agentic-SpliceAI. Each application demonstrates how the multi-layer pipeline translates computational predictions into biological and clinical insights. As applications mature, they graduate to docs/products/ for production deployment guides.
The Translational Pathway¶
Current State: Limited by Canonical Annotations¶
Drug discovery today is constrained by incomplete gene annotations:
Known Genes (20,000)
|
Canonical Isoforms (~20,000 in MANE)
|
Druggable Targets (~3,000)
|
Approved Drugs (~1,500 targets)
Bottleneck: 90% of splice sites are non-canonical. Current annotations capture only the most common isoforms, leaving the vast majority of biologically active splice variants undiscovered.
Expanding the Druggable Genome¶
Agentic-SpliceAI discovers context-specific isoforms to expand the therapeutic space. Consider BRCA1 as a concrete example:
| Scope | Splice Sites | Isoforms | Targeting Strategy |
|---|---|---|---|
| Canonical (MANE) | 44 | 1-2 main | Target "BRCA1" broadly |
| Full potential (Ensembl + Discovery) | 1,218+ (27x more) | 100+ (tissue/disease-specific) | Target tumor-specific isoforms selectively |
By moving beyond canonical annotations, we unlock 10-100x more therapeutic targets with the potential for reduced toxicity through isoform-selective drug design.
Application Domains¶
Note: CLI commands shown in the workflows below reflect planned Phase 7-9 functionality. Current capabilities cover base layer prediction, feature engineering, and meta layer training (Phases 1-6).
1. Oncology: Cancer-Specific Isoform Targets¶
Challenge: Tumors express aberrant splice isoforms that drive proliferation, evade apoptosis, and resist therapy. These cancer-specific isoforms are invisible to standard genomic pipelines.
Workflow:
# Predict splice sites for BRCA1 with cancer context
agentic-spliceai-predict --genes BRCA1 --base-model openspliceai
# Discover cancer-specific isoforms (Phase 9)
agentic-spliceai discover --genes BRCA1 \
--context cancer:breast \
--evidence-sources gtex,tcga,clinvar
# Validate with agentic layer (Phase 7)
agentic-spliceai validate --isoforms output/brca1_novel.parquet \
--agents literature,expression,clinical
Impact: Identify tumor-specific BRCA1 isoforms for isoform-selective inhibitors that spare normal tissue function.
2. Clinical Genetics: VUS Interpretation¶
Challenge: Thousands of variants of uncertain significance (VUS) in clinical sequencing may affect splicing but lack functional evidence for classification.
Workflow:
# Analyze splice impact of TP53 variants (Phase 8)
agentic-spliceai variant --vcf patient_variants.vcf \
--genes TP53 \
--clinvar-cross-reference
# Generate clinical report (Phase 7)
agentic-spliceai report --variants output/tp53_splice_impact.parquet \
--format clinical \
--evidence-level acmg
Impact: Reclassify VUS variants by quantifying their splice-disrupting potential with multi-source evidence, enabling actionable clinical decisions.
3. Neurology: Brain-Specific Isoforms¶
Challenge: The brain has the highest rate of alternative splicing of any tissue, with many neurological disorders linked to isoform dysregulation. Autism risk genes alone have hundreds of brain-specific splice variants.
Workflow:
# Discover brain-specific isoforms for autism risk genes (Phase 9)
agentic-spliceai discover \
--genes SHANK3 NRXN1 SYNGAP1 CHD8 \
--context tissue:brain \
--evidence-sources gtex,brainspan
# Cross-reference with known autism associations (Phase 7)
agentic-spliceai validate --isoforms output/asd_novel.parquet \
--agents literature,expression,conservation
Impact: Map the brain-specific isoform landscape of autism risk genes to identify therapeutic targets for splice-modulating interventions.
4. Drug Development: Isoform-Selective Therapeutics¶
Challenge: Most drugs target proteins without distinguishing between isoforms, leading to off-target effects when disease-relevant and normal isoforms are both inhibited.
Workflow:
# Identify druggable isoform-specific regions (Phase 9)
agentic-spliceai druggability --genes BCL2 MCL1 \
--isoform-catalog output/novel_isoforms.parquet \
--assess-selectivity
# Generate drug target report (Phase 7)
agentic-spliceai report --targets output/druggable_isoforms.parquet \
--format pharma \
--include-structure-prediction
Impact: Enable isoform-selective drug design that targets disease-specific variants while sparing normal protein function, reducing toxicity and improving therapeutic windows.
5. Biomarker Discovery: Liquid Biopsy¶
Challenge: Current liquid biopsy approaches rely on known mutations and methylation patterns. Disease-specific splice isoforms detectable in circulating RNA represent an untapped biomarker source.
Workflow:
# Discover tissue-specific isoform biomarkers (Phase 9)
agentic-spliceai discover --gene-list data/cancer_panel.txt \
--context cancer:lung \
--biomarker-mode \
--min-tissue-specificity 0.8
# Validate biomarker candidates (Phase 7)
agentic-spliceai validate --isoforms output/biomarker_candidates.parquet \
--agents expression,clinical \
--cross-tissue-comparison
Impact: Identify cancer-specific splice isoforms as novel liquid biopsy biomarkers for early detection, treatment monitoring, and companion diagnostics.
Last Updated: March 2026