Skip to content

Agentic-SpliceAI

Overview

pleiadian53/agentic-spliceai

Foundation Models — Documentation¶

Topic-specific documentation for the foundation_models sub-package.

`evo2/` — Evo2 Model Topics¶

Document	Description
junction_support_labels.md	Why and how to use RNA-seq junction support counts as confidence-weighted per-nucleotide exon labels; connection to adaptive splice site prediction

`training/` — Training Infrastructure¶

Document	Description
deepspeed_training.md	DeepSpeed ZeRO configurations for LoRA fine-tuning on RunPods (ZeRO-2, ZeRO-3 + CPU offload); `AdaptiveTrainer` wrapper that auto-selects between DeepSpeed and standard PyTorch
remote_training_options.md	Modal vs SkyPilot vs RunPods — programmatic remote training comparison (canonical: `docs/compute/`)

Quick orientation¶

foundation_models/
├── foundation_models/          # Python package
│   ├── evo2/                   # Evo2 model wrappers
│   │   ├── config.py           # Evo2Config (M1 Mac + GPU presets)
│   │   ├── model.py            # Evo2Model (HuggingFace AutoModel wrapper)
│   │   ├── embedder.py         # Evo2Embedder (chunking + HDF5 cache)
│   │   └── classifier.py      # ExonClassifier (Linear / MLP / CNN / LSTM)
│   └── utils/
│       ├── quantization.py     # Device detection, INT8/INT4, memory helpers
│       └── chunking.py         # Sequence chunking, embedding stitching,
│                               # exon label derivation, window generation
├── configs/                    # DeepSpeed JSON configs (to be created)
│   ├── deepspeed_zero2.json
│   └── deepspeed_zero3_offload.json
├── docs/                       # This directory
│   ├── evo2/
│   └── training/
└── examples/
    └── 01_load_evo2_local.py