Skip to content

Foundation Models — Documentation

Topic-specific documentation for the foundation_models sub-package.


evo2/ — Evo2 Model Topics

Document Description
junction_support_labels.md Why and how to use RNA-seq junction support counts as confidence-weighted per-nucleotide exon labels; connection to adaptive splice site prediction

training/ — Training Infrastructure

Document Description
deepspeed_training.md DeepSpeed ZeRO configurations for LoRA fine-tuning on RunPods (ZeRO-2, ZeRO-3 + CPU offload); AdaptiveTrainer wrapper that auto-selects between DeepSpeed and standard PyTorch
remote_training_options.md Modal vs SkyPilot vs RunPods — programmatic remote training comparison (canonical: docs/compute/)

Quick orientation

foundation_models/
├── foundation_models/          # Python package
│   ├── evo2/                   # Evo2 model wrappers
│   │   ├── config.py           # Evo2Config (M1 Mac + GPU presets)
│   │   ├── model.py            # Evo2Model (HuggingFace AutoModel wrapper)
│   │   ├── embedder.py         # Evo2Embedder (chunking + HDF5 cache)
│   │   └── classifier.py      # ExonClassifier (Linear / MLP / CNN / LSTM)
│   └── utils/
│       ├── quantization.py     # Device detection, INT8/INT4, memory helpers
│       └── chunking.py         # Sequence chunking, embedding stitching,
│                               # exon label derivation, window generation
├── configs/                    # DeepSpeed JSON configs (to be created)
│   ├── deepspeed_zero2.json
│   └── deepspeed_zero3_offload.json
├── docs/                       # This directory
│   ├── evo2/
│   └── training/
└── examples/
    └── 01_load_evo2_local.py