Experiment 002: Paired Delta Prediction¶

Date: December 2025
Status: Completed
Outcome: Moderate correlation (r=0.38), insufficient for practical use

Hypothesis¶

A Siamese architecture that compares reference and alternate sequences can learn to predict splice site score changes (deltas) better than the base model.

Setup¶

Data¶

Input: (ref_seq, alt_seq) pairs from SpliceVarDB variants
Target: base_model(alt) - base_model(ref) (per-position deltas)
Context: 501 nucleotides centered on variant

Model Variations Tested¶

Model	Architecture	Loss
DeltaPredictorV2	Simple CNN	MSE
SimpleCNNDeltaPredictor	Gated CNN	MSE
+ Quantile	Gated CNN	Quantile (τ=0.9)
+ Scaled	Gated CNN	Scaled MSE
+ Temperature	Gated CNN	Temperature scaling

Results¶

Correlation with True Deltas¶

Variation	Pearson r	Notes
V2 Original	-0.04	No learning
V2 + 10x data	0.002	Data alone insufficient
Gated CNN	0.36	Architecture matters!
+ Quantile loss	0.38	Best for this approach
+ Scaling	0.22	Overfitting
+ Temperature	-0.03	No improvement
+ Multi-task	-0.07	Task interference

Best Model Performance¶

Metric	Value
Pearson r	0.38
SA Detection (>0.1)	22%
False Positive Rate	8%

Analysis¶

What Worked¶

Gated CNN architecture: Dilated convolutions capture long-range dependencies
Quantile loss (τ=0.9): Focuses on large deltas, reduces noise learning
LayerNorm + GELU: Better than BatchNorm + ReLU

What Limited Performance¶

Target quality: Base model deltas may be inaccurate for non-SA variants
Training on wrong labels: If base model predicts delta for normal variant, we learn noise
Correlation ceiling: r=0.38 suggests fundamental limitation

Key Insight¶

The target (base model delta) is the limiting factor.

If base model prediction for a variant is wrong, we're training on wrong labels. This limits how much we can improve over the base model.

Lessons Learned¶

Architecture matters: Gated CNN >> Simple CNN
Loss matters: Quantile loss >> MSE for sparse deltas
Target quality limits learning: Can't exceed base model accuracy with base model labels
Need validated targets: SpliceVarDB can help filter reliable training examples

Next Steps¶

This experiment motivated: 1. Binary Classification (Experiment 003) - Decomposed approach 2. Validated Delta Prediction (Experiment 004) - Ground truth filtering

Files¶

File	Description
`models/delta_predictor.py`	Siamese architecture
`models/delta_predictor_v2.py`	Per-position output
`models/hyenadna_delta_predictor.py`	SimpleCNNDeltaPredictor
`models/delta_predictor_calibrated.py`	Calibration wrappers