Moneyball uses a Bayesian prediction framework β the same architecture as our NCAA tournament engine β adapted for MLB's unique statistical richness. A quantitative simulation engine produces the prior probability from 25+ sabermetric inputs, and Claude Sonnet applies qualitative adjustments for factors the math can't fully capture.
Baseball is the most quantifiable major sport. Decades of sabermetric research have produced metrics that genuinely predict performance better than traditional stats. Moneyball's edge is combining the best of these metrics into a unified simulation framework, then layering in real-time context: starting pitcher news, bullpen usage, lineup changes, weather, and sharp money movement.
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β 15+ DATA TOOLS β
β MLB Stats API Β· ESPN Β· The Odds API Β· Baseball Savant β
β FanGraphs scrapers Β· weather API Β· injury reports β
β Vegas lines Β· starting pitcher logs Β· bullpen availability β
ββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β PHASE 1: DATA GATHERING β
β Parallel fetches: team stats Β· starter ERA/FIP/xFIP β
β Β· bullpen ERA Β· lineup Β· park factors Β· weather β
ββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β PHASE 2: PYTHAGOREAN BASELINE β
β Run differential β expected win% (fundamental predictor) β
β RSΒ²/(RSΒ²+RAΒ²) = Pythagorean expectation β
β Adjusted for strength of schedule, park effects β
ββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β PHASE 3: PITCHING ADJUSTMENT β
β SP FIP/xFIP vs opposing lineup wOBA/wRC+ β
β Bullpen ERA + usage + days rest for each arm β
β Platoon splits, strikeout rates, walk rates β
ββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β PHASE 4: SITUATIONAL FACTORS β
β Ballpark run factor Β· wind speed/direction Β· temperature β
β Home field advantage Β· recent form (L10) Β· day/night split β
β Head-to-head matchup history Β· travel/rest advantage β
ββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β PHASE 5: 10,000-GAME MONTE CARLO β
β Each sim: compute per-inning run distributions β
β SP Γ bullpen Γ lineup Γ park Γ weather Γ variance β
β Output: win% Β± 1% (95% CI) β
ββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β PHASE 6: QUALITATIVE LAYER (Claude Sonnet Β±5pp) β
β Undisclosed injuries Β· lineup decisions Β· narrative context β
β Sharp money signals Β· manager tendencies Β· weather details β
ββββββββββββββββββββββββββββ¬βββββββββββββββββββββββββββββββββββββββ
β
βΌ
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ
β FINAL PREDICTION β
β Win% Β· Moneyline value Β· Run line edge Β· Total lean β
β Parlay eligibility Β· Confidence tier Β· Key factors β
βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ