How Football Intelligence Predicts Matches
Last updated 2026‑05‑31·Version 14.9· hello@footballinteligence.com
Full disclosure of the statistical model behind every prediction on this platform — we publish this for YMYL transparency and so anyone can verify the math. The model is mathematically grounded; it is not magic. There are no guarantees, only probabilities. See the AI football analysis calculator in action →
Data sources & league coverage
Football Intelligence ingests historical and live match data from three primary providers:
| Source | Used for | Update frequency |
|---|---|---|
| apifootball.com | Live scores, events, standings, bookmaker odds snapshots | Real-time, every 30 s |
| football-data.org | Historical results, Poisson seed index, fixtures | Daily refresh |
| Odds aggregator | Live bookmaker odds across 40+ sportsbooks for value detection | Real-time on demand |
Leagues covered. Premier League, La Liga, Bundesliga, Serie A, Ligue 1, UEFA Champions League, Europa League, Conference League, Eredivisie, Primeira Liga, Ekstraklasa, Scottish Premiership, Belgian Pro League.
The Poisson distribution model
The Poisson distribution is a discrete probability distribution that expresses the probability of a given number of events (goals) occurring in a fixed interval, given a known average rate λ (lambda). Maher (1982) and Dixon & Coles (1997) showed that goals in football closely follow independent Poisson processes for each team.
The core formula:
By computing P(home_goals = i) and P(away_goals = j) for all i, j from 0–7, we generate a full 8×8 scoreline probability matrix. Summing the appropriate cells gives home win probability (Σ cells where i > j), draw probability (Σ diagonal), and away win probability (Σ cells where j > i).
Attack strength & defensive record
For each team playing at home, we compute two relative indices from their season-to-date results:
These indices measure performance relative to the league average. A value of 1.0 means average; above 1.0 means better than average in that role.
Lambda calculation for a domestic match:
League averages are maintained in a calibration table updated each season. Example: Premier League home 1.53, away 1.19; Bundesliga home 1.72, away 1.28.
Bayesian regularisation
Raw attack and defense indices are unreliable when a team has played few matches (e.g. early in the season). We apply Bayesian regularisation — shrinkage toward the prior mean of 1.0:
This ensures teams with fewer than 6 matches do not generate extreme predictions. The sample window caps at 14 games to keep the index recency-weighted.
Home advantage weighting
Home advantage is encoded in the separate home and away league averages used as multipliers. We do not apply a fixed additive home-advantage constant; the multiplicative structure naturally captures the league-specific home edge through the league_avg_home_goals parameter, calibrated from 5+ seasons of historical data per competition.
European competition adjustments
For Champions League and Europa League matches, teams come from different domestic leagues with different strength levels. We normalise using the UEFA country coefficient ranking (5-year method):
| League | Strength factor |
|---|---|
| Premier League | 1.10 |
| La Liga | 1.06 |
| Bundesliga | 1.03 |
| Serie A | 0.98 |
| Ligue 1 | 0.93 |
| Primeira Liga | 0.84 |
| Eredivisie | 0.81 |
Each team's domestic lambda is scaled by the ratio of their league's strength factor. European matches use neutral-ground averages (home 1.40, away 1.18) as the baseline, reflecting the reduced home advantage in two-legged European ties.
Scoreline probability matrix
Using λ_home and λ_away, we compute P(i, j) for all scorelines i ∈ [0,7] and j ∈ [0,7]. From the 8×8 matrix we derive:
- Home win probability: Σ P(i,j) where i > j
- Draw probability: Σ P(i,j) where i = j
- Away win probability: Σ P(i,j) where j > i
- BTTS (both teams to score): 1 − P(home=0) − P(away=0) + P(0,0)
- Over 2.5 goals: Σ P(i,j) where i + j ≥ 3
- Exact score probabilities: each P(i,j) cell directly
Fair decimal odds are computed as the inverse of the probability: fair_odds = 1 / probability. Unlike bookmaker odds, these contain no overround margin.
Value betting detection
A value bet exists when a bookmaker's offered decimal odds are higher than the fair odds derived by our model:
We fetch live odds from 40+ sportsbooks and display the highest available odds for each market. Only bets with a positive value edge are surfaced. An edge of 5% or more is generally considered meaningful for long-term profitability — smaller edges sit within model uncertainty.
Bookmaker odds are not fair odds. A typical 1X2 market carries an overround of 5–8%. The "fair" prices we publish strip that margin out using the matrix derived above. Comparing fair prices to fair prices is meaningless — the edge appears only against actual bookmaker quotes.
Kelly Criterion stake calculator
The Kelly Criterion (Kelly, 1956) is a mathematical formula for the theoretically optimal fraction of a bankroll to stake on a bet with known edge and odds:
We display both Full Kelly and Half-Kelly (50% of the calculated fraction). Use Half-Kelly or lower, because:
- No model has perfect probability estimation — model error pulls actual edge below the calculated value.
- Full Kelly leads to high variance and bankroll drawdowns even with a genuine edge.
- The Kelly formula assumes independent bets and a long-run perspective.
Maximum recommended stake. Never exceed 5% of your bankroll on a single bet, regardless of what the Kelly formula returns. Treat Half-Kelly as a ceiling, not a target.
Risk tiers — data-driven pick filtering
After a value pick survives the Kelly and edge gates, it is classified into one of three risk tiers using historical-ROI bands derived from a 380-pick analysis. Picks that historically lost money in their (market × odds) band are rejected by the algorithm and never reach users — even if the raw value calculation is positive.
The tiers below reflect only the (market × odds) bands that survive our historical-ROI filter, not every theoretically positive-EV calculation. The percentages below are hit-rate (how often a pick wins), not profit — a high-hit-rate, low-odds tier can still be net-negative on ROI. See our published accuracy & losses for the honest, settled numbers.
Low Highest historical win rate (~63%)
- Over 2.5 goals at odds 1.65–1.85, value ≥ 2.5%, Kelly ≥ 1.0%
- BTTS at odds 1.50–1.85, value ≥ 2.5%
- Win at odds 1.65–1.80, value ≥ 4%, Kelly ≥ 1.5%
Medium Solid edge with moderate variance (~50% WR)
- Over 2.5 at odds 1.85–2.10, value ≥ 2%
- BTTS at odds 1.85–2.10, value ≥ 2%
- Win at odds 1.80–2.10, value ≥ 3%, Kelly ≥ 1%
High Long-odds edge, higher variance (~30% WR, high payout)
- Win at odds 3.00–3.50, value ≥ 5%, Kelly ≥ 1.5%
- Draw at odds 3.00–3.50, value ≥ 5%
Rejected Never published
- Over 3.5 goals (any odds — too volatile)
- Win at odds < 1.65 (heavy-favorite trap)
- Win at odds 2.10–3.00 (historical −71% ROI)
- Win at odds > 3.50 (longshot variance)
- Draw at odds outside 3.00–3.50
- Any pick with raw value > 25% (model error indicator)
- Under 1.5 / 2.5 / 3.5 and Over 1.5 markets — not in our model
Each tier is calibrated against historical settled picks. Bands are reviewed quarterly and adjusted when sample size and observed ROI justify it. Public dashboard cards "Low + Medium" show the recommended-tier ROI and exclude the speculative High tier from the headline number.
Claude Sonnet AI layer — SportGPT
SportGPT is a conversational AI assistant powered by Claude Sonnet 4.6 (Anthropic). It receives the Poisson model output for a specific match as context and answers natural-language questions about probability breakdowns, form analysis, staking recommendations, and market interpretation.
SportGPT does not independently calculate probabilities — it interprets and explains the outputs of the Poisson model. The AI layer adds natural-language reasoning, contextual knowledge (injuries, team news, head-to-head from training data), and personalised staking advice based on the user's bankroll and risk profile.
Model: Anthropic claude-sonnet-4-6·Context window: up to 20 messages of history (Elite plan)·Temperature: 0.7 (balanced reasoning).
Model limitations & accuracy
We are direct about what the model cannot do:
- Football is high-variance. Even an accurate statistical model cannot reliably predict an individual match. A team with a 70% win probability still loses 30% of the time.
- In-match events are not included. The model does not account for red cards, tactical substitutions, or weather in real time.
- Injury data is partial. The model uses publicly available data. Undisclosed injuries or late team-news changes are not reflected until the index is rebuilt.
- Market efficiency is high. Modern bookmaker odds are very efficient. Genuine value edges of 5%+ are rare and require large samples to realise.
- No profit guarantee. Past model performance does not predict future results. Value betting requires discipline over hundreds of bets to see statistical advantage materialise.
- Small-sample leagues. For leagues with fewer historical seasons in our dataset, predictions carry higher uncertainty and lambda values are more regularised toward league averages.
Football Intelligence displays model accuracy metrics and a settled-pick history so users can evaluate actual performance over time. We do not cherry-pick results or hide losses.
Legal disclaimer
Football Intelligence is an educational and informational tool. Nothing on this platform constitutes financial advice, investment advice, or a recommendation to place any bet. All statistical outputs are for analytical purposes and carry inherent uncertainty.
Sports betting is regulated differently across jurisdictions. It is your responsibility to ensure that sports betting is legal in your country before using this information. This service is strictly for users aged 18 and over.
Sports betting involves significant financial risk. You may lose money. Never bet more than you can afford to lose. If gambling is causing you harm:
- UK: BeGambleAware.org · 0808 802 0133
- UK: GamCare.org.uk · 0808 802 0133
- PL: Centrum Wsparcia — 801 889 880
- International: GamblingTherapy.org
Questions about the methodology? Email hello@footballinteligence.com.