Who Will Win It?
An In-game Win Probability Model for Football
Pieter Robberechts, Jan Van Haaren and Jesse Davis
Who Will Win It? An In-game Win Probability Model for Football - - PowerPoint PPT Presentation
Who Will Win It? An In-game Win Probability Model for Football Pieter Robberechts , Jan Van Haaren and Jesse Davis " Belgium stuns Japan with exceptional comeback NyTimes How exceptional was this comeback? We need an in-game win
An In-game Win Probability Model for Football
Pieter Robberechts, Jan Van Haaren and Jesse Davis
NyTimes
We need an in-game win probability model!
Japan Belgium Time remaining 38 min (+ stoppage time) Score 2 Team strength 1699 Elo points 2009 Elo points Red cards Shots on goal 2 3 Possession 42% 58% ....
A win-probability model provides the likelihood that a particular team will win/draw/lose a game based upon a specific game state
⚾ Baseball 🏉 Football 🏁 Basketball
A number of relevant use cases.
Why? Football is a lot harder to model!
You do not know how much time is left
T = 93? 95?
Capture the game state in each frame T = 100
You do not know how much time is left
Find a minimal set of features that describe the current state and have the most impact on the game outcome
cards received
Event-stream data
that the home team will win the game
that the game will end in a tie that the away team will win the game the game state at time t Given: Do: Estimate probabilities
Logistic regression classifier
All game states in test set where the model predicts a win probability of 60% About 60% of these games should actually end in a win Well calibrated =
Logistic regression classifier
Poorly calibrated win / loss probabilities Struggle to predict ties Tie Win Loss
Struggle to predict ties
Multiple logistic regression classifiers per time frame
Tie Win Loss
Breaks down in late game situations
Random forest classifier
Tie Win Loss
Find a minimal set of features that describe the current state and have the most impact on the game outcome
You do not know how much time is left
Football games are often very close, with a margin ≤ 1 goal
Goals often shift the tone of a game
Expected number of goals scored after time t
Time remaining Scoring intensities estimated from the game state
Reds Duel Strength Time
Idea 2: Modelling the number of goals scored between now and the end of the game
Scoring intensities estimated from the game state
αt ∼ 𝒪(αt−1, σ2)
θt, home = invlogit(αt xt, home + β + Ha) θt, away = invlogit(αt xt, away + β)
regression coefficients change over time
Idea 2: Modelling the number of goals scored between now and the end of the game
Proper probability calibration! 😄 Tie Win Loss
Proposed
Better
How accurately can the model predict the final match outcome?
A great "story stat"
An Added Goal Value metric
Hypothesis
Scoring / conceding a goal has a large impact on win probability High mental pressure Example: Identifying "clutch" goal scorers
AGVp90i = ∑Ki
k=1 3 * ΔP(win|xtk) + ΔP(tie|xtk)
Mi * 90
The total added value that occurred from each of player i’s goals, averaged over the number of games played
An Added Goal Value metric
In-game win probability models for football ...