Data from each single match ... <tackle,15.4,41.1,112> - - PowerPoint PPT Presentation

data from each single match
SMART_READER_LITE
LIVE PREVIEW

Data from each single match ... <tackle,15.4,41.1,112> - - PowerPoint PPT Presentation

Data from each single match ... <tackle,15.4,41.1,112> <pass,25.0,67.1,113> <pass,65.0,87.1,115> <assist,82.1,35.8,120> <goal attempt,82.1,35.8,121> THE PASSES NETWORK AMONG PLAYERS de degree ee = =


slide-1
SLIDE 1
slide-2
SLIDE 2
slide-3
SLIDE 3

... <tackle,15.4,41.1,112> <pass,25.0,67.1,113> <pass,65.0,87.1,115> <assist,82.1,35.8,120> <goal attempt,82.1,35.8,121> ……

Data from each single match

slide-4
SLIDE 4

THE PASSES NETWORK AMONG PLAYERS

de degree ee = = number number of

  • f

neighbor neighbors

2 3 2 4 2 4 2 1 3 4

Variance of degree: 1.16

slide-5
SLIDE 5

FOOTBALL AS A NETWORK

Juventus passes network from last champions league game Opponents Goal

slide-6
SLIDE 6

FOOTBALL AS A NETWORK

Opponents Goal Barcelona passes network from last champions league game

slide-7
SLIDE 7

NETWORK ANALYSIS FOR PERFOMANCE EVALUATION

  • Networks characteristics are a proxy for performance

evaluation and prediction

  • We use only passing networks to outperform the results
  • f standard predictors

Measures involved in our model: we combine different passing indexes into one single indicator (H)

slide-8
SLIDE 8

EVALUATING THE EVALUATOR

Using the average passes per match, the correlation with goals is 0.77… …while the H indicator has a correlation with goals equal to 0.82

slide-9
SLIDE 9

H INDICATOR IN EUROPEAN LEAGUES

slide-10
SLIDE 10

ASSESSING TEAM PERFORMANCES

For each game we consider the H indicator of both teams and we cluster this points according to the real outcome. Centroids of such clusters are confirming the goodness of our approach.

slide-11
SLIDE 11

FOOTBALL GAMES PREDICTION

  • We train several prediction model with a dataset composed by H indicator of

teams and we try to predict games outcome

  • We used the best result from three dummy classifiers (random, class

distribution, most frequent label) as baseline

  • We have cross-validated the results of each classifier

Results of our predictions for the main football leagues

slide-12
SLIDE 12

Luca Pappalardo @sif iffolone

  • lone

Paolo Cintia

@mes mesos

  • sbr

brodlet

  • dleto

Salvatore Rinzivillo @rinz inziv iv

THANKS!

Follow us on Twitter: @bigdatatales

slide-13
SLIDE 13
slide-14
SLIDE 14

“E’ la dura legge del gol fai un gran bel gioco però se non hai difesa gli altri segnano… …e poi vincono.”

Max Pezzali, 1998

slide-15
SLIDE 15

Pezzaliscore(A) = gA

A

tA

A

* tB

B

gB

B

The harsh (mathematic) law

  • f the goals

A,B= team A, team B g: goals t: attempts

Avg Inter: 0.4 Avg Juventus: 1.5

slide-16
SLIDE 16