CSIR Elections Forecasting 2016 Local Government Elections Zaid - - PowerPoint PPT Presentation

csir elections forecasting
SMART_READER_LITE
LIVE PREVIEW

CSIR Elections Forecasting 2016 Local Government Elections Zaid - - PowerPoint PPT Presentation

CSIR Elections Forecasting 2016 Local Government Elections Zaid Kimmie 28 October 2016 Overview 1. Team Members 2. Some History 3. Why Forecast? 4. Methods: Clustering 5. Methods: Predictions 6. Model Performance 7. What Next? 1


slide-1
SLIDE 1

CSIR Elections Forecasting

2016 Local Government Elections Zaid Kimmie 28 October 2016

slide-2
SLIDE 2

Overview

1. Team Members 2. Some History 3. Why Forecast? 4. Methods: Clustering 5. Methods: Predictions 6. Model Performance 7. What Next?

1

slide-3
SLIDE 3

The CSIR Team

Statisticians, computer scientists, and programmers . . .

  • Peter Schmitz, Jenny Holloway, Nontembeko Dudeni-Thlone
  • Brenwen Ntlangu, Tyrone Naidoo
  • Zaid Kimmie, Ndumiso Cingo and Luyanda Vappie
  • Paul Mokilane, Quintin van Heerden, Sumarie Meintjes
  • Hans Ittmann, Jan Greben, Renee Koen

2

slide-4
SLIDE 4

Some History

  • Worked with IEC for 1999 national 2000 municipal elec-

tions – Checking inconsistencies in voting patterns – Forecasts a “by-product” of this methodology

  • Worked with SABC for all elections since 2004

– Produce a forecast of the final results

3

slide-5
SLIDE 5

Why Forecast?

  • Election results are released by Voting District (VD)

– Some 22,500 VDs in total

  • If the VDs reported in a random order there would be noth-

ing much to do

– The final result would become clear relatively quickly – E.g. 5% of VDs have reported the tabulated results are within a couple of percentage points of their final values

4

slide-6
SLIDE 6

Why Forecast?

  • Fortunately (for us) VDs do not report randomly

– There is in fact a systematic bias in the reporting order – The difference, in the early stages, between the “live” and final results are often substantial – These differences may persist . . .

  • People (including political analysts and the curious member
  • f the public) looking at the “scoreboard” will find that it gives

them very little useful information

5

slide-7
SLIDE 7

4am 9am 2pm 7pm 12am 10am 3pm 8pm 1am 6am 11am 5pm

38 40 42 44 46 48

Time % Vote

ANC – Johannesburg Metro

Final ANC%

slide-8
SLIDE 8

10% 20% 30% 40% 50% 60% 70% 80% 90%

38 40 42 44 46 48

% VDs % Vote

ANC – Johannesburg Metro

Final ANC%

slide-9
SLIDE 9

Why Forecast?

  • In this “window of opportunity” election forecasts can pro-

vide useful insights

– What do the the initial results really mean? – Identify interesting patterns that are emerging

  • The combination of pre-election polling data and exit polls

can get it wrong . . .

– Brexit, UK 2015 general election

8

slide-10
SLIDE 10

Why Forecast?

  • In this “window of opportunity” election forecasts can pro-

vide useful insights

– What do the the initial results really mean? – Identify interesting patterns that are emerging

  • The combination of pre-election polling data and exit polls

can get it wrong . . .

– Brexit, UK 2015 general election

  • It can make you look smarter than you actually are . . .

9

slide-11
SLIDE 11

Why Forecast?

10

slide-12
SLIDE 12

Forecasting Model: Basics

  • Method published by Greben, Elphinstone & Holloway, 2006

in ORiON: The Journal of ORSSA

11

slide-13
SLIDE 13

Forecasting Model: Basics

  • Method published by Greben, Elphinstone & Holloway, 2006

in ORiON: The Journal of ORSSA There are a couple of basic principles:

1. Voters do not randomly allocate their electoral preferences – they are influenced by political, socio-economic and demographic fac- tors, as well as past voting history; 2. Changes in voting behaviour between one election and the next are also not random, but are correlated with past voting behaviour, demographic and socio-economic factors.

12

slide-14
SLIDE 14

Example

Suppose our area of interest consists of 200 VDs, and that in the previous election party A has obtained 70% of the vote in the area, with relatively small variation between VDs

  • When the first VD reports . . .
  • When 10 VDs have reported . . .
  • When 30 VDs have reported . . .

13

slide-15
SLIDE 15

Methods: Clustering

The first step is to create clusters of VDs based on previous voting results.

  • Fuzzy-c-means
  • Fixed number of clusters, c
  • Fuzzy clustering performs better than other methods (k-means,

k-means with discriminant analysis) – smallest prediction error

  • How many clusters?

14

slide-16
SLIDE 16

Methods: Predictions

Two-step process:

  • Estimate turnout for outstanding VDs
  • Assign fuzzy-cluster estimates to VDs

15

slide-17
SLIDE 17

Methods: 2016 Predictions

  • Metro predictions based on provincial clusters
  • This method allowed us to (accurately) predict eThekwini

when no results had been released

  • But this setup can let us down when inter-provincial varia-

tions do occur, as was the case with Tshwane

16

slide-18
SLIDE 18

Model Performance

There are two aspects of model performance – the technical performance of the model and our ability to communicate the model output to the general public

17

slide-19
SLIDE 19

Model Performance

  • Assuming that IT snafus have not rendered us mute . . .
  • Early on the Thursday morning after election day – some-

where between 5am and 9am, when only about 10% of all VDs have reported – we forecast the final results

  • We continually update our forecasts, but the numbers do

not change all that much, and the level of interest in the forecast declines as the “scoreboard” starts to match the final score

18

slide-20
SLIDE 20

How did we do?

  • Pretty well!
  • By 5am on Thursday we identified the major trends well

before they could be inferred just by looking at the data

– That the DA would be the largest party in NMB, but not achieve a majority – That the ANC would lose its majorities in all the Gauteng metros – That the DA would increase its majority in Cape Town – That the ANC would continue to hold a majority in Buffalo City, Mangaung and eThekwini

19

slide-21
SLIDE 21

How did we do?

  • We were able to predict that the ANC’s share of the na-

tional vote would fall to 54%

  • We did not get the final result in Tshwane right – our model

predicted (and continued to predict until quite late into the reporting) that the ANC would be the largest party

  • In general we were able to get within 1.5 percentage points
  • f the final result for the larger parties, and in most cases

within 0.5 percentage points

20

slide-22
SLIDE 22

Forecasts

Metro Party Predicted 5am Final Actual 5am Johannesburg ANC 44.5 44.9 39.5 DA 38.9 38.4 45.3 EFF 10.7 10.9 9.8 Tshwane DA 41.5 43.1 47.0 ANC 42.8 41.5 41.0 EFF 10.7 11.6 7.8 Ekurhuleni ANC 47.8 48.9 38.8 DA 35.8 34.2 50.0 EFF 10.7 11.1 7.8 21

slide-23
SLIDE 23

Forecasts

Metro Party Predicted 5am Final Actual 5am Cape Town DA 65.7 66.8 72.5 ANC 25.1 24.5 18.8 EFF 2.9 3.1 3.1 Nelson Mandela DA 48.3 46.6 58.6 ANC 42.2 41.5 32.9 EFF 3.9 5.0 3.8 eThekwini ANC 58.8 60 DA 27.7 27.5 IFP 4.0 4.3 22

slide-24
SLIDE 24

Johannesburg

4am 9am 2pm 7pm 12am 10am 3pm 8pm 1am 6am 11am 5pm

38 40 42 44 46 48

Time % Vote

ANC – Johannesburg Metro

Prediction Thursday 5am

23

slide-25
SLIDE 25

Johannesburg

4am 9am 2pm 7pm 12am 10am 3pm 8pm 1am 6am 11am 5pm

36 38 40 42 44 46 48

Time % Vote

DA – Johannesburg Metro

Prediction Thursday 5am

24

slide-26
SLIDE 26

Ekurhuleni

3am 8am 1pm 6pm 11pm 8am 1pm 6pm 11pm 4am

30 34 38 42 46 50

Time % Vote

ANC – Ekurhuleni Metro

Prediction Thursday 5am

25

slide-27
SLIDE 27

Ekurhuleni

  • The difference between the predicted and actual ANC vote

count in Ekurhuleni was less than 9,000 votes – just under 900,000 people voted in this metro.

26

slide-28
SLIDE 28

Ekurhuleni

3am 8am 1pm 6pm 11pm 8am 1pm 6pm 11pm 4am

30 34 38 42 46 50

Time % Vote

DA – Ekurhuleni Metro

Prediction Thursday 5am

27

slide-29
SLIDE 29

Ekurhuleni

3am 8am 1pm 6pm 11pm 8am 1pm 6pm 11pm 4am

4 8 12 16 20

Time % Vote

EFF – Ekurhuleni Metro

Prediction Thursday 5am

28

slide-30
SLIDE 30

Nelson Mandela Bay

12am 3am 6am 9am 12pm 3pm 6pm 9pm 12am

40 45 50 55 60 65

Time % Vote

DA – Nelson Mandela Bay

Prediction Thursday 5am

29

slide-31
SLIDE 31

Nelson Mandela Bay

12am 3am 6am 9am 12pm 3pm 6pm 9pm 12am

30 35 40 45 50

Time % Vote

ANC – Nelson Mandela Bay

Prediction Thursday 5am

30

slide-32
SLIDE 32

Nelson Mandela Bay

12am 3am 6am 9am 12pm 3pm 6pm 9pm 12am

4 8 12 16

Time % Vote

EFF – Nelson Mandela Bay

Prediction Thursday 5am

31

slide-33
SLIDE 33

Cape Town

12am 4am 8am 12pm 4pm 8pm 1am

55 60 65 70 75

Time % Vote

DA – Cape Town

Prediction Thursday 5am

32

slide-34
SLIDE 34

Ethekwini

3pm 6pm 9pm 9am 12pm 3pm 6pm

50 55 60 65 70

Time % Vote

ANC – eThekwini

Prediction Thursday 5am

33

slide-35
SLIDE 35

Tshwane

12am 4am 8am 12pm 4pm 8pm 12am 10am 2pm 6pm 10pm 2am 6am 11am

35 40 45 50

Time % Vote

ANC – Tshwane

Prediction Thursday 5am Prediction Friday 8am

34

slide-36
SLIDE 36

Tshwane

12am 4am 8am 12pm 4pm 8pm 12am 10am 2pm 6pm 10pm 2am 6am 11am

35 40 45 50

Time % Vote

DA – Tshwane

Prediction Thursday 5am Prediction Friday 8am

35

slide-37
SLIDE 37

Cluster Comparisons

36

slide-38
SLIDE 38

Cluster Comparisons

37

slide-39
SLIDE 39

What Next?

  • Running multiple models with different clustering options
  • Improved diagnostics
  • 2019 SA National Election
  • 2016 US Election . . .
  • 2020 UK Election

38