PRESIDENTIAL CAMPAIGN Josemar Alves Caetano, Jussara Almeida, - - PowerPoint PPT Presentation
PRESIDENTIAL CAMPAIGN Josemar Alves Caetano, Jussara Almeida, - - PowerPoint PPT Presentation
CHARACTERIZING POLITICALLY ENGAGED USERS' BEHAVIOR DURING THE 2016 US PRESIDENTIAL CAMPAIGN Josemar Alves Caetano, Jussara Almeida, Humberto Torres Marques-Neto ASONAM 2018 2 Social networks and political campaings ASONAM 2018 3 Reach of
Social networks and political campaings
ASONAM 2018
2
Reach of the candidates on Twitter (election day)
17 million followers 35 thousand published tweets 12 million followers 9 thousand published tweets
ASONAM 2018
3
ASONAM 2018
4
Political biases on social networks
Advocates
ASONAM 2018
5
Political biases on social networks
Other political groups
Political Bots Regular Users
ASONAM 2018
6
Characterize users in an online social network taking into account political biases and therefore different behaviors
ASONAM 2018
7
Main objective
Characterizations
Which features highlight each group Language Patterns Analysis Mood Variation Analysis Popular users
- f each group
ASONAM 2018
8
Feature characterization
ASONAM 2018
9
Which features highlight each group Language Patterns Analysis Mood Variation Analysis Popular users
- f each group
Language characterization
ASONAM 2018
10
Which features highlight each group Language Patterns Analysis Mood Variation Analysis Popular users
- f each group
Profile characterization
ASONAM 2018
11
Which features highlight each group Language Patterns Analysis Mood Variation Analysis Popular users
- f each group
Mood characterization
ASONAM 2018
12
Which features highlight each group Language Patterns Analysis Mood Variation Analysis Popular users
- f each group
To perform these characterizations…
ASONAM 2018
13
Which features highlight each group Language Patterns Analysis Mood Variation Analysis Popular users
- f each group
Methodology
ASONAM 2018
14
Collecting Twitter Data Identifying Political Tweets Tweet Sentiment Analysis Identifying Politically Engaged User Groups Mood Variation Analysis
Collecting Twitter data
ASONAM 2018
15
Collecting Twitter Data Identifying Political Tweets Tweet Sentiment Analysis Identifying Politically Engaged User Groups Mood Variation Analysis
Data collection process
ASONAM 2018
16
Data collection period
- Data collected over 122 days (August 1st to November 30th 2016)
August 1st Data collection start September 26th First televised debate October 9th Second televised debate October 19th Second televised debate November 8th Election day November 30th Data collection end
ASONAM 2018
17
Dataset
# of tweets 23 mi # of users 115 k # of relationships 1.8 mi
ASONAM 2018
18
Identifying political tweets
ASONAM 2018
19
Collecting Twitter Data Identifying Political Tweets Tweet Sentiment Analysis Identifying Politically Engaged User Groups Mood Variation Analysis
Candidates references considered
Donald Trump Hillary Clinton @realDonaldTrump @HillaryClinton Trump Hillary DT HC
ASONAM 2018
20
Political hashtags
Donald Trump Hillary Clinton 1 #Trump #ImWithHer 2 #MAGA #NeverTrump 3 #TrumpTrain #Hillary 4 #TrumpPence16 #HillaryClinton 5 #DrainTheSwamp #Hillary2016 6 #tcot #UniteBlue 7 #Trump2016 #VoteBlue 8 #GOP #HillaryBecause 9 #PJNET #OHHillYes 10 #cco #HillYes
ASONAM 2018
21
Tweet sentiment analysis
ASONAM 2018
22
Collecting Twitter Data Identifying Political Tweets Tweet Sentiment Analysis Identifying Politically Engaged User Groups Mood Variation Analysis
How sentiment analysis works?
- SentiStrength tool
- Dictionary containing emotional words
- 5
- 1
1 5
ASONAM 2018
23
Political sentiment analysis
I love Hillary Clinton and her ideas. I hate Hillary Clinton and her ideas.
ASONAM 2018
24
Political sentiment analysis problem
I love Hillary Clinton but I hate Donald Trump. I hate Hillary Clinton but I love Donald Trump.
ASONAM 2018
25
Sentiment Analysis Non-political Tweets Whole text Political Tweets Tweets about
- ne candidate
Whole text Tweets about both candidates Words associated with each candidate
ASONAM 2018
26
Sentiment analysis approaches
Sentiment Analysis Non-political Tweets Whole text Political Tweets Tweets about
- ne candidate
Whole text Tweets about both candidates Words associated with each candidate
ASONAM 2018
27
Non-political tweets
Sentiment Analysis Non-political Tweets Whole text Political Tweets Tweets about
- ne candidate
Whole text Tweets about both candidates Words associated with each candidate
ASONAM 2018
28
Political tweets about one candidate
Sentiment Analysis Non-political Tweets Whole text Political Tweets Tweets about
- ne candidate
Whole text Tweets about both candidates Words associated with each candidate
ASONAM 2018
29
Political tweets about both candidates
Identifying words related to candidates
- Stanford Parser tool
- Natural language processor
ASONAM 2018
30
Hillary’s Advocates Trump’s Advocates Political Bots Regular Users
ASONAM 2018
31
Collecting Twitter Data Identifying Political Tweets Tweet Sentiment Analysis Identifying Politically Engaged User Groups Mood Variation Analysis
Identifying politically engaged user groups
Data mining process
ASONAM 2018
32
Features
Removing outliers
ASONAM 2018
33
Eliminated users that did not have political tweets
Features
Identifying political bots
ASONAM 2018
34
BotOrNot
Features
Users with BotOrNot score >= 0.75
Feature set engineering
ASONAM 2018
35
User Metadata 6 Political Bias 11 Sentiment Analysis 17
44 Features
Syntax 10 Features
Identifying Regular Users, Trump’s Advocates, and Hillary’s Advocates
ASONAM 2018
36
Silhouette Index Greedy Selection K-Means
1 3 2 Features
Two steps clustering
Non-political Bots Regular Users Advocates Hillary’s Advocates Trump’s Advocates
ASONAM 2018
37
Identifying regular users and advocates
Non-political Bots Regular Users Advocates Hillary’s Advocates Trump’s Advocates
ASONAM 2018
38
Identifying Trump’s Advocates and Hillary’s Advocates
Non-political Bots Regular Users Advocates Hillary’s Advocates Trump’s Advocates
ASONAM 2018
39
ASONAM 2018
40
Tweet Tweet Tweet 𝑻𝒄 𝒗 Retweet Tweet Tweet Tweet 𝑻𝒃 𝒗
Mood variation analysis
Collecting Twitter Data Identifying Political Tweets Tweet Sentiment Analysis Identifying Politically Engaged User Groups Mood Variation Analysis
- 𝑂𝑞𝑣 𝑢1, 𝑢2 : positive tweets total
- 𝑂𝑜𝑣 𝑢1, 𝑢2 : negative tweets total
- 𝑇𝑣 𝑢1, 𝑢2 : −1 ≤ 𝑇𝑣 ≤ 1
Subjective Well-Being definition
Graduate Program in Informatics
41
𝑇𝑣 𝑢1, 𝑢2 =
𝑂𝑞𝑣 𝑢1,𝑢2 − 𝑂𝑜𝑣 𝑢1,𝑢2 𝑂𝑞𝑣 𝑢1,𝑢2 + 𝑂𝑜𝑣 𝑢1,𝑢2
- 𝑇𝑣 𝑢, 𝑢 + 𝜀 : SWB after retweet
- 𝑇𝑣 𝑢, 𝑢 − 𝜀 : SWB before retweet
- ∆𝑇𝑣 values: −2 ≤ ∆𝑇𝑣≤ 2
Mood variation definition
Graduate Program in Informatics
42
∆𝑇𝑣 = 𝑇𝑣 𝑢, 𝑢 + 𝜀 - 𝑇𝑣 𝑢, 𝑢 − 𝜀
What does it mean?
- 2
- 1
1 2
ASONAM 2018
43
Results
Which features highlight each group Language Patterns Analysis
ASONAM 2018
44
Mood Variation Analysis Popular users of each group
Clustering Regular Users and Advocates
ASONAM 2018
45
Regular Users (70,290) Advocates (40,003) µ σ µ σ political discourse 0.0871 0.4083 0.4614 1.5802 avg number of political hashtags related to Trump per tweet
- 0.0005
0.0088
- 0.0080
0.0297 avg number of political hashtags related to Hillary per tweet
- 0.0066
0.0141
- 0.0318
0.0385 positive/negative bias towards Trump 0.0759 0.0617 0.3431 0.1050 positive/negative bias towards Hillary 0.0833 0.4276 0.6592 2.1534
Sillhouette index: 0.81
Clustering Hillary’s Advocates and Trump’s Advocates
ASONAM 2018
46
Hillary’s Advocates (26,230) Trump’s Advocates (13,733) µ σ µ σ # hashtags in user's description 0.4030 0.1494 0.3516 0.1886 avg number of words per tweet 0.2787 0.1934 0.3429 0.1961 % tweets with some reference to Trump 0.5578 0.2349 0.7702 0.2532 % tweets with some reference to Hillary 0.8355 0.1864 0.6504 0.2624 std of the sentiment score of tweets with some reference to Trump 3.7241 4.8296 7.8192 5.2273 std of the sentiment score of tweets with some reference to Hillary 0.4692 1.4341 0.7009 1.7545
Sillhouette index: 0.72
ASONAM 2018
47
Hillary’s Advocates Trump’s Advocates Regular Users Political Bots
Language patterns
Top 5 Hillary’s Advocates
1 2 3 4 5
ASONAM 2018
48
Top 5 Trump’s Advocates
ASONAM 2018
49
1 2 3 4 5
Top 5 Political Bots
Twitter suspended the top 10 Political Bots accounts
ASONAM 2018
50
Top 5 Regular Users
ASONAM 2018
51
1 2 3 4 5
Mood variation – Hillary’s tweets
ASONAM 2018
52
Mood variation – Trump’s tweets
ASONAM 2018
53
Main contributions
ASONAM 2018
54
- Better understanding of the political engagement of users on online social
networks
- How candidates may influence their voters using Twitter
- How users interact with each other
Future work
ASONAM 2018
55
Questions?
ASONAM 2018
56