Russia Finland Reflection on Neighbours Next Door 1 June 2018 - - PowerPoint PPT Presentation

russia finland
SMART_READER_LITE
LIVE PREVIEW

Russia Finland Reflection on Neighbours Next Door 1 June 2018 - - PowerPoint PPT Presentation

Russia Finland Reflection on Neighbours Next Door 1 June 2018 Alexey Igonen, Arturs Polis Ilona Repponen, Miika Lampi Mila Oiva, Victoria Tkachenko Group leaders : Andrey Indukaev, Daria Gritsenko Research question What are the images of


slide-1
SLIDE 1

Russia ⇔ Finland

Reflection on Neighbours Next Door

1 June 2018 Alexey Igonen, Arturs Polis Ilona Repponen, Miika Lampi Mila Oiva, Victoria Tkachenko Group leaders: Andrey Indukaev, Daria Gritsenko

slide-2
SLIDE 2

Research question

What are the images of Finland in the Russian media and Russia in the Finnish media?

slide-3
SLIDE 3

120.000+ articles

Period: 1997-2017 Newspapers: Russian Federal Russian Regional YLE Finland

slide-4
SLIDE 4

Case studies

  • 1. News agenda
  • 2. Dynamic geography
  • 3. Understanding of the ‘neighbour’
slide-5
SLIDE 5

What’s on the agenda?

SPORTS and POLITICS

slide-6
SLIDE 6

Finnish media (YLE)

slide-7
SLIDE 7

Russian media (federal)

Sports national team team match

championship Politics

EU child war Ukraine NATO

slide-8
SLIDE 8

Where things happen?

slide-9
SLIDE 9

Where things happen?

slide-10
SLIDE 10

Where things happen?

slide-11
SLIDE 11

Neighbour/Naapuri/Сосед

What is neighbourhood?

www.iltalehti.fi

slide-12
SLIDE 12

Neighbour/Naapuri/Сосед

That’s where ‘neighbour’ comes all political

slide-13
SLIDE 13

Neighbour/Naapuri/Сосед

That’s where ‘neighbour’ comes all political

slide-14
SLIDE 14

Neighbour/Naapuri/Сосед

That’s where neighbourhood comes all political

slide-15
SLIDE 15

Neighbour/Naapuri/Сосед

Patterns in color

slide-16
SLIDE 16

Computational techniques

slide-17
SLIDE 17

Data Cleaning

json to csv lemmatization -> returning the words to their basic form removing the stop words -> the not meaningful “and”, “or”...

Cincinnati Bell Historical Archives

slide-18
SLIDE 18

Sports and Politics - techniques

Dominant annual agendas TF-IDF The most significant word from each article

slide-19
SLIDE 19

Where things happen? - techniques

slide-20
SLIDE 20

Understanding the ‘neighbour’ - techniques

W2V library nearby words - “use-synonyms” 5-year window clustering of nearby words

slide-21
SLIDE 21

Challenges and Limitations

Wordclouds, Method: TF-IDF

  • Lemmatization
  • Timestamps
  • Running TF-IDF on individual articles VS combined yearly data
  • 1 pass VS 2 pass TF-IDF
  • Can miss short-lived keywords!
slide-22
SLIDE 22

Challenges and Limitations

Geo Mapping and Topic Modelling, Method: POI mapping, STM

  • Lemmatization
  • Place name transliteration and disambiguation
  • Selecting the number of topics and topic clusters
  • Ambiguous articles and topics
slide-23
SLIDE 23

Challenges and Limitations

Word neighbourhoods, Method: Word2Vec

  • Timestamps and Lemmatization
  • Quantity of data for shorter periods (5 years at a time)
  • Picking the right dimensionality and threshold
  • Reading too much into it!
slide-24
SLIDE 24

Ideas for future research

Yearly words per topic - sports, culture, economics Compare seemingly similar concepts across languages - same or not? Causality and sentiment analysis - connect events to sentiment

slide-25
SLIDE 25

Public outreach during the hackathon

slide-26
SLIDE 26

Спасибо! Kiitos! Thank you! Questions?