Recommendation System for Opinion Articles in Turkish Newspapers - - PowerPoint PPT Presentation

recommendation system for opinion articles in turkish
SMART_READER_LITE
LIVE PREVIEW

Recommendation System for Opinion Articles in Turkish Newspapers - - PowerPoint PPT Presentation

Recommendation System for Opinion Articles in Turkish Newspapers stn zgr System Components Article Metadata Scraper Article Metadata Consumer Article Text Extractor Article Text Analyzer Article Metadata Scraper


slide-1
SLIDE 1

Recommendation System for Opinion Articles in Turkish Newspapers Üstün Özgür

slide-2
SLIDE 2

System Components

  • Article Metadata Scraper
  • Article Metadata Consumer
  • Article Text Extractor
  • Article Text Analyzer
slide-3
SLIDE 3

Article Metadata Scraper

  • Article Metadata Consumer
  • Article Text Scraper
  • Article Text Analyzer
slide-4
SLIDE 4

Article Metadata Scraper

slide-5
SLIDE 5

Article Metadata Scraper (contd)

  • Rewritten in node.js
  • Due to impedance mismatch between

developer tools an Python

  • Outputs a JSON document containing an array
  • f documents
  • Each document has several metadata, such as

author name, newspaper name, article link

slide-6
SLIDE 6
slide-7
SLIDE 7
  • Article Metadata Consumer
  • Existing Python codebase modified
  • Data stored in RDMS
  • Just consumes incoming data
  • “Dumb” on purpose
slide-8
SLIDE 8
  • Article Text Extractor
  • Consumes either the output of metadata

scraper (currently implemented) or metadata consumer

  • Separate scrapers for each article content
slide-9
SLIDE 9
slide-10
SLIDE 10
  • Article Text Analyzer
slide-11
SLIDE 11

Demo

  • http://localhost:3000/yazi-short/286
  • http://localhost:3000/yazi-short/100

http://localhost:3000/yazi-short/3

slide-12
SLIDE 12

Remaining Work

  • More sophisticated comparison methods
  • Other similarity measures
  • Most common words and phrases for

categorization

– Documents containing those