recommendation system for opinion articles in turkish
play

Recommendation System for Opinion Articles in Turkish Newspapers - PowerPoint PPT Presentation

Recommendation System for Opinion Articles in Turkish Newspapers stn zgr System Components Article Metadata Scraper Article Metadata Consumer Article Text Extractor Article Text Analyzer Article Metadata Scraper


  1. Recommendation System for Opinion Articles in Turkish Newspapers Üstün Özgür

  2. System Components ● Article Metadata Scraper ● Article Metadata Consumer ● Article Text Extractor ● Article Text Analyzer

  3. Article Metadata Scraper ● Article Metadata Consumer ● Article Text Scraper ● Article Text Analyzer

  4. Article Metadata Scraper

  5. Article Metadata Scraper (contd) ● Rewritten in node.js ● Due to impedance mismatch between developer tools an Python ● Outputs a JSON document containing an array of documents ● Each document has several metadata, such as author name, newspaper name, article link

  6. ● Article Metadata Consumer ● Existing Python codebase modified ● Data stored in RDMS ● Just consumes incoming data ● “Dumb” on purpose

  7. ● Article Text Extractor ● Consumes either the output of metadata scraper (currently implemented) or metadata consumer ● Separate scrapers for each article content

  8. ● Article Text Analyzer

  9. Demo ● http://localhost:3000/yazi-short/286 ● http://localhost:3000/yazi-short/100 http://localhost:3000/yazi-short/3

  10. Remaining Work ● More sophisticated comparison methods ● Other similarity measures ● Most common words and phrases for categorization – Documents containing those

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend