using naiad to analyze twitter data in batch and real time
play

Using Naiad to Analyze Twitter Data in Batch and Real-time George - PowerPoint PPT Presentation

Using Naiad to Analyze Twitter Data in Batch and Real-time George Wort University of Cambridge 2017 Naiad Timely Dataflow System. Batch Processing. Stream Processing. Graph Processing. Supports iterative and incremental


  1. Using Naiad to Analyze Twitter Data in Batch and Real-time George Wort University of Cambridge 2017

  2. Naiad • Timely Dataflow System. • Batch Processing. • Stream Processing. • Graph Processing. • Supports iterative and incremental data analysis. • Low latency. • High throughput.

  3. Naiad • Complex system offering a lot of options. • Too complex for most applications? • Overheads and ease of use? • Additions: • Differential Dataflow. • GraphLINQ.

  4. Twitter Data Processing • Implement real-time and batch processing of tweet stream. • Geographically categorise word frequencies. • Allow selection of different levels of granularity. • Query geographical data. • Extend to allow similarity comparison between areas or cluster areas in batch. • Extend to view frequency of spelling mistakes in English.

  5. Assessment • Implement on a single machine and distributed environment. • Using: • The base Naiad system. • Differential dataflow. • GraphLINQ. • Assessing: • Ease of use. • Flexibility. • Latency. • Throughput.

  6. Questions?

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend