nov 7 douwe osinga dosinga smart big data what is smart
play

Nov 7, Douwe Osinga, @dosinga Smart Big Data What is Smart Travel - PowerPoint PPT Presentation

Nov 7, Douwe Osinga, @dosinga Smart Big Data What is Smart Travel Guides Algorithm based. Covering the entire world Suggestions Nearby Start with the web Put it back together Push it to the users Stu fg nearby Weather based Weather data


  1. Nov 7, Douwe Osinga, @dosinga

  2. Smart Big Data

  3. What is

  4. Smart Travel Guides

  5. Algorithm based.

  6. Covering the entire world

  7. Suggestions

  8. Nearby

  9. Start with the web

  10. Put it back together

  11. Push it to the users

  12. Stu fg nearby

  13. Weather based

  14. Weather data

  15. Usage on a sunny day

  16. Pictures on a rainy day

  17. Weather suggestions

  18. Time based

  19. Users keep time

  20. Spider the web at large

  21. Opinion mining

  22. Time based

  23. Done!

  24. Thanks! @dosinga

  25. Building data-intensive services (aka. immutability and idempotence) @knutin GameAnalytics

  26. Instrument your game to send events on user action, such as log in, purchase, level up etc. � Analyse game performance with UI. � Improve game.

  27. SDK Collection 
 API Log Stream 
 Funnels … analytics User

  28. 15M devices daily ‣ 3B events per day (35k per second) ‣ 750 GB uncompressed ‣

  29. Lesson 1 Store events in a log (immutability)

  30. Lesson 1 0 1 2 3 4 Log: immutable, write by appending ‣ Split producer & consumers ‣ High-availability write path (S3) ‣

  31. Lesson 1 producer 0 1 2 3 4 5 Log: immutable, write by appending ‣ Split producer & consumers ‣ High-availability write path (S3) ‣

  32. Lesson 1 producer 0 1 2 3 4 5 consumer Log: immutable, write by appending ‣ Split producer & consumers ‣ High-availability write path (S3) ‣

  33. Lesson 2 If you mess up, redo it (idempotency)

  34. Lesson 2

  35. Lesson 2 get_checkpoint() return “2014-10-01” ‣

  36. Lesson 2 get_checkpoint() return “2014-10-01” ‣ Process events from log o ff set “2014-10-01” to ‣ log o ff set “2014-10-02”

  37. Lesson 2 get_checkpoint() return “2014-10-01” ‣ Process events from log o ff set “2014-10-01” to ‣ log o ff set “2014-10-02” When all messages for 2014-10-01 are ‣ processed, write to DB, overwrite any existing data (idempotence)

  38. Lesson 2 get_checkpoint() return “2014-10-01” ‣ Process events from log o ff set “2014-10-01” to ‣ log o ff set “2014-10-02” When all messages for 2014-10-01 are ‣ processed, write to DB, overwrite any existing data (idempotence) set_checkpoint(“2014-10-02”) ‣

  39. Where can I get one? Apacha Samza! ‣ Does everything we do and much more ‣ Released after we went live … :/ ‣

  40. Thank you

  41. Q & A

  42. Building data-intensive services (aka. immutability and idempotence) @knutin GameAnalytics

  43. Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  44. Why you don’t want your realtime analytics to be exact Mikio Braun, TU Berlin/streamdrill @mikiobraun GOTO Berlin, Nov 7, 2014 Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  45. Analyzing User Interaction Scale Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  46. What can we do besides scaling? Approximate? But is that ok? Do we really want our analytics to be exact? Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  47. Why you don't want your real-time analytics to be exact 1. Results are changing all the time anyway. 2. You can't have exactness, real-time, and big data at the same time (or it costs a lot). 3. Exactness is often not necessary. 4. You probably already have a batch system in place. Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  48. Reason 2: You can't have exactness, real-time, and big data at the same time (or it costs a lot) Real-Time Exactness Big Data http://www.slideshare.net/acunu/realtime-analytics-with-casaandra Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  49. Why you don't want your real-time analytics to be exact 1. Results are changing all the time anyway. 2. You can't have exactness, real-time, and big data at the same time (or it costs a lot). 3. Exactness is often not necessary. 4. You probably already have a batch system in place. Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  50. Reason 3: Exactness is often not necessary Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  51. Why you don't want your real-time analytics to be exact 1. Results are changing all the time anyway. 2. You can't have exactness, real-time, and big data at the same time (or it costs a lot). 3. Exactness is often not necessary. 4. You probably already have a batch system in place. Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  52. streamdrill ● Core Engine ● Features – true real-time, low latency (ms) – approximative – Dashboard & REST interface counting and trends – about 20 events/sec, track 1M – rolling time windows objects/1GB RAM based on exponential ● Applications decay – real-time user profiling – secondary indices – recommendation – ... Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  53. Dashboard Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  54. Trend view Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  55. Real-time Recommendation at serienjunkies.de Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  56. Realtime User Profiles Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  57. Realtime User Profiles Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  58. Realtime User Profiles Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  59. Summary ● real-time doesn't have to be exact ● streamdrill: real-time analytics plattform ● Contact us at info@streamdrill.com if you're interested in – real-time profiling – real-time recommendation – anything else real-time related! Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

  60. Mikio Braun Why real-time analytics don't have to be exact (c) 2014 streamdrill

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend