The Uninstrumentable; Getting Apache Spark and Prometheus to Play - - PowerPoint PPT Presentation

the uninstrumentable getting apache spark and prometheus
SMART_READER_LITE
LIVE PREVIEW

The Uninstrumentable; Getting Apache Spark and Prometheus to Play - - PowerPoint PPT Presentation

The Uninstrumentable; Getting Apache Spark and Prometheus to Play Nicely DAN RATHBONE & JOE STRINGER PROMCON 2017, AUGUST 2017 2 Healthcare data processing system


slide-1
SLIDE 1

DAN RATHBONE & JOE STRINGER PROMCON 2017, AUGUST 2017

The Uninstrumentable; Getting Apache Spark and Prometheus to Play Nicely

slide-2
SLIDE 2

2

slide-3
SLIDE 3
slide-4
SLIDE 4
slide-5
SLIDE 5
  • Healthcare data processing system using Apache PySpark
  • Failed attempts and the crazy ideas that followed
  • Actually working with lots of pretty graphs
slide-6
SLIDE 6

6

slide-7
SLIDE 7

https://cwiki.apache.org/confluence/display/SPARK/PySpark+Internals

7

slide-8
SLIDE 8

8

https://cwiki.apache.org/confluence/display/SPARK/PySpark+Internals

slide-9
SLIDE 9

“Occasionally you will need to monitor components which cannot be

  • scraped. They might live behind a firewall, or they might be too

short-lived to expose data reliably via the pull model. The Prometheus Pushgateway allows you to push time series from these components to an intermediary job which Prometheus can scrape.”

9

slide-10
SLIDE 10

10

“The Pushgateway is explicitly not an aggregator

  • r distributed counter but rather a metrics cache”
slide-11
SLIDE 11

11

slide-12
SLIDE 12

12

slide-13
SLIDE 13
slide-14
SLIDE 14

14

slide-15
SLIDE 15

15

slide-16
SLIDE 16

16

slide-17
SLIDE 17
slide-18
SLIDE 18
slide-19
SLIDE 19
slide-20
SLIDE 20
slide-21
SLIDE 21
slide-22
SLIDE 22
  • Realtime is worth the effort, visibility is key
  • Nothing’s uninstrumentable
  • The solution is often quite simple
  • Prometheus is pretty flexible
slide-23
SLIDE 23

JOE STRINGER

@joeds13

DAN RATHBONE

@thetrilemma

www.infinityworks.com