SLIDE 1

High Throughput Kafka for Science

Testing Kafka’s limits for science

J Wyngaard, PhD wyngaard@jpl.nasa.gov

SLIDE 2

OUTLINE

  • Streaming Science Data
  • Benchmark Context
  • Tests and Results
  • Conclusions
SLIDE 3
  • Streaming Science Data
  • Benchmark Context
  • Tests and Results
  • Conclusions

Streaming Science Data

SOODT, Kafka, Science data streams

SLIDE 4

DIA GROUP

  • Using open source tools extensively to enable JPL scientists to handle their big data:
    – Apache OODT
    – Apache Tika
    – Apache Hadoop
    – Apache Kafka
    – Apache Mesos
    – Apache Spark
    – ...so many more...

SLIDE 5

SCIENCE DATA

  • Earth Science
    – Satellite data ~5 GB/day
  • Radio Astronomy
    – Antenna arrays ~4 Tbps, >>1K 10 Gbps links
  • Airborne missions
    – ~5 GB files, 0.5 TB per flight
  • Bioinformatics
SLIDE 6

STREAMING SOODT

SLIDE 7

APACHE KAFKA

[Diagram: producer nodes P0-P5 publishing to Topic0-Topic2, a broker cluster holding topic partitions T00-T03, and consumer groups G0-G3 on consumer nodes]
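The diagram is only sketched on the slide; as a rough illustration of the same topology in the Kafka Java client API (the broker addresses are hypothetical, and the topic and group names just follow the diagram's labels), a producer publishing to Topic0 and a consumer reading it as a member of group G0 could look like this:

```java
import java.time.Duration;
import java.util.List;
import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;

public class TopologySketch {
    public static void main(String[] args) {
        // Producer node P0 publishing to Topic0 on the broker cluster.
        Properties prodProps = new Properties();
        prodProps.put("bootstrap.servers", "broker1:9092,broker2:9092,broker3:9092"); // hypothetical brokers
        prodProps.put("key.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        prodProps.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        try (KafkaProducer<String, String> producer = new KafkaProducer<>(prodProps)) {
            // The brokers spread Topic0 across partitions (T00..T03 in the diagram).
            producer.send(new ProducerRecord<>("Topic0", "key", "science-record"));
        }

        // A consumer node in group G0; Topic0's partitions are divided among G0's members.
        Properties consProps = new Properties();
        consProps.put("bootstrap.servers", "broker1:9092,broker2:9092,broker3:9092");
        consProps.put("group.id", "G0");
        consProps.put("key.deserializer", "org.apache.kafka.common.serialization.StringDeserializer");
        consProps.put("value.deserializer", "org.apache.kafka.common.serialization.StringDeserializer");
        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(consProps)) {
            consumer.subscribe(List.of("Topic0"));
            ConsumerRecords<String, String> records = consumer.poll(Duration.ofSeconds(1));
            for (ConsumerRecord<String, String> r : records) {
                System.out.printf("partition %d offset %d: %s%n", r.partition(), r.offset(), r.value());
            }
        }
    }
}
```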

SLIDE 8

10G X 1024 - ?

  • Low Frequency Aperture Array:
  • 0.25M antennas
    – 1024 stations
  • 16 processing modules
  • = 4 Tbps from 1024 stations at 10 Gbps each

Artist's impression of LFAA, SKA image:
https://www.skatelescope.org/multimedia/image/low-frequency-array-ska-wide-field/
SLIDE 9
  • Streaming Science Data
  • Benchmark Context
  • Tests and Results
  • Conclusions

Benchmark Context

Reality check – Kafka was not designed for this

SLIDE 10
TACC WRANGLER

  • Primary system
    – 96 nodes
      • 24-core Haswells
      • 128 GB RAM
    – InfiniBand FDR and 40 Gb/s Ethernet connectivity
    – 0.5 PB NAND flash
      • 1 Tbps
      • >200 million IOPS
  • A 24-node replica cluster resides at Indiana University, connected by a 100 Gb/s link

SLIDE 11

“LAZY” BENCHMARKING

https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines

  • “Lazy” being:
    – Off-the-shelf cheap hardware
    – Untuned default configuration

SLIDE 12

6 CHEAP MACHINES

  • OTS benchmark
    – 6-core 2.5 GHz Xeons
    – ~100 IOPS hard drives
    – 1 Gb Ethernet
  • Wrangler nodes
    – 2x 12-core 2.5 GHz Xeons
    – >200 IOPS flash
    – 128 GB RAM
    – 40 Gb Ethernet

SLIDE 13

“LAZY” CONFIGURATION

  • Kafka trunk 0.8.1
  • New producer
  • Default configurations
  • Small messages
  • Setup
    – 3 broker nodes
    – 3 ZooKeeper, consumer, and producer nodes
  • Kafka built-in performance tools
SLIDE 14

STRAIGHTLINE “LAZY” SPEED TEST

  • 1 producer
  • 0 consumers
  • 1 topic
  • 6 partitions
  • Replication factor 1 (i.e. no replication)
  • 50M 100 B messages (small, for worst case; see the sketch below)

[Diagram: a single producer node P0 publishing Topic0 to the broker cluster holding partitions T00-T05; no consumer nodes]
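The deck uses Kafka's built-in performance tools for this test and does not show the exact invocation. As a minimal sketch of what the straightline producer test measures, assuming the Java "new producer" API with default ("lazy") settings and the parameters listed above (the broker address and topic name are placeholders):

```java
import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;

public class StraightlineLazyTest {
    public static void main(String[] args) {
        Properties props = new Properties();
        // "Lazy" configuration: only required settings, everything else left at defaults.
        props.put("bootstrap.servers", "broker1:9092");  // hypothetical broker
        props.put("key.serializer", "org.apache.kafka.common.serialization.ByteArraySerializer");
        props.put("value.serializer", "org.apache.kafka.common.serialization.ByteArraySerializer");

        long numRecords = 50_000_000L;   // 50M messages
        byte[] payload = new byte[100];  // 100 B each (small, worst case)

        long start = System.nanoTime();
        try (KafkaProducer<byte[], byte[]> producer = new KafkaProducer<>(props)) {
            for (long i = 0; i < numRecords; i++) {
                producer.send(new ProducerRecord<>("Topic0", payload)); // async send, no consumer running
            }
            producer.flush();
        }
        double seconds = (System.nanoTime() - start) / 1e9;
        double bytes = (double) numRecords * payload.length;
        System.out.printf("%.1f MB/s (%.2f Gbps)%n",
                bytes / 1e6 / seconds, bytes * 8 / 1e9 / seconds);
    }
}
```

Only payload bytes are counted here, consistent with the asterisked caveat on the next slide that network overhead is not accounted for.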

SLIDE 15

STRAIGHTLINE “LAZY” SPEED TEST

  • 1 producer
  • 0 consumers
  • 1 topic
  • 6 partitions
  • Replication factor 1 (i.e. no replication)
  • 50M 100 B messages (small, for worst case)

Results (*network overhead not accounted for):
  • 6 cheap machines: 78.3 MB/s* (0.6 Gbps)
  • Wrangler: 170.27 MB/s* (1.3 Gbps)

SLIDE 16

Δ MESSAGE SIZE

~100 MB/s at 100 KB message size

SLIDE 17

OTHER PARAMETER IMPACTS

  • Replication (see the producer acks sketch below):
    – Single producer thread, 3x replication, 1 partition
      • Asynchronous: 0.59 Gbps
      • Synchronous: 0.31 Gbps
  • Parallelism:
    – Three producers, 3x asynchronous replication, independent machines
      • 1.51 Gbps < 3 x 0.59 = 1.77 Gbps

Reference straightline producer speed: 0.61 Gbps
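The asynchronous vs. synchronous replication figures above come down to how many acknowledgements the producer waits for before counting a message as sent. A minimal sketch of the corresponding Java producer settings (the property values are the standard acks options; mapping them onto the deck's exact test flags is an assumption), with the topic itself created at replication factor 3:

```java
import java.util.Properties;

public class ReplicationAckConfigs {
    // Asynchronous replication: the leader acknowledges as soon as it has the write;
    // followers copy the data in the background.
    static Properties asyncReplication() {
        Properties p = baseProps();
        p.put("acks", "1");
        return p;
    }

    // Synchronous replication: the leader waits for the in-sync replicas before
    // acknowledging, trading throughput (0.59 -> 0.31 Gbps above) for durability.
    static Properties syncReplication() {
        Properties p = baseProps();
        p.put("acks", "all");   // "-1" in older client versions
        return p;
    }

    private static Properties baseProps() {
        Properties p = new Properties();
        p.put("bootstrap.servers", "broker1:9092");  // hypothetical broker
        p.put("key.serializer", "org.apache.kafka.common.serialization.ByteArraySerializer");
        p.put("value.serializer", "org.apache.kafka.common.serialization.ByteArraySerializer");
        return p;
    }
}
```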

SLIDE 18
  • Streaming Science Data
  • Benchmark Setup
  • Wrangler Performance
  • Conclusions

Wrangler Performance

Limits

SLIDE 19

TARGETING 10G

  • 40x network speed
  • 4x core counts
  • 2x IOPS
  • 128x RAM
  • Starting point (see the sketch below):
    – Bigger messages
    – No replication
    – In-node parallelism
    – Big buffers
    – Large batches
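The deck does not list its tuned values; as a sketch of what the starting point above could translate to in new-producer configuration (the specific numbers are illustrative assumptions, not the settings used on Wrangler):

```java
import java.util.Properties;

public class TunedProducerProps {
    static Properties targeting10G() {
        Properties p = new Properties();
        p.put("bootstrap.servers", "broker1:9092");  // hypothetical broker
        p.put("key.serializer", "org.apache.kafka.common.serialization.ByteArraySerializer");
        p.put("value.serializer", "org.apache.kafka.common.serialization.ByteArraySerializer");

        p.put("acks", "1");                                     // leader-only acknowledgement
        p.put("batch.size", Integer.toString(1 << 20));         // large batches (1 MB, illustrative)
        p.put("linger.ms", "50");                               // let batches fill before sending
        p.put("buffer.memory", Long.toString(1L << 30));        // big producer buffer (1 GB, illustrative)
        p.put("max.request.size", Integer.toString(16 << 20));  // allow bigger messages (16 MB, illustrative)
        p.put("compression.type", "none");                      // raw throughput, no CPU spent compressing
        return p;
    }
}
```

"No replication" is a topic-level choice (replication factor 1 at topic creation), and in-node parallelism means running several producer threads per node rather than any single producer setting.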

SLIDE 20

Δ MESSAGE SIZE

[Plot: averaged throughput over changing message size (B, KB, MB), with regions annotated "?" and "Sustainable"]

SLIDE 21

PARTITIONS

  • 3 producers, 1 topic, asynchronous, 3 consumer threads (consumer side sketched below)
    – Average 6.49 Gbps (8000 messages)
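The consumer side of these partition tests relies on Kafka assigning a topic's partitions across the members of one consumer group, so N consumer threads can drain the topic in parallel. A minimal sketch assuming the Java consumer API (broker, group, and topic names are placeholders):

```java
import java.time.Duration;
import java.util.List;
import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;

public class ConsumerThreads {
    public static void main(String[] args) {
        // Three consumers in the same group: the brokers assign each a share
        // of the topic's partitions, so the threads read in parallel.
        for (int i = 0; i < 3; i++) {
            final int id = i;
            new Thread(() -> {
                Properties p = new Properties();
                p.put("bootstrap.servers", "broker1:9092");   // hypothetical broker
                p.put("group.id", "wrangler-bench");          // one shared group
                p.put("key.deserializer", "org.apache.kafka.common.serialization.ByteArrayDeserializer");
                p.put("value.deserializer", "org.apache.kafka.common.serialization.ByteArrayDeserializer");
                try (KafkaConsumer<byte[], byte[]> consumer = new KafkaConsumer<>(p)) {
                    consumer.subscribe(List.of("Topic0"));
                    long count = 0;
                    for (int polls = 0; polls < 1000; polls++) {  // bounded run for the sketch
                        ConsumerRecords<byte[], byte[]> records = consumer.poll(Duration.ofMillis(100));
                        count += records.count();
                    }
                    System.out.printf("consumer %d read %d records%n", id, count);
                }
            }, "consumer-" + i).start();
        }
    }
}
```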

SLIDE 22

PARTITIONS

  • 6 producers, 1 topic, asynchronous, 6 consumer threads
    – Average 2.6 Gbps (8000 messages)

SLIDE 23

PARTITIONS

  • 6 producers, 1 topic, asynchronous, 6 consumer threads, and 6 brokers
    – Average 1.2 Gbps (8000 messages)

SLIDE 24
  • Context
  • TACC Wrangler Data Analysis System
  • Benchmark Setup
  • Tests and Results
  • Conclusions

Conclusions

And where to from here

SLIDE 25

TARGETING 10G

  • Apparent optimum for a single-node producer on this hardware:
    – ~10 MB messages
    – 3 producers matching 3 consumers / consumer threads
  • More brokers, producers, or consumers are detrimental
  • 6.49 Gbps < 10 Gbps
SLIDE 26

ALTERNATIVE AVENUES

  • Parallelism - multiple topics, if this is tolerable
    – (Potential ordering and chunking overheads)
  • In a shared file system environment, perhaps file pointers rather than files should be moved
    – (Not suitable in many applications)
  • Nothing to be gained from better hardware

SLIDE 27

HPC PRODUCTION CLUSTER ENVIRONMENT

  • Pros:
    – Shared file system
    – tmpfs
    – Scale
  • Cons:
    – User-space installs only
    – SLURM
      • idev
      • Jobs time out, losing configurations and leaving a mess
    – Queuing for time
    – Loading cost and impermanence of data
    – Stability of Kafka / other users interfering - ?

SLIDE 28

HPC PRODUCTION CLUSTER ENVIRONMENT

  • Lessons learned:
    – Develop in your destination environment
    – Flash storage makes life easy
      • Caveat - it is wiped when your reservation runs out
    – Lustre…
  • No battle scars - credit to the XSEDE Wrangler management team and the Kafka builders

SLIDE 29

REFERENCES

  • Benchmarking Apache Kafka: 2 Million Writes Per Second (On Three Cheap Machines)
    https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines

SLIDE 30

ACKNOWLEDGEMENTS

  • NASA Jet Propulsion Laboratory
    – Research & Technology Development: “Archiving, Processing and Dissemination for the Big Data Era”
  • XSEDE
    – This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number ACI-1053575.
    – "XSEDE: Accelerating Scientific Discovery": John Towns, Timothy Cockerill, Maytal Dahan, Ian Foster, Kelly Gaither, Andrew Grimshaw, Victor Hazlewood, Scott Lathrop, Dave Lifka, Gregory D. Peterson, Ralph Roskies, J. Ray Scott, Nancy Wilkins-Diehr, Computing in Science & Engineering, vol. 16, no. 5, pp. 62-74, Sept.-Oct. 2014, doi:10.1109/MCSE.2014.80