High throughput High throughput kafka for science kafka for science
Testing Kafka’s limits for science
J Wyngaard, PhD wyngaard@jpl.nasa.gov
High throughput High throughput kafka for science kafka for - - PowerPoint PPT Presentation
High throughput High throughput kafka for science kafka for science Testing Kafkas limits for science J Wyngaard, PhD wyngaard@jpl.nasa.gov UTLINE O UTLINE O Streaming Science Data Benchmark Context Tests and Results
J Wyngaard, PhD wyngaard@jpl.nasa.gov
SOODT, Kafka, Science data streams
– Apache OODT – Apache Tika – Apache Hadoop – Apache Kafka – Apache Mesos – Apache Spark – ...so many more...
– Satellite data ~5GB/day
– Antenna arrays ~4Tbps >>1K 10Gbps
– ~5GB files, 0.5TB per flight
Broker Cluster . Consumer Nodes . T00, T01, T02, T03 G0 G1 . T00, T01, T02, T03 T00, T01, T02, T03 G2 G3 . Producer nodes . P0- Topic0 P1- Topic1 P2- Topic2 P3- Topic0 P4- Topic1 P5- Topic2
– 1024 stations
Artists' impression of LFAA, SKA image
https://www.skatelescope.org/multimedia/image/l
Reality check – kafka was not designed for this
– 96 nodes
– Infniband FDR and 40 Gb/s
– 0.5PB NAND Flash
https://engineering.linkedin.com/kafka/benchm arking-apache-kafka-2-million-writes-second- three-cheap-machines
– Off-the shelf
– Untuned default
– 6 core 2.5GHz
– ~ 100 IOPS
– 1Gb Ethernet
– 2x 12core 2.5GHz
– >200 IOPS flash – 128GB RAM – 40Gb Ethernet
– 3 Broker nodes – 3 Zookeeper, Consumer, Producer nodes
Producer nodes . Broker Cluster . . . . P0- Topic0 T00, T01, T02, T03, T04, T05 Consumer Nodes .
~100MBps at 100KB message sizesec
– Single producer thread, 3x replication, 1 partition
– 0.59Gbps
– 0.31 Gbps
– Three producers, 3x asynchronous replication
– 1.51 MB/sec < 3*0.59 = 1.77
Limits
– Bigger messages – No replication – In node paralleism – Big Buffers – Large Batches
B KB MB
Averaged throughput over changing message size
Sustainable
– Averager 6.49Gbps (8000 messages)
–Averager 2.6Gbps (8000 messages)
– Averager 1.2Gbps (8000 messages)
System
And where to from here
– ~10MB messages – 3 Producers matching 3 consumers/consumer
– (Potential ordering and chunking overheads)
– (not suitable in many applications)
Taget
– Shared files system – tmpfs – Scale
– User space installs only – SLURM
– Queuing for time – Loading cost and impremanance of data – Stability of Kafka / Other users interferring - ?
– Develop in your destination environment – Flash Storage makes life easy
– Lustr…
– Research & Technology Development: “Archiving,
– This work used the Extreme Science and Engineering
– "XSEDE: Accelerating Scientific Discovery"