SLIDE 1 Index-Free Log Analytics with Kafka
Kresten Krab Thorup, Humio CTO
Log Everything, Answer Anything, In Real-Time.
SLIDE 2 Log Analytics Wish List
- Record everything - TBs of data per day
- Interactive/ad-hoc search on historic data - 100s of TB
- Generate metrics and alerts from the logs in real-time
- Can be installed on-premises (privacy / security)
- Affordable - TCO (hardware, license, operations)
SLIDE 3 Humio
Data Driven SecOps
[Diagram: 30k PCs, BRO network sensors, 6 ADs, 2k servers → CEP / Log Store → Alerts/dashboards → Incident Response; ~1M events/sec, 20TB/day]
SLIDE 4 Put Logs in an Index?
[Diagram: DATA vs INDEX size, at low volume vs high volume]
SLIDE 5 Index-Free
[Diagram: DATA only, no index; low volume vs high volume]
SLIDE 6 Index-Free
[Diagram: the data stored as a sequence of DATA segments; low volume vs high volume]
SLIDE 8 Index-Free
[Diagram: DATA segments laid out along TIME; time itself is the “INDEX” we want]
SLIDE 9 Stream Query
[Diagram: incoming events run through Stream Queries feeding ALERTS & DASHBOARD; the stored DATA segments serve Ad-Hoc Queries]
SLIDE 10
[Diagram: a query such as /error/i | count() compiles to a State Machine, run both against the live stream (count: 473) and against the Event Store (count: 243,565)]
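To make the picture concrete, here is a minimal sketch in Java (illustrative names, not Humio's actual engine): a query like /error/i | count() becomes a tiny state machine, a predicate plus an accumulator, that can be fed either live events or a scan of stored segments.

```java
import java.util.List;
import java.util.regex.Pattern;

// Illustrative only: `/error/i | count()` compiled to a filter + aggregate
// "state machine" that consumes one event at a time.
final class CountQuery {
    private final Pattern filter = Pattern.compile("error", Pattern.CASE_INSENSITIVE);
    private long count = 0;

    // Feed events from the live stream or from a stored-segment scan.
    void onEvent(String rawEvent) {
        if (filter.matcher(rawEvent).find()) {
            count++;
        }
    }

    long result() { return count; }

    public static void main(String[] args) {
        CountQuery q = new CountQuery();
        List.of("GET /index 200", "ERROR: disk full", "error: retrying")
            .forEach(q::onEvent);
        System.out.println(q.result()); // prints 2
    }
}
```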
SLIDE 11 Log Store Design
- Build a minimal index and compress the data: store an order of magnitude more events
- Fast “grep” for filtering events: filtering plus time/metadata selection reduces the problem space
SLIDE 12 Event Store
[Diagram: a sequence of 10GB segment files, each tagged with (start-time, end-time, metadata)]
SLIDE 13 Event Store
[Diagram: each 10GB segment compresses to a 1GB file with (start-time, end-time, metadata); a ~40MB Bloom filter per segment adds +4% overhead]
- 1 month × 30GB/day ingest: 90GB data, <1MB index
- 1 month × 1TB/day ingest: 4TB data, <1MB index
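To spell out the arithmetic behind those figures (a rough reading; the compression ratios are implied rather than stated on the slide): 30 GB/day × 30 days = 900 GB raw, stored as ~90 GB, i.e. roughly 10× compression; 1 TB/day × 30 days = 30 TB raw, stored as ~4 TB, roughly 7.5×. The Bloom filters cost about 40 MB per 1 GB segment, which is the quoted +4% overhead.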
SLIDE 14 Query
[Diagram: per-datasource rows of 1GB segments laid out along the time axis; datasources tagged #dc1,#web / #dc1,#app / #dc2,#web]
SLIDE 15 Query
[Diagram: a query's tag and time selection narrows the scan to ~10GB of segments out of the whole store]
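A minimal sketch of that selection step (illustrative types, not Humio's API): keep only segments whose tags match the query and whose time span overlaps the query window; only those are decompressed and scanned.

```java
import java.util.List;
import java.util.Map;

// Illustrative segment metadata: time span plus datasource tags.
record Segment(long startTime, long endTime, Map<String, String> tags, String path) {}

final class SegmentSelector {
    // Keep segments whose time span overlaps [from, to] and whose tags
    // match the query; everything else is never read from disk.
    static List<Segment> select(List<Segment> all, Map<String, String> queryTags,
                                long from, long to) {
        return all.stream()
            .filter(s -> s.startTime() <= to && s.endTime() >= from)
            .filter(s -> queryTags.entrySet().stream()
                .allMatch(e -> e.getValue().equals(s.tags().get(e.getKey()))))
            .toList();
    }
}
```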
SLIDE 16 Real-time Processing + Brute-Force Search
- Real-time processing for dashboards/alerts and “known” queries: the incoming data is in-memory anyway
- Brute-force search at query time: data compression; filtering, not indexing
- Requires “full stack” ownership to perform
#IndexFreeLogging
SLIDE 17 Humio Ingest Data Flow
Agent → API/Ingest → Digest → Storage
- Agent: send data (HTTP/TCP API)
- API/Ingest: authenticate, field extraction
- Digest: live queries (alerts/dashboards), write segment files
- Storage: replication
SLIDE 18 Use Kafka for the ‘hard parts’
- Coordination
- Commit-log / ingest buffer
- No KSQL
SLIDE 19 Kafka 101
- Kafka is a reliable distributed log/queue system
- A Kafka queue consists of a number of partitions
- Messages within a partition are sequenced
- Partitions are replicated for durability
- Use ‘partition consumers’ to parallelise work
SLIDE 20 Kafka 101
[Diagram: a producer writes to a topic with partitions #1–#3, partition = hash(key); consumers each read a subset of partitions]
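A minimal sketch with the standard Java client (the broker address and topic name are placeholders): records with the same key hash to the same partition, so per-key ordering is preserved.

```java
import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;

public class Kafka101 {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092"); // placeholder
        props.put("key.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        props.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        try (var producer = new KafkaProducer<String, String>(props)) {
            // Same key => same partition (the default partitioner hashes the
            // key), so events for one datasource stay in order.
            producer.send(new ProducerRecord<>("ingest", "datasource-42", "log line"));
        }
    }
}
```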
SLIDE 21 Coordination ‘global data’
- Zookeeper-like system in-process
- All cluster nodes keep the entire K/V set in memory
- Make decisions locally/fast without crossing a network boundary
- Allows in-memory indexes of metadata
SLIDE 22 Coordination ‘global data’
- Coordinated via single-partition Kafka queue
- Ops-based CRDT-style event sourcing
- Bootstrap from snapshot from any node
- Kafka config: low latency
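A hedged sketch of the idea (illustrative, not Humio's code): every node tails the single-partition topic and applies each op to an in-memory map, so all nodes deterministically converge on the same ‘global data’.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Illustrative: ops arrive in one total order (a single Kafka partition),
// so replaying them rebuilds the same K/V state on every node.
final class GlobalData {
    private final Map<String, String> state = new ConcurrentHashMap<>();

    // Called for each record consumed from the single-partition topic.
    void apply(String op, String key, String value) {
        switch (op) {
            case "put" -> state.put(key, value);
            case "delete" -> state.remove(key);
            default -> { /* ignore unknown ops for forward compatibility */ }
        }
    }

    String get(String key) { return state.get(key); }
}
```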
SLIDE 23 Durability
- Don’t lose people’s data.
- Control and manage data life expectancy
- Store, Replicate, Archive, Multi-tier Data storage
SLIDE 24 Durability
Agent → Ingest → Kafka → Digest → Storage
- Agent: send data
- Ingest: authenticate, field extraction
- Digest: streaming queries, write segment files
- Storage: replication, queries on ‘old data’
SLIDE 25 Durability
[Diagram: Agent → API/Ingest → Kafka; the HTTP 200 response is sent only once Kafka has ACK’ed the store]
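A hedged sketch of that contract with the Java client (the topic and the HTTP layer are stand-ins): block on the send's future, and only then answer 200.

```java
import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;

public class IngestAck {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092"); // placeholder
        props.put("acks", "all"); // require the in-sync replicas to persist it
        props.put("key.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        props.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        try (var producer = new KafkaProducer<String, String>(props)) {
            // Block until Kafka ACKs; only then would the HTTP layer reply 200.
            producer.send(new ProducerRecord<>("ingest", "ds-1", "event")).get();
            System.out.println("HTTP 200"); // stand-in for the real response
        }
    }
}
```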
SLIDE 26 Durability
[Diagram: Kafka → Digest WIP (buffer) → Segment File → QE; each segment file records the last consumed Kafka sequence number, so after a crash Digest resumes from what is on disk. Kafka retention must be long enough to cover a crash]
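A hedged sketch of that crash recovery (the offset persistence is illustrative; in reality it would be stored with the segment file): on restart, seek the consumer just past the last offset that made it into a durable segment.

```java
import java.time.Duration;
import java.util.List;
import java.util.Properties;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.common.TopicPartition;

public class DigestResume {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092"); // placeholder
        props.put("group.id", "digest");
        props.put("enable.auto.commit", "false"); // offsets live with our segment files
        props.put("key.deserializer", "org.apache.kafka.common.serialization.StringDeserializer");
        props.put("value.deserializer", "org.apache.kafka.common.serialization.StringDeserializer");

        long lastDurableOffset = 123_456L; // illustrative: read from the newest segment file
        TopicPartition tp = new TopicPartition("ingest", 0);
        try (var consumer = new KafkaConsumer<String, String>(props)) {
            consumer.assign(List.of(tp));
            consumer.seek(tp, lastDurableOffset + 1); // resume just past what is on disk
            var records = consumer.poll(Duration.ofSeconds(1));
            // ... rebuild the WIP buffer and continue digesting
        }
    }
}
```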
SLIDE 27 Durability
[Diagram: Ingest → Kafka → Digest WIP (buffer) → Segment → QE, annotated with a Kafka ingest latency chart (p50/p99)]
SLIDE 30 Hash?
[Diagram repeated from Kafka 101: producer → topic partitions, partition = hash(key). But what should the key be?]
SLIDE 31 Consumers falling behind…
- Reasons: data volume; processing time for real-time processing
- Measure ingest latency
- Increase parallelism when running tens of seconds behind
- Add randomness to the key on a log scale (1, 2, 4, …); see the sketch below
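A hedged sketch of that key-salting idea (the constants are illustrative): when a datasource lags, widen its key space by a power of two so its traffic spreads over more partitions, at the cost of per-key ordering.

```java
import java.util.concurrent.ThreadLocalRandom;

final class KeySalter {
    // spreadFactor grows on a log scale (1, 2, 4, ...) as the consumer
    // for this datasource falls further behind.
    static String saltedKey(String datasourceKey, int spreadFactor) {
        if (spreadFactor <= 1) return datasourceKey; // normal case: one stable key
        int salt = ThreadLocalRandom.current().nextInt(spreadFactor);
        return datasourceKey + "#" + salt; // now hashes onto spreadFactor partitions
    }
}
```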
SLIDE 32 Multiplexing
[Diagram: ~100,000 data sources multiplexed onto a topic with a few partitions (#1–#3)]
SLIDE 33 Data Model
[Diagram: Repository 1→* Data Source 1→* Event]
- Repository: storage limits, user admin
- Data Source: a time series identified by a set of key-value ‘tags’, e.g. Hash(#type=accesslog, #host=ops01) as the Kafka key
- Event: Map[String,String]
SLIDE 34 High variability tags ‘auto grouping’
- Tags (hash key) may be chosen with large value domain
- User name
- IP-address
- This causes many datasources => growth in metadata, resource issues
SLIDE 35 High variability tags ‘auto grouping’
- Tags (hash key) may be chosen with large value domain
- User name
- IP-address
- Humio sees this and hashes the tag value into a smaller value domain before the Kafka partition hash
SLIDE 36 High variability tags ‘auto grouping’
- For example, before Kafka ingest, hash(“kresten”) maps #user=kresten => #user=13
- Store the actual value ‘kresten’ in the event
- At query time, a search is rewritten to scan the data source #user=13 and re-filter based on the stored values (see the sketch below)
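A hedged illustration of the rewrite (the bucket count and hash choice are made up; the slide does not specify Humio's actual scheme):

```java
final class TagGrouping {
    private static final int BUCKETS = 16; // illustrative small value domain

    // #user=kresten  =>  #user=<bucket>, e.g. #user=13
    static String groupTagValue(String value) {
        int bucket = Math.floorMod(value.hashCode(), BUCKETS);
        return Integer.toString(bucket);
    }

    public static void main(String[] args) {
        // The real value "kresten" is still stored inside the event; a query
        // for #user=kresten is rewritten to scan #user=<bucket> and re-filter.
        System.out.println("#user=" + groupTagValue("kresten"));
    }
}
```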
SLIDE 37 Multiplexing in Kafka
- Ideally, we would just have 100,000 dynamic topics that perform well and scale infinitely.
- In practice, you have to know your data and control the sharding. Default Kafka configs work for many workloads, but for maximum utilisation you have to go beyond the defaults.
- Humio automates this problem for log data w/ tags.
SLIDE 38 Using Kafka in an on-prem Product
- Leverage the stability and fault tolerance of Kafka
- Large customers often have Kafka knowledge
- We provide kafka/zookeeper docker images
- The only real issue is the ZooKeeper dependency
- Often runs out of disk space in small setups
SLIDE 39 Other Issues
- Observed GC pauses in the JVM
- Kafka and HTTP libraries compress data
- JNI/GC interactions with byte[] can block global GC.
- We replaced both with custom compression
- JLibGzip (gzip in pure Java)
- Zstd and LZ4/JNI using DirectByteBuffer
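A sketch of the DirectByteBuffer approach using the zstd-jni library (assuming its direct-buffer API; exact signatures may vary by version): direct buffers live outside the Java heap, so the JNI call does not pin a byte[] that the collector might want to move.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;
import com.github.luben.zstd.Zstd;

public class DirectCompress {
    public static void main(String[] args) {
        byte[] input = "some log line".getBytes(StandardCharsets.UTF_8);

        // Off-heap buffers: JNI operates on them without pinning heap byte[]s,
        // avoiding the GC interactions described above.
        ByteBuffer src = ByteBuffer.allocateDirect(input.length);
        src.put(input).flip();
        ByteBuffer dst = ByteBuffer.allocateDirect((int) Zstd.compressBound(input.length));

        int written = Zstd.compress(dst, src, 3); // level 3; returns compressed size
        System.out.println("compressed " + input.length + " -> " + written + " bytes");
    }
}
```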
SLIDE 40 Resetting Kafka/Zookeeper
- Kafka provides a ‘cluster id’ we can use as epoch
- All Kafka sequence numbers (offsets) are reset
- Recognise this situation; never replay beyond such a reset.
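One way to detect this with the Kafka admin client (a sketch; the slide does not show Humio's actual mechanism): compare the broker-reported cluster id against the one recorded alongside our data.

```java
import java.util.Properties;
import org.apache.kafka.clients.admin.AdminClient;

public class ClusterEpoch {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092"); // placeholder
        try (AdminClient admin = AdminClient.create(props)) {
            String clusterId = admin.describeCluster().clusterId().get();
            String recorded = "previously-stored-id"; // illustrative: read from local state
            if (!clusterId.equals(recorded)) {
                // Kafka was reset: its offsets restarted, so never replay or
                // trust sequence numbers from before this new epoch.
                System.out.println("new Kafka epoch: " + clusterId);
            }
        }
    }
}
```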
SLIDE 41 What about KSQL?
- Kafka now has KSQL, which is in many ways similar to the engine we built
- Humio moves the computation to the data; KSQL moves the data to the computation
- We provide an interactive, end-user-friendly experience
SLIDE 42 Final thoughts
- With #IndexFreeLogging you can eat your cake and have it too: fast, useful, low-footprint logging.
- Many difficult problems go away by deferring them to Kafka.
SLIDE 43 Thanks for your time.
Kresten Krab Thorup Humio CTO
SLIDE 44
Filter 1GB data