Apache NiFi
Better Analytics Demand Better Dataflow
Presented by: Joe Witt Apache NiFi PPMC Member
Apache NiFi Better Analytics Demand Better Dataflow Presented by: - - PowerPoint PPT Presentation
Apache NiFi Better Analytics Demand Better Dataflow Presented by: Joe Witt Apache NiFi PPMC Member Apache NiFis job: Enterprise Dataflow Management Automate the flow of data from any source to systems which extract meaning and insight
Better Analytics Demand Better Dataflow
Presented by: Joe Witt Apache NiFi PPMC Member
Apache NiFi’s job: Enterprise Dataflow Management
1
Automate the flow of data from any source …to systems which extract meaning and insight …and to those that store and make it available for users
Analytics need data with the following characteristics:
2
Quality Correct, complete, reliable Relevance Right size, rate, format, schema, content, lightweight analysis Timeliness All data has a half-life. Not all data is created equal. Secure Confidential, unaltered Compliant Authorized, traceable Recoverable Errors happen. Iterate until it’s right.
Enterprise Dataflow: “What could possibly go wrong?”
3
Dataflow – Route, Transform, Mediate Acquire Analyze Store
Dataflow across the enterprise
4
Edge Sites Regional Sites Corporate Datacenters Partners
Challenges at the edge
5
Edge Sites
Challenges at the core
6
Corporate Datacenters
Data may need transformation
Systems may be
Scaling and reliability
Governance
The basic building blocks Real-time Command and Control The Power of Provenance
7
Apache NiFi Foundational Concepts
2 3 1
Attributes Map [[Key | Value]]
Flow File
8
Flow File Processor
9
Connections
10
Flow Controller
11
NiFi Architecture
12
NiFi Clustering Model
13
Tighten the feedback loop
Continuous Improvement
Intuitive user experience
14
Real-time command and control
2
Latency Optimization
Compliance
Understanding
15
The Power of Provenance aka “Dude, where’s my data?”
3
Status and direction for NiFi
16
Efficient use of each node
Simple / Effective scaling model Runtime Command and Control Data Provenance Distributed durability of data
High Availability Cluster Manager Live / Rolling Upgrades Provenance Query Language / Reporting A complete user experience enabled by provenance
Existing Strengths Roadmap Highlights
Apache NiFi (incubating) site http://nifi.incubator.apache.org Subscribe to and collaborate at dev@nifi.incubator.apache.org Submit Ideas or Issues https://issues.apache.org/jira/browse/NIFI @ApacheNifi
17
Learn more about Apache NiFi