Streamline Hadoop DevOps with Apache Ambari


SLIDE 1

Streamline Hadoop DevOps with Apache Ambari

Alejandro Fernandez

May 18, 2017

SLIDE 2

Speaker

Alejandro Fernandez Staff Software Engineer @ Hortonworks Apache Ambari PMC alejandro@apache.org

SLIDE 3

WHY ARE WE HERE?

“WORKING FROM MIAMI”

SLIDE 4

What is Apache Ambari?

Apache Ambari is the open-source platform to deploy, manage and monitor Hadoop clusters

SLIDE 5

Single Pane of Glass for Hadoop

SLIDE 6

[Chart: # of JIRAs resolved per release]
April '15: 2,335 | Jul-Sep '15: 1,764 | Dec '15-Feb '16: 1,764 | Aug-Nov '16: 1,499 | Mar '17: 1,688

20.5k commits over 4.5 years by 80 committers/contributors, and growing.

SLIDE 7

Exciting Enterprise Features in Ambari 2.5

Core
  • AMBARI-18731: Scale Testing on 2500 Agents
  • AMBARI-18990: Self-Heal DB Inconsistencies

Alerts & Log Search
  • AMBARI-19257: Built-in SNMP Alert
  • AMBARI-16880: Simplified Log Rotation Configs

Security
  • AMBARI-18650: Password Credential Store
  • AMBARI-18365: API Authentication Using SPNEGO

Ambari Metrics System
  • AMBARI-17859: New Grafana Dashboards
  • AMBARI-15901: AMS High Availability
  • AMBARI-19320: HDFS TopN User and Operation Visualization

Service Features
  • AMBARI-2330: Service Auto-Restart
  • AMBARI-19275: Download All Client Configs
  • AMBARI-7748: Manage JournalNode HA

SLIDE 8

Simplify Operations - Lifecycle

[Diagram: ease-of-use across the cluster lifecycle] Deploy → Secure (LDAP) → Smart Configs → Upgrade → Monitor → Scale, Extend, Analyze

SLIDE 9

Deploy On Premise

Mix-and-Match

SLIDE 10

Deploy On The Cloud

  • Certified environments
  • Sysprepped VMs
  • Hundreds of similar clusters
  • Ephemeral workloads

SLIDE 11

Deploy with Blueprints

  • Systematic way of defining a cluster
  • Export existing cluster into blueprint

/api/v1/clusters/:clusterName?format=blueprint

[Diagram] A Blueprint captures Configs + Topology; a Cluster is a Blueprint plus Hosts. An export sketch follows below.
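An existing cluster's blueprint can be pulled straight from the export endpoint above. A hedged sketch using Python and the requests library; the server URL, cluster name, and credentials are placeholders:

    import json
    import requests

    AMBARI = "http://ambari.example.com:8080"  # hypothetical Ambari Server
    AUTH = ("admin", "admin")                  # placeholder credentials

    # GET /api/v1/clusters/:clusterName?format=blueprint returns the cluster's
    # configs and topology as a blueprint document that can be reused elsewhere.
    resp = requests.get(AMBARI + "/api/v1/clusters/my-cluster",
                        params={"format": "blueprint"},
                        auth=AUTH)
    resp.raise_for_status()
    print(json.dumps(resp.json(), indent=2))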

SLIDE 12

Create a cluster with Blueprints

Blueprint:

{
  "configurations" : [
    { "hdfs-site" : {
        "dfs.datanode.data.dir" : "/hadoop/1,/hadoop/2,/hadoop/3"
    } }
  ],
  "host_groups" : [
    { "name" : "master-host",
      "components" : [
        { "name" : "NAMENODE" },
        { "name" : "RESOURCEMANAGER" },
        …
      ],
      "cardinality" : "1"
    },
    { "name" : "worker-host",
      "components" : [
        { "name" : "DATANODE" },
        { "name" : "NODEMANAGER" },
        …
      ],
      "cardinality" : "1+"
    }
  ],
  "Blueprints" : {
    "stack_name" : "HDP",
    "stack_version" : "2.5"
  }
}

Cluster creation template:

{
  "blueprint" : "my-blueprint",
  "host_groups" : [
    { "name" : "master-host",
      "hosts" : [
        { "fqdn" : "master001.ambari.apache.org" }
      ]
    },
    { "name" : "worker-host",
      "hosts" : [
        { "fqdn" : "worker001.ambari.apache.org" },
        { "fqdn" : "worker002.ambari.apache.org" },
        …
        { "fqdn" : "worker099.ambari.apache.org" }
      ]
    }
  ]
}

1. POST /api/v1/blueprints/my-blueprint
2. POST /api/v1/clusters/my-cluster

Both calls are scripted in the sketch below.
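A minimal sketch of the two POSTs with Python and requests, assuming the two JSON documents above are saved locally as my-blueprint.json and my-cluster-template.json (hypothetical filenames), with placeholder server URL and credentials:

    import json
    import requests

    AMBARI = "http://ambari.example.com:8080"  # hypothetical Ambari Server
    AUTH = ("admin", "admin")                  # placeholder credentials
    HEADERS = {"X-Requested-By": "ambari"}     # Ambari requires this header on writes

    def post(path, payload):
        resp = requests.post(AMBARI + path, auth=AUTH, headers=HEADERS,
                             data=json.dumps(payload))
        resp.raise_for_status()
        return resp

    # 1. Register the blueprint (configs + topology).
    with open("my-blueprint.json") as f:
        post("/api/v1/blueprints/my-blueprint", json.load(f))

    # 2. Instantiate a cluster from it (blueprint name + host mapping).
    with open("my-cluster-template.json") as f:
        resp = post("/api/v1/clusters/my-cluster", json.load(f))

    # Cluster creation is asynchronous; the response points at a request
    # resource that can be polled for provisioning progress.
    print(resp.json())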
SLIDE 16

Blueprints for Large Scale

  • Kerberos, secure out-of-the-box
  • High Availability is set up initially for NameNode, YARN, Hive, Oozie, etc.
  • Host Discovery allows Ambari to automatically install services on a host when it comes online
  • Stack Advisor for config recommendations
SLIDE 17

Blueprint Host Discovery

POST /api/v1/clusters/MyCluster/hosts

[
  {
    "blueprint" : "single-node-hdfs-test2",
    "host_groups" : [
      { "host_group" : "worker",
        "host_count" : 3,
        "host_predicate" : "Hosts/cpu_count>1"
      },
      { "host_group" : "super-worker",
        "host_count" : 5,
        "host_predicate" : "Hosts/cpu_count>2&Hosts/total_mem>3000000"
      }
    ]
  }
]

SLIDE 18

Service Layout

[Diagram] Common Services definitions with per-Stack overrides

SLIDE 19

Custom Service

Starter Pack:

  • metainfo.xml
  • Python scripts: lifecycle management
  • Configs: key, value, description, allow empty, password, etc.
  • Templates: Jinja template with config replacement
  • Role Command Order: ordering dependencies among start/stop commands
  • Service Advisor: recommend/validate configs on changes
  • Kerberos: principals and keytabs, configs to change when Kerberized
  • Widgets: UI config knobs, sections
  • Alerts: definition, type: [port, web, python script], interval
  • Metrics: for Ambari Metrics System
SLIDE 20

Custom Service – metainfo.xml

<service>
  <name>SAMPLESRV</name>
  <displayName>New Sample Service</displayName>
  <comment>A New Sample Service</comment>
  <version>1.0.0</version>
  <components>
    <component>
      <name>SAMPLESRV_MASTER</name>
      <displayName>Sample Srv Master</displayName>
      <category>MASTER</category>
      <cardinality>1</cardinality>
      <commandScript>
        <script>scripts/master.py</script>
        <scriptType>PYTHON</scriptType>
        <timeout>600</timeout>
      </commandScript>
    </component>
    <component>
      <name>SAMPLESRV_SLAVE_OR_CLIENT</name>
      <displayName>Sample Slave or Client</displayName>
      <category>SLAVE | CLIENT</category>
      <cardinality>0+ | 0-1 | 1 | 1+</cardinality>
      <commandScript>
        <script>scripts/slave_or_client.py</script>
        <scriptType>PYTHON</scriptType>
        <timeout>600</timeout>
      </commandScript>
    </component>
  </components>
  ...

metainfo.xml

SLIDE 21

Custom Service – metainfo.xml

...
<customCommand>
  <name>DECOMMISSION</name>
  <commandScript>
    <script>scripts/decommission.py</script>
    <scriptType>PYTHON</scriptType>
    <timeout>1200</timeout>
  </commandScript>
</customCommand>

<dependency>
  <name>HDFS/NAMENODE</name>
  <scope>cluster | host</scope>
  <auto-deploy>
    <enabled>true | false</enabled>
  </auto-deploy>
</dependency>
...

<requiredServices>
  <service>HDFS</service>
</requiredServices>

metainfo.xml

SLIDE 22

Custom Service – metainfo.xml

...
<configuration-dependencies>
  <config-type>service-env</config-type>
  <config-type>service-site</config-type>
  <config-type>hdfs-site</config-type>
</configuration-dependencies>

<osSpecifics>
  <osSpecific>
    <osFamily>any</osFamily>
    <packages>
      <package>
        <name>rpm_apt_pkg_name</name>
      </package>
    </packages>
  </osSpecific>
</osSpecifics>

metainfo.xml

SLIDE 23

Custom Service – Python Script

import sys
from resource_management import Script

class Master(Script):
  def install(self, env):
    print 'Install the Sample Srv Master'
  def stop(self, env):
    print 'Stop the Sample Srv Master'
  def start(self, env):
    print 'Start the Sample Srv Master'
  def status(self, env):
    print 'Status of the Sample Srv Master'
  def configure(self, env):
    print 'Configure the Sample Srv Master'

if __name__ == "__main__":
  Master().execute()

master.py

SLIDE 24

Stack Advisor

Stack Advisor recommends and validates configurations for Kerberos, HTTPS, ZooKeeper servers, memory settings, High Availability, and more.

Example (Atlas server configs):

# Atlas Servers
atlas.rest.address = http(s)://host:port
atlas.enableTLS = true|false
atlas.server.http.port = 21000
atlas.server.https.port = 21443
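In Ambari 2.x a stack advisor is a Python class that fills in recommended values like the Atlas example above. The sketch below is a simplified, hypothetical recommender in that spirit: the class name, method name, and input shapes are illustrative, not Ambari's exact API.

    # Hypothetical, simplified stand-in for a stack advisor recommendation.
    class SampleStackAdvisor(object):

        def recommend_atlas_configurations(self, configurations, services, atlas_host):
            """Derive atlas.rest.address from the TLS flag and port settings."""
            props = services["configurations"]["application-properties"]["properties"]
            if props.get("atlas.enableTLS", "false") == "true":
                scheme, port = "https", props.get("atlas.server.https.port", "21443")
            else:
                scheme, port = "http", props.get("atlas.server.http.port", "21000")
            recommended = configurations.setdefault("application-properties",
                                                    {"properties": {}})
            recommended["properties"]["atlas.rest.address"] = (
                "%s://%s:%s" % (scheme, atlas_host, port))
            return configurations

    # Example: recommend for a cluster where TLS is enabled.
    advisor = SampleStackAdvisor()
    out = advisor.recommend_atlas_configurations(
        {},
        {"configurations": {"application-properties":
            {"properties": {"atlas.enableTLS": "true"}}}},
        "atlas001.ambari.apache.org")
    print(out["application-properties"]["properties"]["atlas.rest.address"])
    # -> https://atlas001.ambari.apache.org:21443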

SLIDE 25

Service Advisors in Ambari 3.0

  • Break up the single Stack Advisor into 22 Service Advisors
  • Rewrite in Java for stronger type checking and better performance
  • Use the Drools rules engine
SLIDE 26

Comprehensive Security

LDAP/AD
  • User auth
  • Sync

Kerberos
  • MIT KDC
  • Keytab management

Atlas
  • Governance
  • Compliance
  • Lineage & history
  • Data classification

Ranger
  • Security policies
  • Audit
  • Authorization

Knox
  • Perimeter security
  • Supports LDAP/AD
  • Security for REST/HTTP
  • SSL
SLIDE 27

Kerberos

Ambari manages Kerberos principals and keytabs
Works with an existing MIT KDC or Active Directory

Once Kerberized, handles:
  • Adding hosts
  • Adding components to existing hosts
  • Adding services
  • Moving components to different hosts

SLIDE 28

Testing at Scale: 3000 Agents

Agent Multiplier
  • Each Agent has its own hostname, home dir, log dir, PID, and ambari-agent.ini file
  • Agent Multiplier can bootstrap 50 Agents per VM
  • Docker + Weave was tried previously but proved unstable for networking

[Diagram: each VM runs Agents 1 through 50]

SLIDE 29

Testing at Scale: 3000 Agents

Dummy Services (driven from a single Ambari Server)
  • Happy: always passes
  • Sleepy: always times out
  • Grumpy: always fails
  • ZooKeeper
  • HDFS
  • YARN
  • HBase

PERF Stack exercises: ✓ Scale (the server cannot tell the difference) ✓ Kerberos ✓ Stack Advisor ✓ Alerts ✓ Rolling & Express Upgrade ✓ UI Testing

SLIDE 31

Optimize for Large Scale

ambari-env.sh:

export AMBARI_JVM_ARGS="$AMBARI_JVM_ARGS -Xms2048m -Xmx8192m"

ambari.properties:

Property                                10 Hosts   50 Hosts   100 Hosts   >500 Hosts
agent.threadpool.size.max               25         35         75          100
alerts.cache.enabled                    -          -          true        true
alerts.cache.size                       -          -          50000       100000
alerts.execution.scheduler.maxThreads   -          -          2           4
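Read down the rightmost column, the table translates into ambari.properties entries like these for a cluster beyond 500 hosts (values taken directly from the table above):

    # ambari.properties tuning for a >500-host cluster
    agent.threadpool.size.max=100
    alerts.cache.enabled=true
    alerts.cache.size=100000
    alerts.execution.scheduler.maxThreads=4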

  • Dedicated database server with SSD
  • MySQL 5.7 and DB tuning
  • Purge old Ambari history: commands, alerts, BP topology, upgrades.

https://community.hortonworks.com/articles/80635/optimize-ambari-performance-for-large-clusters.html

SLIDE 34

Background: Upgrade Terminology

Manual Upgrade
  • The user follows instructions to upgrade the stack
  • Incurs downtime

Rolling Upgrade
  • Automated; upgrades one component per host at a time
  • Preserves cluster operation and minimizes service impact

Express Upgrade
  • Automated; runs in parallel across hosts
  • Incurs downtime

SLIDE 35

Automated Upgrade: Rolling or Express

1. Register + Install: register the HDP repository and install the target HDP version on the cluster
2. Check Prerequisites: review the prerequisites to confirm your cluster configs are ready
3. Prepare: take backups of critical cluster metadata
4. Perform Upgrade: perform the HDP upgrade; the steps depend on the upgrade method, Rolling or Express (see the sketch after this list)
5. Finalize: finalize the upgrade, making the target version the current version
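When automating the Perform Upgrade step, it maps to a POST against the cluster's upgrades endpoint (the same endpoint SLIDE 40 uses for status). A hedged sketch; the payload fields follow Ambari 2.x conventions (upgrade_type of ROLLING or NON_ROLLING) but should be verified against your version's API docs:

    import json
    import requests

    AMBARI = "http://ambari.example.com:8080"  # hypothetical Ambari Server
    AUTH = ("admin", "admin")                  # placeholder credentials
    HEADERS = {"X-Requested-By": "ambari"}

    # Kick off an automated upgrade to an already-registered, installed version.
    payload = {"Upgrade": {"repository_version": "2.5.3.0",  # assumed target
                           "upgrade_type": "ROLLING"}}
    resp = requests.post(AMBARI + "/api/v1/clusters/my-cluster/upgrades",
                         auth=AUTH, headers=HEADERS, data=json.dumps(payload))
    resp.raise_for_status()
    print(resp.json())  # request resource to poll for progress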

SLIDE 36

Process: Rolling Upgrade

[Diagram: Rolling Upgrade order]
ZooKeeper → Ranger/KMS → Core Masters (HDFS: NN1 then NN2; YARN; HBase) → Core Slaves (DataNodes, etc.) → Hive → Spark → Oozie → Kafka → Falcon → Knox → Storm → Slider → Flume → Accumulo → Clients (HDFS, YARN, MR, Tez, HBase, Pig, Hive, etc.) → Finalize or Downgrade

On failure:
  • Retry
  • Ignore
  • Downgrade

SLIDE 37

Process: Express Upgrade

[Diagram: Express Upgrade order]
Stop high-level services (Spark, Storm, etc.) → Back up HDFS, HBase, Hive → Change Stack + Configs → Stop low-level services (YARN, MR, HDFS, ZK) → Upgrade ZooKeeper, Ranger/KMS, HDFS, YARN, MapReduce2, HBase, Hive, Oozie, Falcon, Knox, Storm, Slider, Flume, Accumulo → Finalize or Downgrade

Hosts are processed in parallel, in batches of 1 to 100 hosts at a time.

On failure:
  • Retry
  • Ignore
  • Downgrade

SLIDE 38

[Chart] Total time (Rolling Upgrade): 2:53, 13:16, 26:26 for increasingly large clusters

Scales linearly with the # of hosts

SLIDE 39

[Chart] Total time (Express Upgrade): 0:32, 1:14, 2:19 on the same clusters, i.e. 5.4x, 10.7x, and 11.4x faster than Rolling

Scales linearly with the # of batches (defaults to 100 hosts at a time)

SLIDE 40

Upgrade Endpoint

Status: http://server:8080|8443/api/v1/clusters/$name/upgrades

Navigate:
  • Groups: Core Masters, Core Slaves, …, Post Cluster, etc.
  • Items: e.g., "Upgrading ZooKeeper Server on host namenode.apache.org" (polling sketch below)
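The same endpoint is easy to poll from a script. A hedged sketch with requests (placeholder server and credentials); Ambari collection responses have the shape {"items": [{"href": ...}, ...]}, though exact field names can vary by version:

    import requests

    AMBARI = "http://ambari.example.com:8080"  # hypothetical Ambari Server
    AUTH = ("admin", "admin")                  # placeholder credentials

    def get(url):
        resp = requests.get(url, auth=AUTH)
        resp.raise_for_status()
        return resp.json()

    # List this cluster's upgrades, then fetch each one's detail record.
    upgrades = get(AMBARI + "/api/v1/clusters/my-cluster/upgrades")
    for item in upgrades["items"]:
        detail = get(item["href"])  # direction, from/to version, status
        print(detail.get("Upgrade", {}))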

SLIDE 41

Upgrade – Debugging with SQL

SELECT u.upgrade_id, u.direction, u.from_version, u.to_version,
       hrc.request_id, hrc.task_id, substr(g.group_title, 0, 30),
       substr(i.item_text, 0, 30), hrc.status
FROM upgrade u
JOIN upgrade_group g ON g.upgrade_id = u.upgrade_id
JOIN upgrade_item i ON i.upgrade_group_id = g.upgrade_group_id
JOIN host_role_command hrc ON hrc.stage_id = i.stage_id
  AND hrc.request_id = u.request_id
ORDER BY hrc.task_id;

SLIDE 42

Alerting Framework

  • WEB: connects to a web URL; alert status is based on the HTTP response code. Thresholds: Response Code (n/a), Connection Timeout (seconds)
  • PORT: connects to a port; alert status is based on response time. Thresholds: Response (seconds)
  • METRIC: checks the value of a service metric; units vary based on the metric being checked. Thresholds: Metric Value (units vary), Connection Timeout (seconds)
  • AGGREGATE: aggregates the status of another alert. Thresholds: % Affected (percentage)
  • SCRIPT: executes a script to handle the alert check (sketch below). Thresholds: varies
  • SERVER: executes a server-side runnable class to handle the alert check. Thresholds: varies
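For the SCRIPT type, the agent loads a Python module and invokes it. The sketch below follows the get_tokens/execute shape used by Ambari's bundled script alerts, but treat the exact contract as version-dependent; the disk-space check itself is just an illustration:

    # Hypothetical script alert: checks free disk space on the agent host.
    import os

    def get_tokens():
        # Config keys to interpolate into `configurations`; none needed here.
        return ()

    def execute(configurations={}, parameters=[], host_name=None):
        """Return (RESULT_CODE, [message]) where RESULT_CODE is one of
        OK, WARNING, CRITICAL, or UNKNOWN."""
        stat = os.statvfs("/")
        free_pct = 100.0 * stat.f_bavail / stat.f_blocks
        if free_pct < 5:
            return ("CRITICAL", ["Only %.1f%% disk free" % free_pct])
        if free_pct < 10:
            return ("WARNING", ["%.1f%% disk free" % free_pct])
        return ("OK", ["%.1f%% disk free" % free_pct])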

SLIDE 43

Motivation Behind Ambari Metrics System

  • Limited Ganglia capabilities
  • OpenTSDB: GPL license and needs a Hadoop cluster
  • Aggregation at multiple levels: service, time
  • Alerts based on the metrics system
  • Scale past 1,000 nodes
  • Analytics based on use cases
  • Fine-grained control over retention, collection intervals, and aggregation
  • Pluggable and extensible
SLIDE 44

AMS Architecture

  • Custom Sinks: HDFS, YARN, HBase, Storm, Kafka, Flume, Accumulo
  • Monitors: lightweight daemons for system metrics
  • Collector: API daemon + HBase (embedded / distributed)
  • Phoenix schema designed for fast reads
  • Managed HBase
  • Grafana support since version 2.2.2

[Diagram: Sinks on HDP services and Monitors on each system push metrics to the Metrics Collector (Collector API + Phoenix); Ambari and Grafana read back through the Collector API]

SLIDE 45

AMS Distributed Collector Architecture

[Diagram: Metrics Monitors on cluster hosts (e.g., ZooKeeper nodes) and Metrics Sinks in YARN, Kafka, Flume, HBase, Storm, Hive, NiFi, and HDFS send metrics to multiple Metrics Collectors; each Collector runs the Collector API, Phoenix aggregators, an HBase Master + RegionServer, and a Helix Participant]

SLIDE 46

AMS Features

  • Simple POST API for sending metrics (sketch below)
  • Rich GET API to fetch metrics at a specific granularity
      § Point-in-time & series
      § Top N support
      § Rate support
  • Performs host-level aggregation as well as time-based downsampling
  • Highly tunable system
      § Adjust the rate of collecting/sending metrics
      § Adjust the granularity of stored data
      § Skip aggregation for certain metrics
      § Whitelist metrics
  • Metadata API that reports which metrics are being collected and which component sends them
  • Abstract Sink implementation to facilitate easy integration with the Metrics Collector
  • HTTPS support
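The POST API is simple enough to exercise by hand. A hedged sketch that sends one custom metric to the Metrics Collector; the collector hostname is a placeholder, and port 6188 with the /ws/v1/timeline/metrics path follows AMS's documented defaults (verify for your install):

    import json
    import time
    import requests

    COLLECTOR = "http://metrics-collector.example.com:6188"  # placeholder host

    def send_metric(name, value, app_id="myapp", hostname="myhost"):
        now_ms = int(time.time() * 1000)
        payload = {"metrics": [{
            "metricname": name,
            "appid": app_id,
            "hostname": hostname,
            "timestamp": now_ms,
            "starttime": now_ms,
            "metrics": {str(now_ms): value},  # data points: epoch-millis -> value
        }]}
        resp = requests.post(COLLECTOR + "/ws/v1/timeline/metrics",
                             data=json.dumps(payload),
                             headers={"Content-Type": "application/json"})
        resp.raise_for_status()

    send_metric("custom.queue.depth", 42.0)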
SLIDE 47

Grafana for Ambari Metrics

FEATURES
  • Grafana as a "Native UI" for Ambari Metrics
  • Pre-built dashboards: host-level and service-level
  • Supports HTTPS

DASHBOARDS
  • System Home, Servers
  • HDFS Home, NameNodes, DataNodes
  • YARN Home, Applications, Job History Server
  • HBase Home, Performance

SLIDE 48

AMS - Grafana Integration

SLIDE 49

Log Search

Search and index HDP logs!

Capabilities
  • Rapid search of all HDP component logs
  • Search across time ranges, log levels, and for keywords

Components: Ambari, Logsearch, Solr

SLIDE 50

Log Search

[Diagram: a Log Feeder on each worker node ships logs into Solr (SolrCloud); the Log Search UI, managed by Ambari, queries Solr]

Java process, multi-output support, Grok filters, Solr Cloud, local disk storage

SLIDE 51

Future of Apache Ambari 3.0

  • Cloud features
  • Service multi-instance (e.g., two ZooKeeper quorums)
  • Service multi-versions (e.g., Spark 2.0 & Spark 2.2)
  • YARN assemblies & services
  • Patch Upgrades: upgrade individual components within the same stack version, e.g., just DN and RM in HDP 3.0.*.*, with zero downtime
  • Ambari High Availability
SLIDE 52

Resources

Contribute to Ambari:
https://cwiki.apache.org/confluence/display/AMBARI/Quick+Start+Guide

Referenced Articles:
https://community.hortonworks.com/articles/43816/how-to-createadd-the-service-stop-the-service.html
https://community.hortonworks.com/articles/80635/optimize-ambari-performance-for-large-clusters.html

Image Sources:
http://www.vacationgetaways4less.com/wp-content/gallery/miami-newport-beachside-hotel-resort-banner/miami-beach-south-beach-night-730x302.jpg
https://ak9.picdn.net/shutterstock/videos/2139614/thumb/1.jpg

Many thanks to the ASF, audience, and event organizers.