SAS Data Loader for Hadoop Agenda Intro What is Hadoop? What do I - - PowerPoint PPT Presentation

sas data loader for hadoop agenda
SMART_READER_LITE
LIVE PREVIEW

SAS Data Loader for Hadoop Agenda Intro What is Hadoop? What do I - - PowerPoint PPT Presentation

SAS Data Loader for Hadoop Agenda Intro What is Hadoop? What do I get from Hadoop? Hadoop components Why SAS Data Loader for Hadoop? SAS Data Loader for Hadoop overview Demo Introduction Doug Cutting, creator of Hadoop


slide-1
SLIDE 1

SAS Data Loader for Hadoop

slide-2
SLIDE 2

Agenda

  • Intro
  • What is Hadoop?
  • What do I get from Hadoop?
  • Hadoop components
  • Why SAS Data Loader for Hadoop?
  • SAS Data Loader for Hadoop overview
  • Demo
slide-3
SLIDE 3

Introduction

Doug Cutting, creator of Hadoop Hadoop, a toy elephant owned by his son

slide-4
SLIDE 4

What is Hadoop?

Hadoop is an open-source software framework for storing and processing big data in a distributed fashion on a large cluster of commodity hardware. Essentially it accomplishes two tasks: massive data storage and faster processing.

  • pen-source

distributed massive data storage faster processing

slide-5
SLIDE 5

What do I get from Hadoop?

  • Speed of handling huge amounts of data, of any kind.
  • Cost of open-source framework and use of commodity hardware.
  • Power of processing data in a distributed computing model.
  • Scalability of the platform.
  • Flexibility of what data you choose to store.
  • Reliability with data stored and replicated across the distributed hardware
slide-6
SLIDE 6

Hadoop Components

Core Components:

  • HDFS
  • MapReduce
  • YARN

Additional Components:

  • Pig
  • Hive
  • Hbase
  • Zookeeper
  • Ambari
  • Flume
  • Sqoop
  • Oozie
slide-7
SLIDE 7

Why SAS Data Loader for Hadoop

Performing a simple task in Hadoop can require writing hundreds of lines

  • f code, and the number of resources with the required skills are limited.

SAS Data Loader provides a self service interface that turns your Hadoop environment into a productive environment where the barriers are removed and the data is accessible and usable. This mean you can:

  • Manage data inside Hadoop
  • Reduce the complexity of Hadoop
  • Accelerate user adoption
slide-8
SLIDE 8

SAS Data Loader for Hadoop Architecture

slide-9
SLIDE 9

SAS Data Loader for Hadoop Demo

slide-10
SLIDE 10

Hadoop Documentation

SAS Data Loader 2.2 for Hadoop: Installation and Configuration Guide http://support.sas.com/documentation/onlinedoc/dmdd/ SAS Data Loader 2.2 for Hadoop: Users Guide Guide http://support.sas.com/documentation/onlinedoc/dmdd/ SAS 9.4 Support for Hadoop http://support.sas.com/resources/thirdpartysupport/v94/hadoop/ Hadoop with Kerberos: Architecture Considerations http://support.sas.com/resources/papers/Hadoop_Architecture.pdf Hadoop with Kerberos: Deployment Considerations http://support.sas.com/resources/papers/Hadoop_Deployment.pdf SAS 9.4 Hadoop Configuration Guide for Base SAS and SAS/Access http://support.sas.com/resources/thirdpartysupport/v94/hadoop/hadoopbacg.pdf

slide-11
SLIDE 11

www.SAS.com Thank You, any questions?