GridGain vs Hadoop Why Elephants Cant Fly GridGain System 1065 - - PowerPoint PPT Presentation

gridgain vs hadoop why elephants can t fly
SMART_READER_LITE
LIVE PREVIEW

GridGain vs Hadoop Why Elephants Cant Fly GridGain System 1065 - - PowerPoint PPT Presentation

GridGain vs Hadoop Why Elephants Cant Fly GridGain System 1065 East Hillsdale Boulevard Suite 230 Foster City, CA 94404 www.gridgain.com GridGain In A Glance GridGain is Java based open source middleware for transactional real time big


slide-1
SLIDE 1

GridGain vs Hadoop Why Elephants Can’t Fly

GridGain System

1065 East Hillsdale Boulevard Suite 230 Foster City, CA 94404 www.gridgain.com

slide-2
SLIDE 2

GridGain Real Time Big Data Slide

GridGain In A Glance

2

GridGain is Java based open source middleware for transactional real time big data processing that scales up from one server to thousands of machines. Unlike complex, decade-old Hadoop MapReduce systems which use stale data for batch offline analytics, our platform allows companies to harness live data for smarter, faster real time processing.

slide-3
SLIDE 3

GridGain Real Time Big Data Slide

GridGain History

> GridGain ¡Systems ¡founded ¡in ¡2005 > VC ¡funded > Headquarter ¡in ¡Foster ¡City, ¡California, ¡USA > 12 ¡product ¡releases:

> GridGain 1.x, Jul 2007 > GridGain 2.x, Feb 2008 > GridGain 3.0, Aug 2010

> Current ¡release ¡is ¡GridGain ¡3.6

3

slide-4
SLIDE 4

GridGain Real Time Big Data Slide

GridGain Facts

Over ¡8,000,000 ¡starts ¡worldwide 1000 ¡unique ¡IP/month 400 ¡acIve ¡projects/month 4000 ¡forum ¡views/month GridGain ¡starts ¡every ¡10 ¡seconds ¡around ¡the ¡globe

slide-5
SLIDE 5

GridGain Real Time Big Data Slide

GridGain Users GridGain Partners

slide-6
SLIDE 6

GridGain Real Time Big Data Slide

GridGain Technology

> Fully integrated cloud middleware: Compute Grid + Data Grid > Real Time Transactional Big Data > Zero Deployment > Two editions:

> Community Edition: License: GPLv3 + Basic Features > Enterprise Edition: Commercial License + Enterprise Features

> Language support:

> Java 1.6 > Scala 2.9.1

6

slide-7
SLIDE 7

GridGain Real Time Big Data Slide

GridGain - Compute Grid

> Direct support for MapReduce > Auto discovery > Checkpoints for long running tasks > Load Balancing > Affinity co-location with data grids > Automatic fault tolerance

7

slide-8
SLIDE 8

GridGain Real Time Big Data Slide

GridGain - Data Grid

8

> Replication & Partitioning > Pessimistic & Optimistic Tx > Read-Through and Write-Through > Pluggable data overflow storage > Distributed Queries > Distributed Queues and Latches > Distributes Java Atomics

slide-9
SLIDE 9

GridGain Real Time Big Data Slide

Hadoop Processing

9

> Very large data sets, BUT... > Not Real Time > Mandatory data snapshots > HDFS instead of live databases > Analytics based on offline data

slide-10
SLIDE 10

GridGain Real Time Big Data Slide

GridGain Processing

10

> Large data sets > Near Real Time Processing > Online databases > In-memory data caching > Co-location of analytics and data > Business analytics on Live Data

slide-11
SLIDE 11

Cloud ¡CompuIng ¡with ¡Scala ¡and ¡GridGain Slide ¡

Live Coding - Real Time Word Count

>

Real time uploading of books into Cache

>

Real time updates of word counts

>

Real time SQL queries for popular words

>

Real time print-outs of most popular words

>

... using Scala & GridGain

11

slide-12
SLIDE 12

GridGain Real Time Big Data Slide 12

Thank You!