The General-Purpose Storage Revolution Jeff Bonwick Sun Fellow and - - PowerPoint PPT Presentation

the general purpose storage revolution
SMART_READER_LITE
LIVE PREVIEW

The General-Purpose Storage Revolution Jeff Bonwick Sun Fellow and - - PowerPoint PPT Presentation

The General-Purpose Storage Revolution Jeff Bonwick Sun Fellow and CTO, Storage Sun Microsystems 1 1 The General-Purpose Storage Revolution I know you're out there. I know you're out there. I can feel you now. I can feel you now.


slide-1
SLIDE 1

1

The General-Purpose Storage Revolution

Jeff Bonwick Sun Fellow and CTO, Storage Sun Microsystems

1

slide-2
SLIDE 2

The General-Purpose Storage Revolution

“ “I know you're out there. I know you're out there. I can feel you now. I can feel you now. I know that you're afraid... I know that you're afraid... you're afraid of us. you're afraid of us. You're afraid of change. You're afraid of change. I don't know the future. I don't know the future. I didn't come here to tell you I didn't come here to tell you how this is going to end. how this is going to end. I came here to tell you I came here to tell you how it's going to begin.” how it's going to begin.” – – Neo, “The Matrix” (1999) Neo, “The Matrix” (1999)

slide-3
SLIDE 3

The General-Purpose Storage Revolution

General-Purpose Compute Revolution

  • It happened in compute over the last two decades
  • Volume CPUs replaced

special-purpose hardware

  • Driven by economic

fundamentals – resistance is futile

slide-4
SLIDE 4

The General-Purpose Storage Revolution

General-Purpose Storage Revolution

  • It's happening in storage right now – on three fronts:
  • Hardware
  • Special-purpose hardware vs. volume microprocessors
  • Software
  • Special-purpose kernels vs. volume operating systems
  • Networking
  • Special-purpose interconnects vs. Ethernet
  • General-purpose storage is inevitable
  • Market is ripe for disruption
  • More than just cost reduction
  • New things become possible
slide-5
SLIDE 5

The General-Purpose Storage Revolution

Clock Rate and Core/Thread Trends

1982 1987 1992 1997 2002 2007 10 100 1000 10000

Clock rate hit a brick wall in 2002...

Year MHz

slide-6
SLIDE 6

The General-Purpose Storage Revolution

Clock Rate and Core/Thread Trends

  • ... but cores are proliferating:
  • AMD – 4 cores
  • Intel – 4 cores
  • Sun – 8 cores, 64 threads
  • Vertical scale in every socket
slide-7
SLIDE 7

The General-Purpose Storage Revolution

Implications

  • Compute becomes cheap – but requires scalable software
  • Multi-core & CMT: vertical scale in every socket – even at the low end
  • Solaris is really good at this
  • RAID, crypto, checksums move from controllers to CPUs
  • Eliminates special-purpose hardware
  • Reduces power consumption
  • Enables end-to-end data integrity, stronger security
  • Storage will host applications
  • Data mining, indexing – anything you'd rather do near the data
  • Applications can be deployed as virtual appliances
  • But how will we talk to them?
slide-8
SLIDE 8

The General-Purpose Storage Revolution

Network Performance Trends

1992 1997 2002 2007 0.01 0.1 1 10 100

The writing is on the wall...

Ethernet Fibre Channel InfiniBand

Year Gb/s

slide-9
SLIDE 9

The General-Purpose Storage Revolution

Implications

  • The storage network is the computer
  • Intelligent storage systems join the real network
  • A new breed of storage systems becomes possible
  • From moving bytes to answering questions (Google, Greenplum)
  • Leverage inside-the-box data speeds
slide-10
SLIDE 10

The General-Purpose Storage Revolution

Our Strategy

  • Cheap, fast hardware
  • Simple, powerful, resilient software
  • Horizontal scale on fast, commodity networks
  • Virtual appliances providing network data services

Our Special Sauce

  • System-level integration, everywhere
  • Novel use of general-purpose components
  • Building storage developer communities
slide-11
SLIDE 11

The General-Purpose Storage Revolution

Thumper: Cheap, Fast, Integrated H/W

  • Integrates compute and storage
  • 4 dozen drives + 4 CPUs + 4 GigE ports in 4U; 4 racks per PB
  • 3 GB/s internal bandwidth – platter speed on all 48 drives
  • Lots of cheap, powerful compute near the data
  • No H/W RAID – not needed with ZFS
  • No SAN required
  • The first in a family
  • Thumper JBOD (SAS)
  • More cores, many more disks
  • Very low structural overhead
  • Powered by ZFS
slide-12
SLIDE 12

The General-Purpose Storage Revolution

ZFS: Simple, Powerful, Resilient S/W

  • ZFS does for disks what VM did for memory
  • You don't manage DIMMs and ECC – why should you manage arrays?
  • Integrates filesystem, volume management, RAID
  • Pooled storage – no more volumes, partitions, provisioning
  • Creates reliable storage out of unreliable components
  • Protects the entire data path – disks, HBAs, switches, driver bugs, etc.
  • Self-healing – detects and corrects silent data corruption
  • File, block, and transactional object interfaces
  • Full or thin provisioning
  • Unlimited, constant-time snapshots and clones
  • Built-in compression
  • De-duplication, CDP, remote replication, and encryption in the works
slide-13
SLIDE 13

The General-Purpose Storage Revolution

pNFS: Standardized Horizontal Scaling

  • Parallel NFS
  • Part of IETF NFSv4.1 specification
  • Namespace and data placement scale horizontally
  • Works with existing, heterogeneous NFS clients
  • Commodity network hardware
  • No special interconnect – runs on Ethernet
  • Also runs on FC and IB, if you're so inclined
  • Potentially much more than NFS
  • Overall architecture isn't NFS-specific
  • New data services can use the same foundation
  • pCIFS
slide-14
SLIDE 14

The General-Purpose Storage Revolution

ADM: Integrated HSM

  • Automated Data Migration (next-generation SAM)
  • Simple to manage
  • Scalable to vast datasets
  • Integrated ZFS support
  • Policy-based data migration and archiving
  • Plugs into ZFS on top, tape libraries on the bottom
  • Automatically migrates data from disk to tape
  • Replaces traditional backup with continuous archiving
  • With de-duplication in ZFS, replaces VTL as well
  • 100% open data formats
slide-15
SLIDE 15

The General-Purpose Storage Revolution

FishWorks: Integrated Appliances

  • Build turn-key solutions from general-purpose components
  • Special-purpose capabilities
  • Volume economics
  • Deep system-level integration
  • Hardware fault domains fully modeled by FMA
  • High-level services like remote replication built on ZFS
  • Business analytics and data forensics built on DTrace
  • Link aggregation, multi-pathing, etc built on Solaris
  • Simple, cheap heartbeat card for active/active HA
  • Beautiful browser-based management built on AJAX and XML-RPC
  • Common look and feel across all Sun appliances
slide-16
SLIDE 16

The General-Purpose Storage Revolution

Solaris: The Storage OS of Choice

  • More scalable than Linux, BSD or Windows
  • Lightweight virtualization (zones) ideal for virtual appliances
  • High-quality NFS and (very soon) in-kernel CIFS
  • Industry-leading tools (DTrace, mdb)
  • ZFS – the most active OpenSolaris community
  • ZFS ported to Linux (FUSE), FreeBSD, MacOS X (Leopard)
  • New algorithms for compression, block allocation, etc
  • Amazing effect on pace – communication drives development & adoption
  • Sun is reclaiming thought leadership
  • Proprietary, closed-source vendors have no way to respond
slide-17
SLIDE 17

The General-Purpose Storage Revolution Network

WebDAV pNFS metadata HSM Block Object File Data Management Unit Storage Pool Allocator CIFS pNFS iSCSI Local (DB, etc) pNFS server iSCSI target FC target Commodity clients on commodity network Horizontally-scaled namespace and placement Industry-standard block and file protocols Unified (multi-protocol) pooled storage system Transactional semantics – always consistent; unlimited, instantaneous snapshots and clones; fast incrementals for real-time remote replication Policy-driven automated data migration Compression, de-duplication, encryption, error detection and correction, replication (mirroring, 1- and 2-parity RAID-Z, ditto blocks) Tens to thousands of disks Up to ¼ million tapes

ZFS

Solaris Network Services Heterogeneous Network Clients

Appliance dashboard: configuration, policy, health, telemetry, business analytics, data forensics

The Storage Network is the Computer