High Availability with the openais project Prepared by: Steven - - PowerPoint PPT Presentation

high availability with the openais project
SMART_READER_LITE
LIVE PREVIEW

High Availability with the openais project Prepared by: Steven - - PowerPoint PPT Presentation

High Availability with the openais project Prepared by: Steven Dake October 2005 Agenda Service Availability Forum Reliability and Availability Application Interface Specification The openais project Service Availability Forum


slide-1
SLIDE 1

High Availability with the openais project

Prepared by: Steven Dake October 2005

slide-2
SLIDE 2

Agenda

Service Availability Forum Reliability and Availability Application Interface Specification The openais project

slide-3
SLIDE 3

Service Availability Forum – Mission

The Service Availability™ Solution helps meet end-user expectations for voice, data and multimedia services delivered with the dependability of traditional telecommunications. The Service Availability™ Forum is addressing this by fostering an ecosystem to enable the use of commercial off-the-shelf building blocks in the creation of high availability network infrastructure products, systems and services. The Service Availability™ Forum will accomplish this through developing and publishing high availability and management software interface specifications as well promoting and facilitating their adoption by the industry.

slide-4
SLIDE 4

Service Availability Forum Member Companies

  • Artesyn Technologies
  • MySQL AB
  • Augmentix Corporation
  • NEC
  • Clovis Solutions
  • Nokia
  • Continuous Computing
  • Nortel Networks
  • Ericsson
  • NTT
  • Force Computers
  • Oracle Corporation
  • Fujitsu Siemens Computers
  • OSA Technologies
  • GNP
  • Phoenix Technologies
  • GoAhead Software
  • Radisys
  • Hewlett-Packard
  • Siemens
  • IBM
  • Solid Information Technology
  • Intel
  • Sun Microsystems
  • Kontron
  • TietoEnator
  • MontaVista Software
  • UXComm
  • Motorola
  • Veritas Software
  • Wind River Systems
slide-5
SLIDE 5

Service Availability Forum – The Software Stack

slide-6
SLIDE 6

Reliability and Availability – Availability Equation

A= MTTF MTTF+MTTR

Where MTTF is the mean time to failure and MTTR is the mean time to repair.

slide-7
SLIDE 7

Reliability and Availability – Availability with fixed MTTF and variable MTTR

Availability with MTTF of 10000

0.2 0.4 0.6 0.8 1 1.2 5000 10000 15000 20000 Mean Time To Repair (MTTR) Availability (A)

slide-8
SLIDE 8

Application Interface Specification - Overview

High Availability Specification Application Failover

Checkpoint Service Availability Management Framework

Communication

Cluster Membership Service Event Service Message Service

Mutual Exclusion

Distributed Lock Service

slide-9
SLIDE 9

Availability Management Framework

  • Overview

Allows service to be registered or unregistered Instantiates services as active or standby Detects service faults Provides mechanisms to gather instantiation

state

Mechanism to enable and disable services Allows reporting of errors and canceling errors

slide-10
SLIDE 10

Availability Management Framework – Service Group

Service Unit A Component A Component B Service Unit B Component C Component D

slide-11
SLIDE 11

Checkpoint Service - Overview

Checkpoints are named Checkpoints have sections which store

data

Checkpoint sections can be read and

written

When an standy component is directed

active by AMF, standby reads checkpoint sections and recovers state

slide-12
SLIDE 12

Cluster Membership Service - Overview

Maintains view of current configuration Allows for asynchronous notification of

configuration changes via tracking API

Provides mechanism to read current

configuration

slide-13
SLIDE 13

Eventing Service - Overview

Provides named event channels for

publish and subscribe

Publish events to an event channel Callback executed when filtered event is

delivered

Events can be filtered by api

slide-14
SLIDE 14

Messaging Service - Overview

Named queue identifiers for sending and

receiving messages

Mechanism to send a request and wait

for the response

Load balancing messages

slide-15
SLIDE 15

Locking Service - Overview

Resources can be locked and unlocked Asynchronous notification of many

  • perational types

Locks can be reclaimed in case of failure

  • f locker

deadlock detection

slide-16
SLIDE 16

The openais project - Agenda

Setup and Configuration Project History Architecture Performance Project Statistics

slide-17
SLIDE 17
  • penais – setup and configuration
  • Create shared key:

Linux# ./keygen OpenAIS Authentication key generator. Gathering 1024 bits for key from /dev/random. Writing openais key to /etc/ais/authkey.

  • Save /etc/ais/network.conf:

Bindnetaddr: 192.168.1.0 Mcastaddr: 226.94.1.1 Mcastport: 6000 Read QUICKSTART file in source package for more details.

slide-18
SLIDE 18
  • penais – project history

Project started in January 2002 to support hotswap on

ATCA chassis

Morphed into SA Forum in April 2003 Virtual Synchrony merged January 2004 Released to open source under Revised BSD license

by MontaVista Software in June 2004 as the openais project hosted at Open Source Development Labs.

Event service merged September 2004 Open Source Development Labs and SA Forum

  • fficially announce via press release their support for

the openais project in November 2004.

3rd generation implementation Virtual Synchrony

protocol merged January 2005

slide-19
SLIDE 19
  • penais – checkpoint performance

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 1 2 3 4 5 6 7 8 9 10

Throughput

KB / MSG M B / SEC

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 250 500 750 1000 1250 1500 1750 2000 2250

Transactions Per Second

KB / MSG TRANS / SEC

slide-20
SLIDE 20

Openais – performance with many processors

1 2 3 4 5 6 7 8 9 10 11 12 1 2 3 4 5 6 7 8 9 10

Group Messaging Throughput

No Encryption Encryption

Processor Count MB / SEC

slide-21
SLIDE 21
  • penais – project statistics

Executive LOC: 29728 Library LOC: 9505 Include LOC: 4231 Total LOC: 43464 Changesets since openais inception: 880 estimated 4500 hours of development,

9000 hours of testing and QA

slide-22
SLIDE 22

Accomplishments 2005

3 rd Generation Protocol implemented (totem) Jan Apr June July Au 98% pass rate w ith SAFtest Integration w ith Redhat's CMAN Integration w ith Linux- HA Patrick Caulfield submitted Ipv6 support AMF B.01.01 initial patch Distributed locks API merged Initial Redundant Ring protocol GIG-e, jumbo frame support, 70MB/sec throughput achieved

slide-23
SLIDE 23

Sister Projects

The totem protocol in openais integrated

into Redhat's CMAN

The EVS library used by Linux-HA to

support membership and messaging

Integration of openais's AMF, CKPT with

Asterisk as POC

  • penais has generated 85 bug fixes

(patches) for saftest AIS B

slide-24
SLIDE 24

roadmap

Q1/05 Q4/05 Q4/06 Q4/05 Q4/05 Q1/06 Release 1 plus: DLCK service (exP) MSG service (exp) AMF B.01.01 Prototype redundant ring AMF manageme nt Targeted f s Release 1 plus: DLCK service MSG service AMF B.01.01 Productio n redundant ring AMF manageme nt Targeted Picacho AMF service CKPT service EVT service CLM service EVS B.01.01 B.01.01 85% code coverage SAF test 95% pass rateargete d for december

slide-25
SLIDE 25

Conclusion

Reduce MTTR to improve availability SA Forum AIS provides APIs to reduce

MTTR

  • pen source solution available of AIS

(http://developer.osdl.org/dev/openais

  • penais is suitable for deployment today
slide-26
SLIDE 26

Questions?