SLIDE 1

Achieving Performance Isolation with Lightweight Co-Kernels

Jiannan Ouyang, Brian Kocoloski, John Lange (The Prognostic Lab @ University of Pittsburgh)
Kevin Pedretti (Sandia National Laboratories)
HPDC 2015

SLIDE 2

HPC Architecture

Traditional
— Simulation runs on the Supercomputer; data flows through a Shared Storage Cluster to a Processing Cluster
— Problem: massive data movement over interconnects

In Situ Data Processing
— Move computation to data: Simulation and Analytic/Visualization share the Compute Node Operating System and Runtimes (OS/R)
— Improved data locality
— Reduced power consumption
— Reduced network traffic

SLIDE 3

Challenge: Predictable High Performance

— Tightly coupled HPC workloads are sensitive to OS noise and overhead [Petrini SC '03, Ferreira SC '08, Hoefler SC '10]
— Specialized kernels for predictable performance
  — Tailored from Linux: CNL for Cray supercomputers
  — Lightweight kernels (LWK) developed from scratch: IBM CNK, Kitten
— Data processing workloads favor Linux environments
— Cross-workload interference
  — Shared hardware (CPU time, cache, memory bandwidth)
  — Shared system software

How can we provide both Linux and specialized kernels on the same node, while ensuring performance isolation?

SLIDE 4

Approach: Lightweight Co-Kernels

— Hardware resources on one node are dynamically composed into multiple partitions, or enclaves
— Independent software stacks are deployed on each enclave
  — Optimized for certain applications and hardware
— Performance isolation at both the software and hardware level

[Diagram: on a stock node, Linux runs Simulation and Analytic/Visualization together on the same hardware; with co-kernels, the hardware is split so Linux hosts Analytic/Visualization while an LWK hosts Simulation]

SLIDE 5

Agenda

— Introduction
— The Pisces Lightweight Co-Kernel Architecture
— Implementation
— Evaluation
— Related Work
— Conclusion

SLIDE 6

Building Blocks: Kitten and Palacios

— The Kitten Lightweight Kernel (LWK)
  — Goal: provide predictable performance for massively parallel HPC applications
  — Simple resource management policies
  — Limited kernel I/O support + direct user-level network access
— The Palacios Lightweight Virtual Machine Monitor (VMM)
  — Goal: predictable performance
  — Lightweight resource management policies
  — Established history of providing virtualized environments for HPC [Lange et al. VEE '11, Kocoloski and Lange ROSS '12]

Kitten: https://software.sandia.gov/trac/kitten
Palacios: http://www.prognosticlab.org/palacios
http://www.v3vee.org/

SLIDE 7

The Pisces Lightweight Co-Kernel Architecture

[Diagram: hardware is partitioned among Linux and two Kitten co-kernels. Linux runs applications + virtual machines; Kitten co-kernel (1) runs the Palacios VMM hosting an isolated virtual machine; Kitten co-kernel (2) runs an isolated application]

Pisces: http://www.prognosticlab.org/pisces/

Pisces Design Goals
— Performance isolation at both the software and hardware level
— Dynamic creation of resizable enclaves
— Isolated virtual environments

SLIDE 8

Design Decisions

— Elimination of cross-OS dependencies
  — Each enclave must implement its own complete set of supported system calls
  — No system call forwarding is allowed
— Internalized management of I/O
  — Each enclave must provide its own I/O device drivers and manage its hardware resources directly
— Userspace cross-enclave communication
  — Cross-enclave communication is not a kernel-provided feature
  — Cross-enclave shared memory is explicitly set up at runtime (XEMEM); see the sketch after this list
— Using virtualization to provide missing OS features
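To make the userspace shared-memory decision concrete, here is a minimal sketch of the pattern XEMEM builds on, written against plain POSIX shared memory rather than XEMEM's actual cross-enclave API: one process exports a named region, another attaches it, and data then moves with no kernel involvement on the data path. The segment name and size are illustrative.

```c
/* Minimal sketch of the named shared-memory pattern behind XEMEM,
 * using POSIX shm as a stand-in (compile with -lrt on older glibc).
 * XEMEM generalizes this export/attach step across enclaves. */
#include <fcntl.h>
#include <stdio.h>
#include <string.h>
#include <sys/mman.h>
#include <unistd.h>

#define REGION "/enclave-channel"   /* illustrative segment name */
#define SIZE   4096

int main(int argc, char **argv)
{
    int exporter = (argc > 1);       /* run with any argument to export */
    int fd = shm_open(REGION, O_CREAT | O_RDWR, 0600);
    if (fd < 0 || ftruncate(fd, SIZE) < 0)
        return 1;

    char *buf = mmap(NULL, SIZE, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
    if (buf == MAP_FAILED)
        return 1;

    if (exporter)                    /* producer side writes... */
        strcpy(buf, "hello across enclaves");
    else                             /* ...consumer side reads */
        printf("peer says: %s\n", buf);

    munmap(buf, SIZE);
    close(fd);
    return 0;
}
```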

SLIDE 9

Cross Kernel Communication

[Diagram: two hardware partitions, one running Linux and one running the Kitten co-kernel. In kernel context, cross-kernel messages flow over a shared-memory control channel. In user context, a control process on each side manages the channel, and Linux-compatible workloads exchange data with isolated processes + virtual machines through shared-memory communication channels]

XEMEM: Efficient Shared Memory for Composed Applications on Multi-OS/R Exascale Systems [Kocoloski and Lange, HPDC '15]
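The kernel-context side of this picture is essentially a mailbox: a command written to shared memory plus a notification. Below is a minimal sketch of that pattern, with two threads standing in for the Linux and Kitten kernels and an atomic flag standing in for the IPI; the struct layout and the "add_cpu 7" payload are illustrative assumptions, not Pisces' actual message format.

```c
/* Sketch of a shared-memory mailbox with a doorbell, the pattern used
 * for kernel-level cross-kernel messages (IPI + shared memory). */
#include <pthread.h>
#include <stdatomic.h>
#include <stdio.h>
#include <string.h>

struct mailbox {
    atomic_int pending;          /* stands in for the IPI doorbell */
    char cmd[64];                /* command payload in shared memory */
} mbox;

static void *cokernel_side(void *arg)
{
    (void)arg;
    while (!atomic_load(&mbox.pending))   /* "wait for the IPI" */
        ;
    printf("co-kernel handled command: %s\n", mbox.cmd);
    atomic_store(&mbox.pending, 0);       /* ack back to the Linux side */
    return NULL;
}

int main(void)
{
    pthread_t t;
    pthread_create(&t, NULL, cokernel_side, NULL);

    strcpy(mbox.cmd, "add_cpu 7");        /* write the payload first... */
    atomic_store(&mbox.pending, 1);       /* ...then ring the doorbell */

    pthread_join(t, NULL);
    return 0;
}
```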

SLIDE 10

Agenda

— Introduction
— The Pisces Lightweight Co-Kernel Architecture
— Implementation
— Evaluation
— Related Work
— Conclusion

SLIDE 11

Challenges & Approaches

— How to boot a co-kernel?
  — Hot-remove resources from Linux, and load the co-kernel (see the hotplug sketch after this list)
  — Reuse Linux boot code with a modified target kernel address
  — Restrict the Kitten co-kernel to access assigned resources only
— How to share hardware resources among kernels?
  — Hot-remove from Linux + direct assignment and adjustment (e.g. CPU cores, memory blocks, PCI devices)
  — Managed by Linux and Pisces (e.g. IOMMU)
— How to communicate with a co-kernel?
  — Kernel level: IPI + shared memory, primarily for Pisces commands
  — Application level: XEMEM [Kocoloski HPDC '15]
— How to route device interrupts?
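The hot-remove step uses Linux's standard CPU hotplug interface in sysfs. A minimal sketch, assuming root privileges and using cpu1 as an arbitrary example; the subsequent hand-off of the freed core to Kitten is Pisces-specific and omitted here.

```c
/* Sketch of the first step in booting a co-kernel: offlining a CPU
 * from Linux via the standard sysfs hotplug interface. */
#include <fcntl.h>
#include <stdio.h>
#include <unistd.h>

static int set_cpu_online(int cpu, int online)
{
    char path[64];
    snprintf(path, sizeof(path),
             "/sys/devices/system/cpu/cpu%d/online", cpu);

    int fd = open(path, O_WRONLY);
    if (fd < 0) {
        perror("open");
        return -1;
    }
    /* Writing "0" hot-removes the core from Linux's scheduler and
     * interrupt routing; writing "1" gives it back. */
    ssize_t n = write(fd, online ? "1" : "0", 1);
    close(fd);
    return n == 1 ? 0 : -1;
}

int main(void)
{
    if (set_cpu_online(1, 0) == 0)
        puts("cpu1 hot-removed from Linux, ready for assignment");
    return 0;
}
```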

SLIDE 12

I/O Interrupt Routing

[Diagram: two interrupt paths. Legacy Interrupt Forwarding: a legacy device raises INTx through the IO-APIC to the management kernel, whose IRQ forwarder re-delivers it to the co-kernel's IRQ handler as an IPI. Direct Device Assignment (w/ MSI): an MSI/MSI-X device delivers its MSI interrupt directly to the co-kernel's IRQ handler]

  • Legacy interrupt vectors are potentially shared among multiple devices, so Pisces provides an IRQ forwarding service (sketched below)
  • IRQ forwarding is only used during initialization for PCI devices
  • Modern PCI devices support dedicated interrupt vectors (MSI/MSI-X), which are routed directly to the corresponding enclave
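For illustration, a hedged fragment of what a legacy IRQ forwarder could look like as a Linux kernel module: it claims a (possibly shared) legacy line and re-delivers each interrupt to the co-kernel's core as an IPI. The IRQ number, vector, and target core are assumptions, and this sketches the mechanism only, not Pisces' actual code.

```c
/* Sketch (x86): forward a legacy device interrupt to a co-kernel core
 * as an IPI. All constants below are illustrative assumptions. */
#include <linux/interrupt.h>
#include <linux/module.h>
#include <asm/apic.h>

#define FWD_VECTOR 0xe5          /* illustrative vector given to the co-kernel */
static int cokernel_cpu = 1;     /* illustrative core owned by the co-kernel */

static irqreturn_t fwd_handler(int irq, void *dev)
{
    /* Re-deliver the device interrupt to the enclave's core as an IPI. */
    apic->send_IPI_mask(cpumask_of(cokernel_cpu), FWD_VECTOR);
    return IRQ_HANDLED;
}

static int __init fwd_init(void)
{
    /* IRQ 11 is an arbitrary example of a legacy, possibly shared line. */
    return request_irq(11, fwd_handler, IRQF_SHARED, "pisces-fwd",
                       &cokernel_cpu);
}

static void __exit fwd_exit(void)
{
    free_irq(11, &cokernel_cpu);
}

module_init(fwd_init);
module_exit(fwd_exit);
MODULE_LICENSE("GPL");
```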
SLIDE 13

Implementation

— Pisces
  — Linux kernel module; supports unmodified Linux kernels (2.6.3x – 3.x.y)
  — Co-kernel initialization and management
— Kitten (~9000 LOC changed)
  — Manages assigned hardware resources
  — Dynamic resource assignment
  — Kernel-level communication channel
— Palacios (~5000 LOC changed)
  — Dynamic resource assignment
  — Command forwarding channel

Pisces: http://www.prognosticlab.org/pisces/
Kitten: https://software.sandia.gov/trac/kitten
Palacios: http://www.prognosticlab.org/palacios
http://www.v3vee.org/

SLIDE 14

Agenda

— Introduction
— The Pisces Lightweight Co-Kernel Architecture
— Implementation
— Evaluation
— Related Work
— Conclusion

SLIDE 15

Evaluation

— 8-node Dell R450 cluster
  — Two six-core Intel "Ivy Bridge" Xeon processors
  — 24GB RAM split across two NUMA domains
  — QDR InfiniBand
  — CentOS 7, Linux kernel 3.16
— For the performance isolation experiments, the hardware is partitioned by NUMA domain (i.e. Linux on one NUMA domain, the co-kernel on the other)

SLIDE 16

Fast Pisces Management Operations

Operation                      Latency (ms)
Booting a co-kernel            265.98
Adding a single CPU core       33.74
Adding a 128MB memory block    82.66
Adding an Ethernet NIC         118.98

SLIDE 17

Eliminating Cross Kernel Dependencies

Execution Time of getpid()

                 solitary workloads (µs)    w/ other workloads (µs)
Linux            3.05                       3.48
co-kernel fwd    6.12                       14.00
co-kernel        0.39                       0.36

— The co-kernel has the best average performance
— The co-kernel has the most consistent performance
— System call forwarding has longer latency and suffers from cross-stack performance interference
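The numbers above come from timing repeated getpid() calls. A minimal sketch of that kind of microbenchmark follows; the iteration count is an assumption (the paper's exact harness may differ), and syscall(SYS_getpid) is used so each iteration really enters the kernel rather than hitting a cached value.

```c
/* Time repeated getpid() system calls and report the mean latency. */
#include <stdio.h>
#include <sys/syscall.h>
#include <time.h>
#include <unistd.h>

#define ITERS 1000000

int main(void)
{
    struct timespec start, end;

    clock_gettime(CLOCK_MONOTONIC, &start);
    for (long i = 0; i < ITERS; i++)
        syscall(SYS_getpid);      /* force a real kernel entry each time */
    clock_gettime(CLOCK_MONOTONIC, &end);

    double ns = (end.tv_sec - start.tv_sec) * 1e9
              + (end.tv_nsec - start.tv_nsec);
    printf("getpid: %.3f us per call\n", ns / ITERS / 1e3);
    return 0;
}
```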

SLIDE 18

Noise Analysis

[Figure: OS interruption latency (µs) over a 5-second window for Linux and the Kitten co-kernel, (a) without and (b) with competing workloads; each point represents the latency of one OS interruption]

Co-Kernel: less noise + better isolation
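One common way to collect traces like these is a "selfish detour"-style loop: spin reading the clock and record any gap noticeably larger than the loop's base cost, since such a gap means the OS interrupted the benchmark. A minimal sketch, with the 1 µs detection threshold and 5-second window as assumed parameters (the paper's harness may differ):

```c
/* Selfish-detour-style noise trace: print (time offset, latency)
 * pairs for every detected OS interruption over a 5-second run. */
#include <stdio.h>
#include <time.h>

static inline double now_us(void)
{
    struct timespec ts;
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return ts.tv_sec * 1e6 + ts.tv_nsec / 1e3;
}

int main(void)
{
    const double threshold_us = 1.0;   /* assumed gap that flags an interruption */
    double start = now_us(), prev = start, t;

    while ((t = now_us()) - start < 5e6) {   /* sample for 5 seconds */
        if (t - prev > threshold_us)
            printf("%.3f %.3f\n", (t - start) / 1e6, t - prev); /* s, µs */
        prev = t;
    }
    return 0;
}
```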

SLIDE 19

Single Node Performance

19

1 CentOS Kitten/KVM co-Kernel 82 83 84 85 Completion Time (Seconds) without bg with bg

250 CentOS Kitten/KVM co-Kernel 20250 20500 20750 21000 21250 Throughput (GUPS) without bg with bg

CoMD Performance Stream Performance

Co-Kernel: consist performance + performance isolation

SLIDE 20

8 Node Performance

[Figure: throughput (GFLOP/s) vs. number of nodes (1–8) for co-VMM, native, and KVM, each without and with background (bg) workloads]

w/o bg: co-VMM achieves native Linux performance
w/ bg: co-VMM outperforms native Linux

SLIDE 21

Co-VMM for HPC in the Cloud

[Figure: CDF (%) of HPCCG runtime (44–51 seconds) for Co-VMM, Native, and KVM]

CDF of HPCCG Performance (running with Hadoop, 8 nodes)

co-VMM: consistent performance + performance isolation

SLIDE 22

Related Work

— Exascale operating systems and runtimes (OS/Rs)
  — Hobbes (SNL, LBNL, LANL, ORNL, U. Pitt, various universities)
  — Argo (ANL, LLNL, PNNL, various universities)
— FusedOS (Intel / IBM)
— mOS (Intel)
— McKernel (RIKEN AICS, University of Tokyo)

Our uniqueness: performance isolation, dynamic resource composition, lightweight virtualization

SLIDE 23

Conclusion

— Design and implementation of the Pisces co-kernel architecture
  — Pisces framework
  — Kitten co-kernel
  — Palacios VMM for the Kitten co-kernel
— Demonstrated that the co-kernel architecture provides
  — Optimized execution environments for in situ processing
  — Performance isolation

https://software.sandia.gov/trac/kitten
http://www.prognosticlab.org/pisces/
http://www.prognosticlab.org/palacios

SLIDE 24

Thank You

Jiannan Ouyang
— Ph.D. Candidate @ University of Pittsburgh
— ouyang@cs.pitt.edu
— http://people.cs.pitt.edu/~ouyang/
— The Prognostic Lab @ U. Pittsburgh
— http://www.prognosticlab.org