2 More Paper Goals The L4 Microkernel Is this actually useful? Is - PDF document

A short history of kernels Early kernel: a library of device drivers, support for � threads (QNX) Operating System Kernels Monolithic kernels: Unix, VMS, OS 360… � � Unstructured but fast… � Over time, became very large � Eventually, DLLs helped on size Ken Birman Pure microkernels: Mach, Amoeba, Chorus… � (borrowing some content from � OS as a kind of application Impure microkernels: Modern Windows OS � Peter Sirokman) � Microkernel optimized to support a single OS � VMM support for Unix on Windows and vice versa The great µ -kernel debate Summary of First Paper � How big does it need to be? � The Performance of µ-Kernel-Based Systems (Hartig et al. 16th SOSP, Oct 1997) � With a µ -kernel protection-boundary � Evaluates the L4 microkernel as a basis for a full crossing forces us to operating system � Change memory -map � Ports Linux to run on top of L4 and compares � Flush TLB (unless tagged) performance to native Linux and Linux running on � With a macro-kernel we lose structural the Mach microkernel protection benefits and fault -containment � Explores the extensibility of the L4 microkernel � Debate raged during early 1980’s Summary of Second Paper In perspective? � L4 seeks to validate idea that a µ -kernel � The Flux OSKit: A Substrate for Kernel and Language Research (Ford et al. 16th SOSP, can support a full OS without terrible 1997) cost penalty � Describes a set of OS components designed to be � Opened the door to architectures like the used to build custom operating systems Windows one � Includes existing code simply using “glue code” � Flux argues that we can get desired � Describes projects that have successfully used the OSKit structural benefit in a toolkit and that runtime µ -kernel structure isn’t needed 1

Microkernels Monolithic and Micro-kernels � An operating system kernel that provides minimal services � Usually has some concept of threads or processes, address spaces, and interprocess communication (IPC) � Might not have a file system, device drivers, or network stack Microkernels: Pro Context Switches � Flexibility: allows multiple choices for any service not implemented in the microkernel � Modular design, easier to change � Stability: � Smaller kernel means it is easier to debug � User level services can be restarted if they fail � More memory protection Microkernel: Con Paper Goals � Performance � Is it possible to build an OS on a Microkernel that performs well? � Requires more context switches � Goal is to prove that it is � Each “system call” must switch to the kernel � Port Linux to run on top of L4 (a microkernel) and then to another user level process � Compare performance of L4Linux to native Linux � Context switches are expensive � Since L4Linux is a “complete” operating system, it � State must be saved and restored is representative of microkernel operating systems � TLB is flushed 2

More Paper Goals The L4 Microkernel � Is this actually useful? Is the � Operations: microkernel extensible? � The kernel starts with one address space, which is essentially physical memory � Implemented a second memory manager � A process can grant, map, or unmap pages of size optimized for real-time applications to run 2 n from its own virtual address space alongside Linux on L4 � Some user level processes are pagers and do � Implemented an alternative IPC for memory management (and possibly virtual memory) for other processes using these applications that used L4 directly (requires primitives. modifying the application) The L4 Microkernel (continued) L4Linux � Provides communication between address � Linux source has two cleanly separated parts spaces (inter-process communication or IPC) � Architecture dependent � Page faults and interrupts are forwarded by � Architecture independent the kernel to the user process responsible for � In L4Linux them (i.e. pagers and device drivers) � Architecture dependent code is replaced by L4 � Architecture independent part is unchanged � On an exception, the kernel transfers control back to the thread’s own exception handler � L4 not specifically modified to support Linux L4Linux (continued) L4Linux – System Calls � The statically linked and the shared C libraries � Linux kernel as L4 user service are modified � Runs as an L4 thread in a single L4 � System calls in the library call the kernel using L4 address space IPC � Creates L4 threads for its user processes � For unmodified native Linux applications � Maps parts of its address space to user there is a “trampoline” process threads (using L4 primitives) � The application traps to the kernel as normal � Acts as pager thread for its user threads � The kernel bounces control to a user-level exception handler � Has its own logical page table � The handler calls the modified shared library � Multiplexes its own single thread (to avoid having to change Linux source code) 3

A note on TLBs Microkernel Cons Revisited � A significant portion of the performance � Translation Lookaside Buffer (TLB) penalty of using a microkernel comes from caches page table lookups the added work to reload the page table into � On context switch, TLB needs to be the TLB on every context switch flushed � Since L4 runs in a small address space, it runs with a simulated tagged TLB � A tagged TLB tags each entry with an � Thus, the TLB is not flushed on every context address space label, avoiding flushes switch � A Pentium CPU can emulate a tagged � Note that some pages will still be evicted – TLB for small address spaces but not as many Performance – The Performance – Compatibility Competitors � Mach 3.0 � A “first generation” microkernel � L4Linux is binary compatible with native Linux from the applications point of � Developed at CMU view. � Originally had the BSD kernel inside it � L4 � A “second generation” microkernel � Designed from scratch Performance - Microbenchmarks Performance – Benchmarks � Compared the following systems � Native Linux � L4Linux � MkLinux (in- kernel) � Linux ported to run inside the Mach microkernel � MkLinux (user) � Linux ported to run as a user process on top of the Mach microkernel 4

Performance - Performance - Macrobenchmarks Macrobenchmarks � AIM Benchmark Suite VII simulates “different application loads” using “ Load Mix Modeling”. � This benchmark has fallen out of favor but included various compilation tasks � Tasks are more representative of development in a systems lab than production OS in a web farm or data center Performance – Analysis So What? � If performance suffers, there must be other � L4Linux is 5% - 10% slower than native benefits – Extensibility for macrobenchmarks � While Linux pipes in L4Linux are slower than in � User mode MkLinux is 49% slower native Linux, pipes implemented using the bare L4 interface are faster (averaged over all loads) � Certain primitive virtual-memory options are faster � In-kernel MkLinux is 29% slower using the L4 interface than in native Linux � Cache partitioning allows L4Linux to run (averaged over all loads) concurrently with a real-time system with better � Co-location of kernel is not enough for timing predictability than native Linux good performance Microkernel Con: Revisited Again Flux OS � The Linux kernel was essentially � Research group wanted to experiment with microkernel designs unmodified � Decided that existing microkernels (Mach) � Results from “extensibility” show that were too inflexible to be modified improvements can be made (e.g. pipes) � Decided to write their own from scratch � If the entire OS were optimized to take � In order to avoid having it become inflexible, advantage of L4, performance would built it in modules probably improve � Invented an operating system building kit! � Goal Demonstrated 5

The Flux OSKit Adapting Existing Code � Many OS projects attempt to leverage � Writing Operating Systems is hard: existing code � Relevant OSs have lots of functionality: � Difficult � File system � Many parts of operating systems are � Network Stack interdependent � Debugging � E.g. File system depends on a specific � Large parts of OS not relevant to specific memory management technique research � E.g. Virtual memory depends on the file � Not cost effective for small groups system � Hard to separate components Separating OS Components OSKit � OSKit is not an operating system � OSKit is a set of operating system components � OSKit components are designed to be as self - sufficient as possible � OSKit components can be used to build a custom operating system – pick and choose the parts you want – customize the parts you want Diagram of OSKit Example OS using OSKit 6

2 More Paper Goals The L4 Microkernel Is this actually useful? Is - PDF document

A short history of kernels Early kernel: a library of device drivers, support for threads (QNX) Operating System Kernels Monolithic kernels: Unix, VMS, OS 360 Unstructured but fast Over time, became very large

Experiences with the Amoeba Distributed Operating System (Andrew Tanenbaum, Robbert van Renesse,

Intersecting Solitons, Amoeba and Tropical Geometry Norisuke Sakai (Tokyo Womans Christian

Amoeba Group communicate 1-to-n Members join and leave group Communication Ordering

Unit 15: Experimental Microkernel Systems 15.1. The Amoeba Distributed Operating System AP 9/01

Administrivia CS 4410: Operating Systems Fall 2019 Professors Schneider, Van Renesse [R.

The story of the film so far... We are discussing continuous-time Markov processes known as birth

http://www.xerial.org/ I DECIDED TO EVERYBODY MUST START MASTERING XML IS LEARNING SAX, DOM,

Lecture 7: Factor Analysis Princeton University COS 495 Instructor: Yingyu Liang Supervised v.s.

A Robust Partitioning Scheme for Ad-Hoc Query Workloads ANIL SHANBHAG MIT J/W Alekh Jindal,

for High Throughput in Data-Parallel Clusters Jinwei Liu * Haiying Shen and Ankur Sarker

Adventures in Multicellularity The social amoeba ( a.k.a. slime molds ) Dictyostelium discoideum

Rhythm: Component-distinguishable Workload Deployment in Datacenters Laiping Zhao 1 , Yanan Yang 1

Spool : Reliable Virtualized NVMe Storage Systems in Public Cloud Infrastructure Shuai Xue,

CPSC 121: Models of Computation Module 6: Rewriting predicate logic statements Module 6:

Last Time: We want RL Algorithms that Perform Optimization Delayed consequences Exploration

Object-Oriented Programming What

AN ALGEBRA APPROACH TO TROPICAL MATHEMATICS Louis Rowen, Department of Mathematics, Bar-Ilan

TagYoure It! TagYoure It! Michelle Cummings M.S. Training Wheels Michelle

1 Ikegamis Theorem for zero-dimensional Polish spaces Let I be a -ideal on a set X . We call

Analytic Combinatorics in Several Variables Robin Pemantle and Mark Wilson A of A conference, 30

Communication Protocol a pentesters view, or rude Oracle experiments Roman Bazhin ZeroNights

Photos of Real Microscope Views of Bacterial Slides As you go through this presentation ask

CSci 5103 Operating Systems Jon Weissman The Landscape at 50K feet OSPP ~ Chap. 2; Review A

VIRTUAL ARCHITECTURAL PRACTICE - AN ALTERNATE REALITY PRODUCED BY AIA PMKC + AIA TRUST 1.5