SLIDE 1

Supercomputing Operating Systems: A Naive View from Over the Fence

Timothy Roscoe (Mothy), Systems Group, ETH Zurich

ROSS Workshop, 22nd June 2012

SLIDE 2

Disclaimer: I am a stranger in a strange land

Thank you for inviting me!

  • I’m assuming your field is “Supercomputing”
  • Mine isn’t: I’m a “mainstream” OS researcher
    – Expect considerable naïveté on my part
  • This talk is about the possible intersection and interaction of “Supercomputing” and “OS research”
  • I will exaggerate for effect.
    – Please don’t take it the wrong way.




SLIDE 6

Traditionally…

  • Supercomputing people built and programmed their own machines
    – Wrote their own operating systems and/or complained about the existing ones
  • Mainstream OS people ignored them
    – Insignificant market, no real users
    – Weird, expensive hardware (too many cores)

This is, of course, changing.

SLIDE 7

What’s happening in general-purpose computing?

SLIDE 8

Lots more cores per chip

  • Core counts now follow Moore’s Law
  • Cores will come and go
    – Energy!
  • Diversity of system and processor configurations will grow
  • Cache coherence may not scale to the whole machine

SLIDE 9

Parallelism

  • “End of the free lunch”: cores are not getting faster!
  • Higher performance ⇒ better parallelism
  • New applications ⇒ parallel applications
    – Mining
    – Recognition
    – Synthesis

SLIDE 10

Cores will be heterogeneous

  • NUMA is the norm today
  • Heterogeneous cores for power reduction
  • Dark silicon, specialized cores
  • Integrated GPUs / Crypto / NPUs etc.
  • Programmable peripherals

SLIDE 11

Communication latency really matters

Example: 8 × quad-core AMD Opteron

[Diagram: eight sockets on the interconnect, two with PCIe attached; each socket holds four CPUs with private L1/L2 caches, a shared L3, and local RAM]

                 Access cycles   Normalized to L1   Per-hop cost
  L1 cache              2               1
  L2 cache             15               7.5
  L3 cache             75              37.5
  Other L1/L2         130              65
  1-hop cache         190              95                60
  2-hop cache         260             130                70

(The per-hop cost is the extra latency each additional interconnect hop adds: 190 − 130 = 60 cycles for the first hop, 260 − 190 = 70 for the second.)


SLIDE 13

Implications

  • Computers are systems of cores and other devices which:
    – Are connected by highly complex interconnects
    – Entail significant communication latency between nodes
    – Consist of heterogeneous cores
    – Show unpredictable diversity of system configurations
    – Have dynamic core set membership
    – Provide only limited shared memory or cache coherence

The OS model of cooperating processes over a shared-memory multithreaded kernel is dead.

SLIDE 14

What’s really new?

  • Actually, multiprocessors are nothing new in general-purpose computing
  • Neither are threads: people have been building systems with threads for a long time.
    – Word, databases, games, servers, browsers, etc.
  • Concurrency is old. We understand it.
  • Parallelism is new.

SLIDE 15

Parallels with Supercomputing

  • Lots of cores
  • Implies parallelism should be used!
  • Message passing predominates
  • Heterogeneous cores (GPUs, Cell BE, etc.)
  • Lots of algorithms highly tuned to complex interconnects, memory hierarchies, etc.

Surely we can use all the cool ideas in supercomputing for our new OS!

SLIDE 16

Barrelfish: our multikernel

  • ETH Zurich + Microsoft Research
  • Open source (MIT Licence)
  • Published 2009
  • Under active development
  • External user community
  • See www.barrelfish.org

SLIDE 17

Non-original ideas in Barrelfish: techniques we liked

  • Capabilities for resource management (seL4)
  • Minimize shared state (Tornado, K42)
  • Upcall processor dispatch (Psyche, Sched. Activations)
  • Push policy into user-space domains (Exokernel, Nemesis)
  • User-space RPC decoupled from IPIs (URPC)
  • Lots of information (Infokernel)
  • Single-threaded non-preemptive kernel per core (K42)
  • Run drivers in their own domains (µkernels, Xen)
  • Specify device registers in a little language (Devil) — sketched below
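To make the last point concrete: below is a sketch of the kind of hand-written C accessor code a Devil-style register specification is meant to generate mechanically. The register layout is invented for illustration; it is not from any real device or from Barrelfish.

```c
/* Hand-rolled accessors for an invented device register:
 *   bits 0-3 = transmit queue length, bit 7 = interrupt enable.
 * A Devil-style spec declares the layout once and generates
 * type-checked accessors like these automatically. */
#include <stdint.h>

#define REG_CTRL_OFFSET 0x10u   /* hypothetical register offset */

static inline uint32_t ctrl_read(volatile uint32_t *base)
{
    return base[REG_CTRL_OFFSET / 4];
}

static inline void ctrl_set_txq_len(volatile uint32_t *base, uint32_t len)
{
    uint32_t v = ctrl_read(base);
    v = (v & ~0xFu) | (len & 0xFu);          /* replace bits 0-3 */
    base[REG_CTRL_OFFSET / 4] = v;
}

static inline void ctrl_enable_irq(volatile uint32_t *base)
{
    base[REG_CTRL_OFFSET / 4] = ctrl_read(base) | (1u << 7);
}
```

Writing this by hand for every register is tedious and error-prone, which is exactly why a little language is attractive.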

SLIDE 18

What things does it run on?

  • PCs: 32-bit and 64-bit x86 architectures
    – Including a mixture of the two!
  • Intel SCC
  • Intel MIC platform
  • Various ARM platforms
  • Beehive
    – Experimental Microsoft Research softcore

Seamlessly with x86 host PCs!

SLIDE 19

What things run on it?

  • Many microbenchmarks
  • Webserver: http://www.barrelfish.org/
  • Databases: SQLite, PostgreSQL, etc.
  • Virtual machine monitor
    – Linux kernel binary
  • Microsoft Office 2010!
    – via Drawbridge
  • Parallel benchmarks:
    – PARSEC, SPLASH-2, NAS

More on this later…

SLIDE 20

Rethinking OS Design #1: the Multikernel Architecture

SLIDE 21

The Multikernel Architecture

  • Computers are systems of cores and other devices which:
    – Are connected by highly complex interconnects
    – Entail significant communication latency between nodes
    – Consist of heterogeneous cores
    – Show unpredictable diversity of system configurations
    – Have dynamic core set membership
    – Provide only limited shared memory or cache coherence

⇒ Forget about shared memory. The OS is a distributed system based on message passing.

SLIDE 22

Multikernel principles

  • Share no data between cores
    – All inter-core communication is via explicit messages
    – Each core can have its own implementation
  • OS state partitioned if possible, replicated if not
    – State is accessed as if it were a local replica
  • Invariants enforced by distributed algorithms, not locks
    – Many operations become split-phase and asynchronous (see the sketch below)
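To make “split-phase and asynchronous” concrete, here is a minimal hypothetical C sketch of the pattern (my illustration, not Barrelfish code): instead of locking shared state, a core sends a request to the core that owns the state and supplies a continuation that runs when the reply arrives.

```c
/* Hypothetical sketch of a split-phase, message-based OS operation,
 * in the spirit of the multikernel model (not actual Barrelfish code). */
#include <stdio.h>

typedef void (*continuation_t)(int result, void *arg);

struct msg {
    int op;                 /* requested operation           */
    int arg;                /* operation argument            */
    continuation_t cont;    /* run this when the reply comes */
    void *cont_arg;
};

/* Stand-in for a real inter-core channel: we "deliver" synchronously
 * here so the sketch runs on one core. */
static void remote_core_handle(struct msg *m)
{
    int result = m->arg * 2;        /* owner core updates its replica */
    m->cont(result, m->cont_arg);   /* reply triggers continuation    */
}

static void send_request(int op, int arg, continuation_t cont, void *carg)
{
    struct msg m = { op, arg, cont, carg };
    remote_core_handle(&m);         /* phase 1: request leaves core   */
}

static void on_reply(int result, void *arg)
{
    printf("phase 2: got result %d for request %s\n", result, (char *)arg);
}

int main(void)
{
    /* Phase 1 issues the request and returns immediately;
     * phase 2 (on_reply) runs when the owner core answers. */
    send_request(1, 21, on_reply, "update-pagetable");
    return 0;
}
```

In a real multikernel the delivery would of course be an asynchronous inter-core message rather than the direct call used here to keep the sketch runnable.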

SLIDE 23

The multikernel model

[Diagram: heterogeneous hardware — x86_64 CPUs, an ARM core, a NIC, and a GPU with CPU features — joined by one or more interconnects. Each core runs its own OS node holding a state replica; the nodes exchange asynchronous messages. Applications in user space span several OS nodes; only the architecture-specific code differs per node.]

SLIDE 24

…vs a monolithic OS on multicore

[Diagram: four x86 cores share a single kernel across the interconnect; main memory holds the global kernel data structures, and applications run on top of the one shared kernel.]

SLIDE 25

…vs a microkernel on multicore

[Diagram: four x86 cores run a shared microkernel; OS servers and applications run in user mode, each holding its own state, with the kernel’s own state kept minimal underneath.]



SLIDE 28

Replication vs sharing as the default

  • Replicas used as an optimization in other systems
  • In a multikernel, sharing is a local optimisation
    – Shared (locked) replica on closely-coupled cores
    – Only when faster, as decided at runtime
  • Basic model remains split-phase messaging

[Spectrum diagram, from traditional OSes to the multikernel: shared state with one big lock → finer-grained locking → clustered objects → partitioning → distributed state with replica maintenance]

SLIDE 29

Rethinking OS Design #2: the System Knowledge Base

SLIDE 30

System knowledge base

  • Computers are systems of cores and other devices which:
    – Are connected by highly complex interconnects
    – Entail significant communication latency between nodes
    – Consist of heterogeneous cores
    – Show unpredictable diversity of system configurations
    – Have dynamic core set membership
    – Provide only limited shared memory or cache coherence

⇒ Give the OS advanced reasoning techniques to make sense of the hardware and workload at runtime.

SLIDE 31

System knowledge base

  • Fundamental operating system service
  • Knowledge-representation framework
    – Database
    – RDF
    – Logic programming and inference
    – Description logics
    – Satisfiability modulo theories
    – Constraint satisfaction
    – Optimization

SLIDE 32

What goes in?

  1. Resource discovery (see the CPUID sketch below)
     – E.g. PCI enumeration, ACPI, CPUID…
  2. Online hardware profiling
     – Inter-core all-pairs latency, cache measurements…
  3. Operating system state
     – Locks, process placement, etc.
  4. “Things we just know”
     – Assertions from data sheets, etc.
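As a concrete taste of item 1, here is a hedged, standalone C sketch of one discovery probe — reading the CPU vendor string and a feature flag via CPUID, using GCC/Clang’s <cpuid.h>. The Prolog-ish output format merely hints at how such facts might be stored; Barrelfish’s actual probing code and SKB fact syntax are not shown here.

```c
/* Minimal sketch of CPUID-based resource discovery (GCC/Clang).
 * A real SKB loader would assert facts like these into its database;
 * this standalone probe just prints them in a fact-like syntax. */
#include <stdio.h>
#include <string.h>
#include <cpuid.h>

int main(void)
{
    unsigned eax, ebx, ecx, edx;
    char vendor[13] = {0};

    /* Leaf 0: highest supported leaf and the 12-byte vendor string. */
    if (!__get_cpuid(0, &eax, &ebx, &ecx, &edx))
        return 1;
    memcpy(vendor + 0, &ebx, 4);
    memcpy(vendor + 4, &edx, 4);
    memcpy(vendor + 8, &ecx, 4);
    printf("vendor(%s).\n", vendor);

    /* Leaf 1: feature flags, e.g. whether SSE2 is present. */
    __get_cpuid(1, &eax, &ebx, &ecx, &edx);
    printf("feature(sse2, %s).\n", (edx & bit_SSE2) ? "true" : "false");
    return 0;
}
```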

SLIDE 33

What is it used for?

  • Name service and registry
  • Locking/coordination service
  • Device management
  • Hardware configuration
  • Spatial scheduling and thread placement (toy sketch below)
  • Optimization for hardware platform
  • Intra-machine routing
  • etc.
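To illustrate “spatial scheduling and thread placement”: a toy C sketch (mine, not SKB code) that brute-forces the placement of two communicating threads onto the pair of cores with the lowest core-to-core latency. The latency matrix is invented; an SKB would hold real all-pairs measurements (slide 32) and answer such queries with constraint solving rather than enumeration.

```c
/* Toy placement search: pick the pair of distinct cores with the
 * lowest measured core-to-core latency for two chatty threads.
 * The latency matrix is made up for illustration only. */
#include <stdio.h>

#define NCORES 4

static const int latency[NCORES][NCORES] = {   /* cycles, invented */
    {   0, 130, 190, 260 },
    { 130,   0, 190, 260 },
    { 190, 190,   0, 130 },
    { 260, 260, 130,   0 },
};

int main(void)
{
    int best_a = 0, best_b = 1, best = latency[0][1];
    for (int a = 0; a < NCORES; a++)
        for (int b = 0; b < NCORES; b++)
            if (a != b && latency[a][b] < best) {
                best = latency[a][b];
                best_a = a; best_b = b;
            }
    printf("place threads on cores %d and %d (latency %d cycles)\n",
           best_a, best_b, best);
    return 0;
}
```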

SLIDE 34

So what happened?

SLIDE 35

What happened?

  • Barrelfish achieved some of its goals
    – Showed scalability, adaptability, support for heterogeneous machines
    – More work in the pipeline
  • HPC people contacted us because, apparently, they wanted a new OS
    – We couldn’t understand why.
  • Much of what we borrowed from supercomputing turned out to be of limited use.
    – Why?

SLIDE 36

General-purpose computing ≠ Supercomputing

SLIDE 37

The hardware is different.

SLIDE 38

These are supercomputers.

Artistic case design! Plenty of custom hardware!

SLIDE 39

Supercomputers don’t just look cool

  • Supercomputers have cool hardware!
    – Message-passing networks
    – In-network collection and reduction primitives
    – Fault-tolerance & partial failure
    – Vector units
    – Etc.

SLIDE 40

This is not a supercomputer.

SLIDE 41

This is not a supercomputer.

This is Facebook.

SLIDE 42

Neither is this.

SLIDE 43

Neither is this.

This is actually a Microsoft 40-foot shipping container.

SLIDE 44

Not very glamorous case design.

SLIDE 45

These aren’t supercomputers either

SLIDE 46

The software is different

SLIDE 47

This is not a supercomputing application.

SLIDE 48

Computationally intensive, highly parallelizable

  • Vision and depth-cam processing
  • Skeletal body tracking
  • Facial feature and gesture recognition
  • Audio beamforming
  • Speech and phoneme recognition
  • 3D mesh construction

SLIDE 49

These are also not supercomputing applications.

  • Facebook
  • Google
  • Bing
  • Second Life
  • World of Warcraft
  • Twitter
  • YouTube
  • etc.

SLIDE 50

General-purpose software is…

  • Parallel (increasingly)
    – But complex, dynamic structure!
  • Continuous
    – Long-running services
  • Soft real-time
    – Bounded response time, interactivity
  • Imprecise
    – Sometimes it’s better to be wrong than late
  • Bursty, dynamic, interactive
    – No clear execution cycle, load changes unexpectedly

SLIDE 51

Overall workload is different.

SLIDE 52

Workload assumptions

  • General-purpose OS target:
    – Many concurrent tasks
    – Diverse performance requirements
    – Unpredictable mix
    – Goal: satisfy SLAs and then optimize power, throughput, responsiveness, etc.
  • Supercomputing:
    – Serial jobs. Complete each one ASAP.


SLIDE 56

Example: how long should a thread spin?

  • Operating systems answer:
    1. It depends (on the workload)
    2. The time taken to context switch (if you know nothing about the workload)
  • HPC answer:
    – As long as it takes for something to happen.
    – Intel OpenMP default spinwait time: 200 ms

600,000,000 cycles @ 3 GHz!
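The OS answer — spin for about one context-switch time, then block — is classic competitive spinning. A hedged C sketch using pthreads (illustrative only; the threshold constant is made up and would be measured in practice):

```c
/* Sketch of competitive spinning: spin for roughly the cost of a
 * context switch, then fall back to a blocking lock. The threshold
 * is an invented constant; a real system would calibrate it. */
#include <pthread.h>
#include <stdint.h>

#define SPIN_LIMIT_ITERS 10000   /* stand-in for context-switch cost */

static void competitive_lock(pthread_mutex_t *m)
{
    for (uint64_t i = 0; i < SPIN_LIMIT_ITERS; i++) {
        if (pthread_mutex_trylock(m) == 0)
            return;              /* won while spinning: cheap path */
    }
    pthread_mutex_lock(m);       /* spun one "context switch" worth:
                                    now block and yield the core    */
}

static pthread_mutex_t lock = PTHREAD_MUTEX_INITIALIZER;

int main(void)
{
    competitive_lock(&lock);
    /* ... critical section ... */
    pthread_mutex_unlock(&lock);
    return 0;
}
```

With the threshold set near the context-switch cost, the thread wastes at most roughly twice the optimal amount of time (the standard 2-competitive argument); a 200 ms spin — 600,000,000 cycles at 3 GHz — is several orders of magnitude beyond that.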

SLIDE 57

Consequences


SLIDE 59

1. Hardware optimization techniques not directly applicable

  • Good performance ⇒ careful use of hardware
    – Caches and memory hierarchy
    – Microarchitecture dependencies
    – Interconnect topology
  • But:
    – Current hardware changes faster than software can
    – Commodity hardware already massively diverse
    – Dynamic sharing changes the problem

⇒ Cannot tune the OS, or any other program, to the hardware at design time

SLIDE 60

1. Hardware optimization techniques not directly applicable

  • Techniques can be used (and already are), but:
    – Can’t be baked into the software
    – Have to adapt dynamically to current hardware
  • We use the SKB to optimize spatial placement, cache awareness, etc.
    – Must interact with the OS scheduler
  • Use Scheduler Activations, SKB state, user-level threads, etc.
  • Much ongoing research!

SLIDE 61

2. Benchmarks of limited use

  • PARSEC-2, etc. are highly stylized
    – For good reason: they highlight a range of execution patterns
    – Focus on performance of “simple” codes
    – Very little I/O
  • Don’t stress the OS (or even the runtime)
  • A general-purpose job mix would have:
    – Concurrent programs with diverse requirements
    – Multiple parallel tasks within a program
    – Copious I/O and asynchronicity

SLIDE 62

2. Benchmarks of limited use

  • Still may be useful for:
    – Characterizing some execution patterns
    – Synthetic load generators
    – Building blocks for larger workloads?
  • Open question: how to benchmark general-purpose system software?
    – Cf. Avatar Kinect, etc.

SLIDE 63

3. Co-scheduling doesn’t work (yet)

  • Almost nothing benefits from gang scheduling
    – Competitive spinning ⇒ backfilling makes more efficient use of the machine
    – If one app needs it ⇒ schedule it with priority
    – More than one app ⇒ spatially partition, or greedily schedule as best-effort
    – Only of benefit when compute phase ≈ context switch time
  • Impact on turnaround time for one job is negligible.

SLIDE 64

3. Co-scheduling doesn’t work (yet)

  • Some kind of coordinated scheduling might be useful:
    – Multiple, parallel database joins
    – SMP virtual machines
  • Needs to understand:
    – I/O operations
    – IPC
    – Etc.

SLIDE 65

HPC folks were worried about OS “noise”

  • Two problems:
    1. Message latency
    2. CPU “jitter”
  • Message latency:
    – Custom message-passing hardware is rarely user-safe
    – Map the device into user space (VIA, etc.)
    – More recent tricks: abuse SR-IOV!

SLIDE 66

CPU jitter

  • CPU jitter is a spatial scheduling non-problem
    – At least in the OS research community
    – If you perform I/O, it’s game over anyway
    – If you don’t, your problem is caches and interrupts
  • So, if you really want performance isolation (see the sketch below):
    – Steer all your interrupts to different cores
    – Place applications to avoid cache crosstalk
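A hedged, Linux-flavoured C sketch of the second remedy — pinning an application to a chosen core with sched_setaffinity(2). Interrupt steering is the other half, done separately by writing CPU masks under /proc/irq/ (needs root, so it is only mentioned in a comment); core 3 is an arbitrary choice for the demo.

```c
/* Sketch: pin this process to one core so that neither the scheduler
 * nor (once IRQs are steered elsewhere) interrupts perturb it.
 * Linux-specific; core number 3 is arbitrary. */
#define _GNU_SOURCE
#include <sched.h>
#include <stdio.h>

int main(void)
{
    cpu_set_t set;
    CPU_ZERO(&set);
    CPU_SET(3, &set);                 /* run only on core 3 */

    if (sched_setaffinity(0, sizeof(set), &set) != 0) {
        perror("sched_setaffinity");
        return 1;
    }
    printf("pinned to core 3; steer IRQs away via /proc/irq/*/smp_affinity\n");
    return 0;
}
```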

SLIDE 67

  • Q. Why does no general-purpose OS do this?
  • A. Nobody cares.
    – Plenty of tasks that you want to run anyway
    – Applications aren’t sensitive to jitter
    – Most spend lots of time in the kernel
  • However, Barrelfish can isolate applications…
    – Potentially useful for future applications
    – Investigate when Torsten Höfler arrives at ETHZ!


SLIDE 71

4. Messaging hardware isn’t useful (yet)

  • HPC-inspired proposals appearing for commodity hardware
    – E.g. Intel SCC message buffers
  • Tailored to a single user
    – Can’t be multiplexed efficiently
    – Requires kernel mediation for protection ⇒ prohibitively expensive to use
  • Tailored to a single application
    – Small, bounded buffers ⇒ expensive flow control
    – Hard to context switch

SLIDE 72

4. Messaging hardware isn’t useful (yet)

  • Design of useful hardware support for general-purpose messages is an open research area (a software baseline is sketched below)
    – User-level multiplexing
    – Decoupling notification from delivery
    – Flow control and congestion avoidance
    – API design
  • Many ideas from MPI, Blue Gene, etc. are highly relevant
    – But they require considerable changes!
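For context, this is roughly what multikernels build in software today in the absence of messaging hardware: a user-level channel over cache-coherent shared memory, in the spirit of the URPC technique listed on slide 17. The sketch below is hypothetical C11, single-producer/single-consumer, and deliberately leaves out multiplexing, notification, and flow control — exactly the open problems listed above.

```c
/* Minimal user-level message channel over shared memory (C11 atomics).
 * Single producer, single consumer; channel memory must start zeroed.
 * A hypothetical sketch in the spirit of URPC-style channels, not
 * actual Barrelfish code. */
#include <stdatomic.h>
#include <stdint.h>
#include <stdbool.h>

#define SLOTS 64                    /* power of two */

struct slot {
    uint64_t payload[7];            /* message body                */
    _Atomic uint64_t seq;           /* written last: publishes msg */
};

struct channel {
    struct slot ring[SLOTS];
    uint64_t send_next;             /* producer-private cursor */
    uint64_t recv_next;             /* consumer-private cursor */
};

/* Producer: write payload, then publish by storing the sequence. */
static bool chan_send(struct channel *c, const uint64_t *msg)
{
    struct slot *s = &c->ring[c->send_next % SLOTS];
    if (atomic_load_explicit(&s->seq, memory_order_acquire) != 0)
        return false;               /* slot unconsumed: flow control */
    for (int i = 0; i < 7; i++)
        s->payload[i] = msg[i];
    atomic_store_explicit(&s->seq, ++c->send_next, memory_order_release);
    return true;
}

/* Consumer: poll the next slot. Spinning here is the "notification
 * decoupled from delivery" question, in hardware-support terms. */
static bool chan_recv(struct channel *c, uint64_t *msg)
{
    struct slot *s = &c->ring[c->recv_next % SLOTS];
    if (atomic_load_explicit(&s->seq, memory_order_acquire) != c->recv_next + 1)
        return false;               /* nothing new yet */
    for (int i = 0; i < 7; i++)
        msg[i] = s->payload[i];
    atomic_store_explicit(&s->seq, 0, memory_order_release);  /* free slot */
    c->recv_next++;
    return true;
}

int main(void)
{
    static struct channel ch;       /* static ⇒ zeroed: all slots free */
    uint64_t out[7] = {42}, in[7];
    chan_send(&ch, out);
    return chan_recv(&ch, in) && in[0] == 42 ? 0 : 1;
}
```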

SLIDE 73

Conclusion

  • Supercomputing and OS research: traditionally disjoint areas
    – Things are changing in both areas
    – Each side has ideas useful to the other
  • Problems and assumptions remain very different
    – Cross-fertilization of fields is difficult (but interesting!)

SLIDE 74

Open questions

  • What ideas from supercomputing might be important to the design of general-purpose operating systems?
  • Are there concepts and challenges from general-purpose operating systems which are becoming a concern in supercomputing?