Dr. Robert Grossman Professor and Director, Division of Biological - PowerPoint PPT Presentation

• Dr. Robert Grossman – Professor and Director, Division of Biological Sciences & Computation Institute, University of Chicago • Dr. Xian-He Sun – Chair and Professor, Computer Science, Illinois Institute of Technology • Dr. Judy Qiu – Assistant Professor, Computer Science and Informatics, Indiana University • Dr. Alexandru Iosup – Assistant Professor, Faculty of Engineering, Mathematics and Computer Science, Delft University of Technology, the Netherlands 2 MTAGS13: Panel -- Many-Task Computing meets Big Data

• We want to compute genomic variants. • How can this be done as a distributed 1,000,000 patients computation over science clouds? 1,000 PB • What are the APIs? • What are the key common services? • What is the governance structure? 100,000 patients • What is the sustainability model? 100 PB 10,000 patients 10 PB 1000 patients 1PB

Big Data require both HPC and HTC, that is MTC, and is mixed compute-intensive and data- intensive components 1. Human provides information to the machine. 3. Human provides raw data, Information machine outputs answer, (Objects and Labels ) human consumes the information 2. Machine learns the appropriate function. Data (Objects) f(Objects) → Labels Information (Objects and Labels ) Machine passively Human passively consumes consumes information information MTAGS13: Panel -- Many-Trask Computing meets Big Data

Decoupled-Execution Paradigm:  Handle computation- and data- intensive phases separately  One interface-Two systems, transparent to users  Integration, scheduling, optimization Supercomputer or many-core computing system Data cloud or storage cluster for execution of computing for execution of data intensive part of an application intensive part of an application Core Disk Network High speed network Y. Chen, C. Chen, X.-H. Sun, W. D. Gropp, and R. Thakur, "A Decoupled Execution Paradigm for Data-Intensive High-End Computing," IEEE Cluster'12, Sept, 2012

Interoperability between different file systems • Enable MPI Apps to access data-intensive file systems • HPC-Cloud, Data-Cloud MPI APPs MPI APPs ROMIO ADIO Data-Intensive File Systems (HDFS, KFS, GFS, etc) H. Jin, X.-H. Sun, et. al, "CHAIO: Enabling HPC Applications on Data-Intensive File Systems", ICPP2012 . MTAGS13: Panel -- Many-Trask Computing meets Big Data

Multi-core Out-of-order Execution CPU Multi-threading Speculative Execution Runahead Execution Multi-issue Multi-banked Cache Pipelined Cache Non-blocking Cache Cache Data Prefetching Multi-level Cache Write buffer Multi-channel Memory Multi-rank Multi-bank Input-Output (I/O) Parallel File System Disks MTAGS13: Panel -- Many-Trask Computing meets Big Data

• The traditional AMAT : HitCycle + MR×AMP. • MR is the miss rate of cache accesses; and AMP is the average miss penalty • The Concurrent AMAT : HitCycle/C H + MR×AMP/C M • C H is the hit concurrency; C M is the pure miss concurrency • Hit is always good, miss may not be necessary bad • Design Choice of memory systems X.-H. Sun and D. Wang, "Concurrent Average Memory Access Time", accepted to appear in IEEE Computers , 2013.(IIT Technical Report, IIT/CS-SCS-2012-05) MTAGS13: Panel -- Many-Trask Computing meets Big Data

Applications & Different Interconnection Patterns Judy Qiu Indiana University (a) Map Only (b) Classic (c) Iterative (d) Loosely (Pleasingly MapReduce MapReduce Synchronous Parallel) iterations Input Input Input map map map Pij reduce reduce Output Domain of MapReduce and Iterative Extensions MPI No Communication Collective Communication Collective Patterns Map-AllReduce Map-ReduceScatter MapReduce MapReduce- Map-AllGather • Wordcount, Grep • MDS-BCCalc MergeBroadcast • PageRank, Belief • KMeansClustering, • Matrix Mult • KMeansClustering, Propagation MDS-StressCalc PageRank MTAGS13: Panel -- Many-Trask Computing meets Big Data

VENI VENI VENI Alexandru Iosup Dick Epema Ana Lucia Henk Sips Johan Pouwelse Varbanescu Grids/Clouds Grids/Clouds HPC systems P2P systems P2P systems P2P systems HPC systems Multi-cores File-sharing Big Data Video-on-demand Multi-cores P2P systems Video-on-demand Online gaming e-Science Big Data Gamification e-Science Home page • www.pds.ewi.tudelft.nl Publications 10 • see PDS publication database at publications.st.ewi.tudelft.nl MTAGS13: Panel -- Many-Trask Computing meets Big Data

Applications from two worlds – E-Science (incl. Big Data) – Massively Multiplayer/Social Online Gaming (incl. Big Data) 10-years research in distributed systems – System design, development, and evaluation – Grid->Cloud computing, P2P->? Computing – Performance measurements, evaluation, modeling, b’marking – Grenchmark, Koala, Tribler, The Archives, [OpenTTD@large] 10 operational years research in comp. sci. A. Iosup and D. Epema , On the Gamification of a Graduate Course on Cloud Computing, SC|13 education Education Poster. – Gamification techniques in higher education A. Iosup and D. Epema , An Experience Report on Using Gamification in Technical Higher Education, SIGCSE 2014 . http://goo.gl/V97zSW http://www.pds.ewi.tudelft.nl/~iosup/ MTAGS13: Panel -- Many-Trask Computing meets Big Data

1. In the future, will Small-and-Medium Enterprises use elastic infrastructure running multiple frameworks? Many-Task Big-Data Processing on Clouds — GPUs – 2. In the future, should we risk working on scheduling policies? – Portfolio Scheduling 3. In the future, what is the role of job throughput, next to task throughput and peak performance (HPC)? 4. In the future, will social awareness be at the core of our shared distributed systems? 5. In the future, will it be possible to rate and rank distributed computing systems (benchmarking, also commercial issue)? MTAGS13: Panel -- Many-Trask Computing meets Big Data

1. How do you see MTC intersecting with MapReduce, HTC, and HPC? 2. Importance of data locality for Big Data ==> how important is data-aware scheduling for Many-Task Computing 3. Supercomputers are designed for HPC applications today; in the future, should they be designed to support both MTC and/or Big Data? 4. With the growing scale of systems, has a centralized MTC system become obsolete? Is distributed MTC management (both scheduling and storage) a necessary next step? MTAGS13: Panel -- Many-Trask Computing meets Big Data

• MTAGS 2013 Website: – http://datasys.cs.iit.edu/events/MTAGS13/ • Panel info: – http://datasys.cs.iit.edu/events/MTAGS13/panel.html • Workshop program (7 exciting talks in the PM) – http://datasys.cs.iit.edu/events/MTAGS13/program.html • Prize giveaway (win a Google Nexus 7): – http://datasys.cs.iit.edu/events/MTAGS13/prize.html • Contact – iraicu@cs.iit.edu MTAGS13: Panel -- Many-Trask Computing meets Big Data

Dr. Robert Grossman Professor and Director, Division of Biological - PowerPoint PPT Presentation

Dr. Robert Grossman Professor and Director, Division of Biological Sciences & Computation Institute, University of Chicago Dr. Xian-He Sun Chair and Professor, Computer Science, Illinois Institute of Technology Dr. Judy Qiu

Proposal's (RFP's) Robert D. Grossman President R. Grossman and Associates, Inc. Learning

Introduction to QFT Yuval Grossman Cornell Y. Grossman SM and flavor (1) ICTP, June 10, 2019

FPCP2013: Theory Summary Yuval Grossman Cornell Y. Grossman Theory Summary Buzios, Brazil, May

Symmetries + ... (3) Yuval Grossman Cornell Y. Grossman SM and flavor (3) ICTP, June 12, 2019

NuMI Beamline Radiation Safety Issues Nancy Grossman FNAL NBI03 November 2003 NUMI NBI2003

Modeling Highly Heterogeneous (Large) Data Sets: Towards a Billion Models Robert Grossman

using Sector/Sphere Yunhong Gu , Li Lu : University of Illinois at Chicago Robert Grossman :

United States Court of Appeals for the Federal Circuit 2007-1327 TRIMED, INCORPORATED,

The Great Migration An Online Professional Development Seminar James Grossman Executive Director

Automated Vehicles November 16, 2017 Ian Grossman, AAMVA 1 1 1. AAMVA Automated Vehicle

Terahertz Imaging and Security Applications Erich Grossman National Institute of Standards &

Transport LDRs Universal Waste Liquid Industrial Waste Used Oil 1 Michigan Environmental

On The Road to Sustainability Virginia M. Grossman, Superintendent of Schools Rose Wang

Elizabeth Grossman United States Equal Employment Opportunity Commission Regional Attorney, New

TRANSFORMING THE WORLD AND WASTED ORGANIC MATERIALS Samuel W Grossman, PE Vice President of

What Type of Wastes Do I Generate? 1 Michigan Environmental Michigan Environmental

Life After Xdebug Setup. June 7th, 2019 Oz Heller Drupal.org: oheller LinkedIn: oheller Slack:

EPJ A special talk Colin Wilkin, University College London Physics & Astronomy The legacy of

Background Determiners are words like every/some/most They express a relation between

Welcome to Welcome to PARTNERS FOR CLEAN STREAMS 4 TH ANNUAL GENERAL MEETING & AWARDS 2010

KOALA A PLATFORM FOR OS-LEVEL POWER MANAGEMENT D.C. Snowdon E. Le Sueur S.M. Petters G.

BADGRS Submillimeter galaxies in the local universe Steve Maddox, Lore=a Dunne, Haley Gomez

Battle-Hardened Upstart Linux Plumbers 2013 James Hunt <james.hunt@ubuntu.com> and

Cloud Management Chapter 5 eBook: Cloud Compu5ng: Web-Based

Dr. Robert Grossman Professor and Director, Division of Biological - PowerPoint PPT Presentation

Dr. Robert Grossman Professor and Director, Division of Biological Sciences & Computation Institute, University of Chicago Dr. Xian-He Sun Chair and Professor, Computer Science, Illinois Institute of Technology Dr. Judy Qiu

Proposal's (RFP's) Robert D. Grossman President R. Grossman and Associates, Inc. Learning

Introduction to QFT Yuval Grossman Cornell Y. Grossman SM and flavor (1) ICTP, June 10, 2019

FPCP2013: Theory Summary Yuval Grossman Cornell Y. Grossman Theory Summary Buzios, Brazil, May

Symmetries + ... (3) Yuval Grossman Cornell Y. Grossman SM and flavor (3) ICTP, June 12, 2019

NuMI Beamline Radiation Safety Issues Nancy Grossman FNAL NBI03 November 2003 NUMI NBI2003

Modeling Highly Heterogeneous (Large) Data Sets: Towards a Billion Models Robert Grossman

using Sector/Sphere Yunhong Gu , Li Lu : University of Illinois at Chicago Robert Grossman :

United States Court of Appeals for the Federal Circuit 2007-1327 TRIMED, INCORPORATED,

The Great Migration An Online Professional Development Seminar James Grossman Executive Director

Automated Vehicles November 16, 2017 Ian Grossman, AAMVA 1 1 1. AAMVA Automated Vehicle

Terahertz Imaging and Security Applications Erich Grossman National Institute of Standards &amp;

Transport LDRs Universal Waste Liquid Industrial Waste Used Oil 1 Michigan Environmental

On The Road to Sustainability Virginia M. Grossman, Superintendent of Schools Rose Wang

Elizabeth Grossman United States Equal Employment Opportunity Commission Regional Attorney, New

TRANSFORMING THE WORLD AND WASTED ORGANIC MATERIALS Samuel W Grossman, PE Vice President of

What Type of Wastes Do I Generate? 1 Michigan Environmental Michigan Environmental

Life After Xdebug Setup. June 7th, 2019 Oz Heller Drupal.org: oheller LinkedIn: oheller Slack:

EPJ A special talk Colin Wilkin, University College London Physics &amp; Astronomy The legacy of

Background Determiners are words like every/some/most They express a relation between

Welcome to Welcome to PARTNERS FOR CLEAN STREAMS 4 TH ANNUAL GENERAL MEETING &amp; AWARDS 2010

KOALA A PLATFORM FOR OS-LEVEL POWER MANAGEMENT D.C. Snowdon E. Le Sueur S.M. Petters G.

BADGRS Submillimeter galaxies in the local universe Steve Maddox, Lore=a Dunne, Haley Gomez

Battle-Hardened Upstart Linux Plumbers 2013 James Hunt &lt;james.hunt@ubuntu.com&gt; and

Cloud Management Chapter 5 eBook: Cloud Compu5ng: Web-Based

Terahertz Imaging and Security Applications Erich Grossman National Institute of Standards &

EPJ A special talk Colin Wilkin, University College London Physics & Astronomy The legacy of

Welcome to Welcome to PARTNERS FOR CLEAN STREAMS 4 TH ANNUAL GENERAL MEETING & AWARDS 2010

Battle-Hardened Upstart Linux Plumbers 2013 James Hunt <james.hunt@ubuntu.com> and