French / UK workshop on GRID Computing

Grand Large (ACI Grid CGP2P)
Investigating the impact of the Large Scale on distributed systems

F. Cappello, INRIA
Grand-Large Project, INRIA/PCRI
LRI, Université Paris Sud
fci@lri.fr, www.lri.fr/~fci
Two kinds of Grids, both large scale distributed systems:
– Computing « GRID »: large sites, computing centers, clusters. Node features: credential, authentication, confidence.
– « Desktop GRID » (Seti@home, Decrypthon, Climate-Prediction) and Peer-to-Peer systems (Napster, Kazaa, etc.). Node features: PCs running Windows or Linux.
[Figure: a coordination system (infrastructure) connects clients (PCs) and service providers (PCs). A client sends a request and receives a result; a provider accepts work and provides results. Requests and accepts may concern computations or data, and direct communications between provider PCs are possible for parallel applications.]
A very simple problem statement, but one that raises many classical OS research issues: scheduling, load balancing, security, fairness, coordination, message passing, data storage, programming, deployment, etc. BUT the « Large Scale » feature has severe implications:
– Conventional techniques and approaches may not fit (e.g., fault tolerance).
– New approaches, intrinsically scalable and fault tolerant, are needed.
Research topics and sub-projects:
– Global architecture (F. C. and O. R.)
– User interface, control language (SPI, S. Petiton)
– Security, sandboxing (SPII, O. Richard)
– Large scale storage (SPIII, Gil Utard)
– Inter-node communications: MPICH-V (SPIV, F. Cappello)
– Scheduling (large scale, multi-user) (SPIV, C. G. and F. C.)
– Theoretical proof of the protocols (SPV, J. Beauquier)
– GRID/P2P interoperability (SPV, A. Cordier)
– Validation on real applications (G. Alléon, etc.)
According to current knowledge, we need:
1) New tools (models, simulators, emulators, experimental platforms)
2) Strong interaction between the research tools

[Figure: research tools placed on a log(cost) vs. log(realism) map, spanning math (model for LSDS, protocol proof), simulation (SimLargeGrid), emulation (Grid eXplorer) and live systems (XtremWeb, MPICH-V, SMLSM, US, ADSL-Stats, Grid'5000).]
CGP2P results:
[Figure: the same log(cost) vs. log(realism) map, used here to position the CGP2P results across math, simulation, emulation and live systems.]
Network and node behaviour make failure detection ambiguous at this scale. When a probe fails (e.g., testing a node by opening a connection with select()):
– either the target is down,
– or it cannot accept new connections because all its slots are full,
– or it does not see the incoming SYN message due to high network traffic,
– etc.
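A minimal sketch of such a probe, assuming a plain TCP check written in Python (not taken from the slides): a non-blocking connect() watched with select() can only report whether the connection completed within the timeout; it cannot distinguish the causes listed above.

```python
# Sketch only: probing a peer with a non-blocking TCP connect and select().
# A timeout tells us nothing more than "the connection did not complete":
# the host may be down, its accept queue may be full, or the SYN may have
# been lost in congested traffic.
import socket
import select

def probe(host: str, port: int, timeout: float = 5.0) -> bool:
    s = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    s.setblocking(False)
    try:
        s.connect((host, port))
    except BlockingIOError:
        pass  # connect() is in progress on a non-blocking socket
    _, writable, _ = select.select([], [s], [], timeout)
    try:
        if writable and s.getsockopt(socket.SOL_SOCKET, socket.SO_ERROR) == 0:
            return True   # the peer answered, so it is alive
        return False      # timeout or error: the cause remains ambiguous
    finally:
        s.close()
```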
Consensus is impossible in asynchronous distributed systems with failures (consensus impossibility). Can useful services be built without requiring consensus? An interesting strategy would be to consider, for each node, a "horizon": consensus would be guaranteed only inside this horizon (a toy sketch follows the workshop list below).
Workshop: Hugues Fauconnier, Carole Delporte (Paris 7), Joffroy Beauquier, Franck Cappello, Colette Johnen, Sébastien Tixeuil, Thomas Herault (Paris 11)
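Purely as an illustration of the horizon idea, here is a toy sketch (not the CGP2P protocol; the RTT table, the bound and the node names are invented): each node restricts agreement to the peers it considers inside its horizon, for instance those within a round-trip-time bound, instead of seeking agreement across the whole system.

```python
# Toy sketch of the "horizon" idea (illustrative only): agreement is sought
# only among the peers inside a node's horizon, e.g. peers whose measured
# round-trip time stays under a bound.
from typing import Dict, Set

def horizon(rtt_ms: Dict[str, float], bound_ms: float) -> Set[str]:
    """Peers considered inside the horizon: RTT below the bound."""
    return {peer for peer, ms in rtt_ms.items() if ms <= bound_ms}

def agree_within_horizon(proposals: Dict[str, int], members: Set[str]) -> int:
    """Deterministic choice over the members' proposals (here the minimum),
    so every member of the horizon computes the same value."""
    return min(proposals[m] for m in members)

# Example: node 'a' only tries to agree with peers reachable under 50 ms.
rtt_from_a = {"b": 12.0, "c": 40.0, "d": 300.0}
members = {"a"} | horizon(rtt_from_a, 50.0)                 # {'a', 'b', 'c'}
print(agree_within_horizon({"a": 7, "b": 3, "c": 9, "d": 1}, members))  # -> 3
```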
Global coordination seems very difficult at large scale (hierarchical solutions exist and may fit). More speculative approaches based on autonomous decisions and self-organization are also good candidates. We investigate this last idea with a concrete mechanism: a scheduler/load balancer (SimGrid, Bricks and GridSim do not scale). Current status: a simulation tool covering topology, volatility, asynchrony, latency/bandwidth and heterogeneity, plus nearest-neighbour scheduling algorithms, and use of the tool to compare them (a toy version of a nearest-neighbour step is sketched below).
The simulation tool is based on the Swarm multi-agent simulator.
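To make the nearest-neighbour idea concrete, here is a toy load-balancing step (an illustrative sketch, not the Swarm-based tool; the ring topology and node names are invented): each node consults only its direct neighbours and shifts one task towards the least-loaded of them, so no global view is ever required.

```python
# Toy nearest-neighbour load balancing: purely local decisions, no global state.
from typing import Dict, List

def balance_step(loads: Dict[str, int], neighbours: Dict[str, List[str]]) -> Dict[str, int]:
    new_loads = dict(loads)
    for node in loads:
        if not neighbours[node]:
            continue
        target = min(neighbours[node], key=lambda n: new_loads[n])
        if new_loads[node] > new_loads[target] + 1:   # move one task if it helps
            new_loads[node] -= 1
            new_loads[target] += 1
    return new_loads

# Example: a ring of 4 nodes with all the work initially on node 'a'.
loads = {"a": 8, "b": 0, "c": 0, "d": 0}
ring = {"a": ["b", "d"], "b": ["a", "c"], "c": ["b", "d"], "d": ["c", "a"]}
for _ in range(5):
    loads = balance_step(loads, ring)
print(loads)  # the work spreads out using only local information
```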
Grid eXplorer: a "GRIDinLAB" instrument for CS researchers, funded by the French ministry of research through the ACI "Data Mass" incentive plus INRIA. Its purpose:
– addressing the specific issues of each domain,
– enabling research studies combining the 2 domains,
– easing and developing collaborations between the two communities.
Statistics: 13 laboratories, 80 researchers, 24 research experiments, >1M€ (not counting salaries), installed at IDRIS (Orsay).
It is close to Emulab and WaniLab: hardware + software.
– First GdX meeting was on September 16, 2003.
– Hardware design meeting planned for October 15.
– Hardware selection meeting on November 8.
– Choosing the nodes (single or dual?)
– Choosing the CPU (Intel IA-32, IA-64, Athlon 64, etc.)
– Choosing the experimental network (Myrinet, Ethernet, InfiniBand, etc.)
– Choosing the general experiment production architecture (parallel OS architecture, user access, batch scheduler, result repository)
– Choosing the experimental database hardware
– Etc.
XtremWeb architecture:
[Figure: clients (PCs) send job requests to the coordinator and retrieve results; worker PCs request jobs from the coordinator and return results. Workers and clients initiate the connections towards the coordinator (NAT/proxy bypass).]
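To illustrate this pull model, here is a minimal worker-side sketch (the coordinator URL, endpoints and helper names are hypothetical, not the real XtremWeb protocol): because the worker always opens the connection itself, it can sit behind a NAT or a proxy.

```python
# Illustrative worker loop: pull a job, run it, push the result back.
import time
import urllib.request

COORDINATOR = "http://coordinator.example.org"   # hypothetical address

def run(job: bytes) -> bytes:
    ...  # application-specific computation goes here

def worker_loop() -> None:
    while True:
        try:
            with urllib.request.urlopen(f"{COORDINATOR}/job/request") as resp:
                job = resp.read()
            if job:
                result = run(job)
                urllib.request.urlopen(f"{COORDINATOR}/job/result", data=result)
        except OSError:
            pass            # coordinator unreachable: retry on the next round
        time.sleep(30)      # polling period
```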
Application: understanding the origin of very high energy cosmic rays (air shower simulation).
– Sequential Monte Carlo code; time for a run: 5 to 10 hours; runs distributed with XtremWeb.
[Figure: PC clients and PC workers connected through the Internet and LANs to the XtremWeb coordinator and to the air shower parameter database (Lyon, France); annotations: "~5000", "by December", "replication".]
Deployment over the EU-USA network: the XW coordinator and XW clients at LRI (lri.fr), the LRI PC pool and other labs on the lab network, plus a Grenoble PC pool (PBS) and the Madison, Wisconsin PC pool (Condor) reached through the Internet and university networks. About 1K CPUs in total.
[Plot: scalability. Number of executed jobs vs. time in minutes for the WISC-97, WL-113, G-146, WLG-270 and WL-451 configurations.]
[Plot: resistance to a massive fault (150 CPUs). Processors used vs. time in minutes, comparing the WLG-309 run with faults to the fault-free WLG-270 run.]
MPICH-V combines:
– theoretical studies,
– experimental evaluations,
– pragmatic implementations,
aiming to provide an MPI implementation based on MPICH, featuring multiple fault tolerant protocols (3 currently), for Desktop Grids, large clusters and Grids.
[Plot: MPICH-V vs. MPICH-P4 on BT.A for 9, 16 and 25 nodes; time in seconds split into computation and communication for the base run, the run without checkpoint, MPICH-P4 and the whole execution.]
[Plot: execution time under fault injection on BT.A.9; total execution time (sec.) vs. number of faults during execution (about 1 fault every 110 sec.), for MPICH-V (CM but no logs), MPICH-V (CM with logs), MPICH-V (CM+CS+ckpt) and MPICH-P4, against the base execution without checkpoint and fault.]
Many of the CGP2P participants are also involved in:
Summary: we are involved in different projects related to large scale distributed systems:
– from theoretical studies to actual Grid deployments,
– about fault tolerance and performance,
– middleware design and implementation: XtremWeb, MPICH-V,
– large scale experimental platforms: Grid eXplorer, Grid'5000.
The 4 research topics and their leaders:
– Olivier Richard (ID-IMAG)
– Pierre Sens (LIP6)
– Pascale Primet (LIP, Inria RESO)
– Christophe Cérin (Laria)
[Table: the 24 research experiments, each marked against the categories Infrastructure, Emulation, Network and Application. Experiments: I.1 Platform, I.2 Virtual Grid, I.3 Virtualization techniques, I.4 Emulation-driven simulation, I.5 Networking, I.6 Heterogeneity emulation, I.7 Communication, I.8 Internet emulation, II.1 Engineering techniques, II.2 Mobile objects, II.3 Fault tolerance, II.4 DHT, II.5 Data base, II.6 Scheduling, II.7 Communication optimization, II.8 Data sharing, II.9 Uni- and multicast, II.10 Cellular automaton, II.11 Bioinformatics, II.12 P2P storage, II.13 Reliability, II.14 Security, II.15 Next-generation Internet, II.16 Grid coupled systems.]
Desktop Grid: a central coordinator schedules tasks over volunteer PCs (master-worker paradigm, cycle stealing).
[Figure: volunteer PCs connected to the coordinator through the Internet; each volunteer PC downloads the client application and its parameters and executes it.]
Examples:
– SETI@Home, distributed.net,
– Décrypthon (France),
– Folding@home, Genome@home,
– Xpulsar@home, Folderol,
– Exodus, Peer review,
– Javelin, Bayanihan, JET,
– Charlotte (based on Java),
– Entropia, Parabon,
– United Devices, Platform (AC).
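The coordinator side of this master-worker / cycle-stealing pattern can be sketched in a few lines (an illustration only, not the protocol of any of the systems listed above; the task values are made up): idle volunteers pull the next pending task and post back their result.

```python
# Toy master-worker coordinator: independent tasks handed to whoever asks next.
from queue import Queue

class Coordinator:
    def __init__(self, tasks):
        self.todo = Queue()
        for t in tasks:
            self.todo.put(t)
        self.results = {}

    def request_task(self):
        """Called by an idle volunteer: take the next pending task, if any."""
        return None if self.todo.empty() else self.todo.get()

    def submit_result(self, task, result):
        self.results[task] = result

# Example: volunteers drain the queue one request at a time.
coord = Coordinator(range(4))
while (task := coord.request_task()) is not None:
    coord.submit_result(task, task * task)   # stand-in for the real computation
print(coord.results)                         # {0: 0, 1: 1, 2: 4, 3: 9}
```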
Peer-to-Peer systems: a distributed and self-organizing infrastructure in which every volunteer PC participates in resource discovery and coordination and can act as both client and service provider, so requests can reach all system resources. Uses:
– instant messaging,
– managing and sharing information,
– collaboration,
– distributed storage.
Examples:
– Napster, Gnutella, Freenet,
– KaZaA, Music-city,
– Jabber, Groove,
– Globe (Tann.), Cx (Javalin), Farsite,
– OceanStore (USA),
– Pastry, Tapestry/Plaxton, CAN, Chord,
– Cosm, Wos, peer2peer.org,
– JXTA (Sun), PtPTL (Intel).
[Figure: volunteer PCs connected through the Internet; a client sends a request into the infrastructure, and the volunteers participating in resource discovery/coordination forward it.]
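Several of the systems above (Pastry, Tapestry, CAN, Chord) are structured overlays built on a distributed hash table. A toy sketch of that idea (hypothetical node names, simplified single-hop placement rather than real Chord routing): keys and nodes share one hash ring, and a key is stored on the first node whose identifier follows it.

```python
# Toy DHT placement: consistent hashing on a ring (illustration only).
import hashlib
from bisect import bisect_right

def ring_id(name: str, bits: int = 16) -> int:
    """Map a name onto the hash ring [0, 2^bits)."""
    return int(hashlib.sha1(name.encode()).hexdigest(), 16) % (1 << bits)

def responsible_node(key: str, nodes: list) -> str:
    """First node clockwise from the key's position on the ring."""
    ids = sorted((ring_id(n), n) for n in nodes)
    i = bisect_right([nid for nid, _ in ids], ring_id(key))
    return ids[i % len(ids)][1]   # wrap around the ring

nodes = ["pc01.example.org", "pc02.example.org", "pc03.example.org", "pc04.example.org"]
print(responsible_node("some-data-block", nodes))   # the node storing this key
```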
– Symmetry for the negotiation phase.
– Asymmetry for the distribution and execution phases.
– Waves: several hours to get one movie; parallel simulation is required!
Goal: execute existing or new MPI applications.
Programmer's view unchanged: a PC client calls MPI_send(), another PC client calls MPI_recv().
Problems:
1) volatile nodes (any number, at any time),
2) non-named receptions (they must be replayed in the same order as in the previous, failed execution).
Objective summary:
1) automatic fault tolerance,
2) transparency for the programmer and user,
3) tolerate n faults (n being the number of MPI processes),
4) scalable infrastructure/protocols,
5) avoid global synchronizations (ckpt/restart),
6) theoretical verification of protocols.
A toy sketch of the receive-order replay idea follows.
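Here is a hedged sketch of the replay idea behind problem 2 (an illustration of the principle only, not the MPICH-V implementation): the order in which wildcard receives were satisfied is logged during the first execution, and after a failure the restarted process holds back messages until they can be delivered in exactly that order.

```python
# Toy receive-order logging and replay (illustrative only).
from collections import deque

class ReceiveLogger:
    """During the initial run: record which source satisfied each wildcard receive."""
    def __init__(self):
        self.order = []

    def record(self, source: int) -> None:
        self.order.append(source)

class Replayer:
    """During re-execution: deliver messages in the previously logged order."""
    def __init__(self, order):
        self.expected = deque(order)
        self.pending = {}   # messages that arrived ahead of their logged turn

    def deliver(self, source: int, msg):
        self.pending.setdefault(source, deque()).append(msg)
        ready = []
        while self.expected and self.pending.get(self.expected[0]):
            src = self.expected.popleft()
            ready.append((src, self.pending[src].popleft()))
        return ready

# The first run logged that sources were received in the order 2, then 1.
replay = Replayer([2, 1])
print(replay.deliver(1, "a"))   # []: held back, source 2 must be delivered first
print(replay.deliver(2, "b"))   # [(2, 'b'), (1, 'a')]: delivered in logged order
```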