

SLIDE 1

TPU for Exa-TrkX

Xiangyang Ju ExaTrkX Collaboration Meeting 7 April 2020

SLIDE 2


Introduction

  • The HL-LHC (High-Luminosity LHC) starts operations in ~2027, reaching a peak instantaneous luminosity of 7 × 10^34 cm^-2 s^-1, corresponding to ~200 proton-proton collisions per bunch crossing
  • Each collision produces about 10,000 particles
  • The ATLAS Inner Tracker will record ~150,000 hits for each event
  • The corresponding doublet graph has 150,000 nodes and 135,000 true edges. Assuming only 10% of the input doublets are true (a purity of 10%), the doublet graph would have 150,000 nodes and 135,000 / 0.10 = 1,350,000 edges

SLIDE 3


Tensor Processing Units

  • Why not GPUs?

    Limited amount of high-bandwidth memory (HBM): an NVIDIA V100 GPU has 32 GB of HBM, so the whole graph must be split into small segments, with each segment fed to the GPU separately

  • Why TPUs?

    Primarily because of their large HBM, which can reach 32 TB, and because they are specially designed for matrix operations, particularly matrix multiplications, which dominate the computation on the big graph

  • One can run TensorFlow and PyTorch (via pytorch/xla)

  • Drawbacks:

    • does not support all TensorFlow operations
    • does not support double-precision arithmetic
SLIDE 4


Cloud TPU offering

Pricing (April 2020): a single TPU v2 costs $4.5/hour and a TPU v2 pod $384/hour; a single TPU v3 costs $8.0/hour, and TPU v3 pod pricing requires contacting sales. Colab and Kaggle provide limited but free access to TPUs, good places for debugging.

SLIDE 5


Migrating to cloud TPU

To reach the best performance, TPUs prefer:

  • batch sizes that are multiples of 8, because a single Cloud TPU consists of 8 TPU cores

  • fixed shapes, since dynamic graphs are not supported: a padding graph is added to each doublet graph so that the numbers of nodes and edges stay constant

  • matrix dimensions that are multiples of 128, because the matrix unit hardware is a 128x128 systolic array (a systolic array is a grid of hard-wired processing units for specific operations)

  • training data stored in the same cloud zone: before training, upload the data to a Google Cloud Storage bucket that sits in the same zone as the Cloud TPU
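The fixed-shape padding for each doublet graph can be sketched as follows; `pad_graph`, the mask outputs, and the target sizes are illustrative assumptions, not code from the slides:

```python
import numpy as np

def pad_graph(node_feats, edge_index, max_nodes, max_edges):
    """Pad a graph to fixed node/edge counts so every example has the
    same static shape, as TPUs require.

    Padding nodes carry zero features. Boolean masks mark the real
    entries so padding can later be excluded, e.g. from the loss.
    """
    n_nodes, n_feats = node_feats.shape
    n_edges = edge_index.shape[1]
    assert n_nodes <= max_nodes and n_edges <= max_edges

    padded_nodes = np.zeros((max_nodes, n_feats), dtype=node_feats.dtype)
    padded_nodes[:n_nodes] = node_feats

    # Padding edges are self-loops on one padding node (or the last real
    # node when the graph is already full), so all indices stay valid.
    pad_node = min(n_nodes, max_nodes - 1)
    padded_edges = np.full((2, max_edges), pad_node, dtype=edge_index.dtype)
    padded_edges[:, :n_edges] = edge_index

    node_mask = np.zeros(max_nodes, dtype=bool)
    node_mask[:n_nodes] = True
    edge_mask = np.zeros(max_edges, dtype=bool)
    edge_mask[:n_edges] = True
    return padded_nodes, padded_edges, node_mask, edge_mask
```

With the masks returned alongside the padded arrays, every graph in a batch shares one (max_nodes, max_edges) shape while the real entries remain recoverable.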

SLIDE 6


Using cloud TPU

Workflow: the user creates a VM, creates the TPUs, and uploads the training data to cloud Storage.

  • Install python packages and scripts
  • In the training code:

    create the TPUStrategy to use the TPUs
    point the TFRecord input pipeline at the cloud storage directory holding the training inputs
    perform the training

  • Just made the GNN model run on TPU, with some caveats to resolve:

    remove the padding graph from the loss calculations
    find a workaround to replace the weighted log_loss

  • Next step is to figure out which TPU type we need, so that we can use one graph per event in the training
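The training-code steps (create a TPUStrategy, point the TFRecord pipeline at cloud storage, train) might look like the following TensorFlow sketch; the bucket path is a placeholder and the fallback-to-default-strategy behavior is an assumption for local debugging, not something stated in the slides:

```python
import tensorflow as tf

def create_strategy(tpu_name=None):
    """Connect to a Cloud TPU and return a TPUStrategy; fall back to the
    default single-device strategy when no TPU is reachable."""
    try:
        resolver = tf.distribute.cluster_resolver.TPUClusterResolver(tpu=tpu_name)
        tf.config.experimental_connect_to_cluster(resolver)
        tf.tpu.experimental.initialize_tpu_system(resolver)
        return tf.distribute.TPUStrategy(resolver)
    except Exception:  # no TPU found, e.g. running off-cloud
        return tf.distribute.get_strategy()

def make_dataset(gcs_pattern, batch_size=8):
    """TFRecord input pipeline; the files should live in a cloud storage
    bucket in the same zone as the TPU. drop_remainder=True keeps every
    batch at the same fixed shape, as TPUs require."""
    files = tf.data.Dataset.list_files(gcs_pattern)
    ds = tf.data.TFRecordDataset(files)
    return ds.batch(batch_size, drop_remainder=True).prefetch(tf.data.AUTOTUNE)

strategy = create_strategy()
with strategy.scope():
    # build and compile the GNN model here, then e.g.:
    # model.fit(make_dataset("gs://<bucket>/train/*.tfrec"), ...)
    pass
```

The batch size defaults to 8 to match the one-core-per-replica layout of a single Cloud TPU, per the preferences listed earlier.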
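One way to address the two loss caveats above, excluding the padding graph from the loss and reweighting the log loss, is a mask-and-weight computation like this NumPy sketch (the function name and the per-class weighting scheme are illustrative, not from the slides):

```python
import numpy as np

def masked_weighted_log_loss(pred, label, real_mask,
                             true_weight=1.0, fake_weight=1.0):
    """Binary log loss over edge predictions, where padding edges
    (real_mask == 0) contribute nothing, and true/fake edges can be
    weighted differently to counter class imbalance."""
    eps = 1e-7
    pred = np.clip(pred, eps, 1.0 - eps)
    # Standard binary cross-entropy per edge.
    per_edge = -(label * np.log(pred) + (1.0 - label) * np.log(1.0 - pred))
    # Zero weight on padding edges; class-dependent weight on real ones.
    weights = np.where(label > 0.5, true_weight, fake_weight) * real_mask
    return float(np.sum(weights * per_edge) / np.maximum(np.sum(weights), eps))
```

Because the padding edges receive zero weight, their (arbitrary) predictions cannot influence the gradient, which is the behavior the padded fixed-shape graphs need.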