GPU Accelerated Tandem Traversal of Blocked Bounding Volume Hierarchies

GPU Accelerated Tandem Traversal of Blocked Bounding Volume Hierarchies. Jesper Damkjær and Kenny Erleben, {damkjaer,kenny}@diku.dk. Department of Computer Science, University of Copenhagen. October 2009.
Traditional BVH Traversal: Two BVHs are traversed in tandem, using either a stack or a queue. A descend rule chooses which of the two trees to descend, or both trees are descended simultaneously. For each descent, the BVs in the two nodes are compared for overlap.
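The traversal described above can be sketched on the CPU as follows. This is a minimal illustration, not the authors' code: the `Node` class, the AABB representation as two corner tuples, and the function names are all assumptions made for the example. When both nodes are internal, it descends both trees at once.

```python
from dataclasses import dataclass, field


@dataclass
class Node:
    lo: tuple                                      # AABB minimum corner (hypothetical layout)
    hi: tuple                                      # AABB maximum corner
    children: list = field(default_factory=list)   # empty list means leaf


def overlap(a, b):
    """Axis-aligned box overlap test on every axis."""
    return all(a.lo[i] <= b.hi[i] and b.lo[i] <= a.hi[i] for i in range(len(a.lo)))


def tandem_traverse(root_a, root_b):
    """Return the overlapping leaf pairs of two BVHs, using an explicit stack."""
    hits = []
    stack = [(root_a, root_b)]
    while stack:
        a, b = stack.pop()
        if not overlap(a, b):
            continue                               # prune this pair of subtrees
        if not a.children and not b.children:
            hits.append((a, b))                    # two overlapping leaves
        elif a.children and b.children:
            # Descend both trees simultaneously.
            stack.extend((ca, cb) for ca in a.children for cb in b.children)
        elif a.children:
            stack.extend((ca, b) for ca in a.children)
        else:
            stack.extend((a, cb) for cb in b.children)
    return hits
```

Replacing the stack with a queue changes the visiting order from depth first to breadth first but not the set of leaf pairs found.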
Naive BVH on GPU: One pair of BVHs per thread. Upper space bound for the stack: k (c − 1) max(height(A), height(B)), where c is the maximum cardinality and k is the size of two BV node references. Shared memory is too small and global memory is too slow.
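To get a feel for the bound, it can be evaluated for concrete numbers. The values below (8 bytes for a pair of node references, cardinality 4, heights 10 and 7) are illustrative assumptions, not figures from the slides.

```python
def stack_bound(k, c, height_a, height_b):
    """Upper space bound k * (c - 1) * max(height(A), height(B)) for one traversal stack."""
    return k * (c - 1) * max(height_a, height_b)


# Assumed example: two 4-byte node references (k = 8), 4 children per node,
# and trees of height 10 and 7.
per_thread_bytes = stack_bound(k=8, c=4, height_a=10, height_b=7)
```

Because every thread needs its own stack, the total is this per-thread bound multiplied by the number of concurrently running threads.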
Use Blocks: 1 block ≡ the 4 children of a node (each node has 4 children). If two nodes overlap ⇒ 16 new overlap tests (4 × 4 children). Less data to transfer and more work per thread.
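The 16 tests per block pair can be sketched like this. It is a CPU-side illustration under assumed conventions: a block is a list of four AABBs, each stored as a `(lo, hi)` pair of 3-tuples, and `overlap_block_pair` is a hypothetical helper name.

```python
from itertools import product


def boxes_overlap(a, b):
    """a and b are (lo, hi) AABBs; lo and hi are 3-tuples."""
    (alo, ahi), (blo, bhi) = a, b
    return all(alo[i] <= bhi[i] and blo[i] <= ahi[i] for i in range(3))


def overlap_block_pair(block_a, block_b):
    """Run all 4 x 4 = 16 overlap tests between two blocks of four AABBs.

    Returns the (i, j) index pairs whose boxes overlap.
    """
    if len(block_a) != 4 or len(block_b) != 4:
        raise ValueError("each block must hold exactly 4 boxes")
    return [(i, j) for i, j in product(range(4), repeat=2)
            if boxes_overlap(block_a[i], block_b[j])]
```

One unit of work now covers a whole block pair, so a single fetch of eight boxes feeds sixteen tests.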
Use Double Buffered List: Stack/queue ⇒ double buffered list. Swap the input/output pairs for the next pass.
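A double buffered list can be sketched as two work lists that trade roles after every pass. The version below is a sequential Python illustration with assumed data structures (nodes as dicts with `name`, `box`, and `children` keys). On a GPU each pair in the input list would be handled by its own thread, which this sketch does not model.

```python
def boxes_overlap(a, b):
    """a and b are (lo, hi) AABBs; lo and hi are 3-tuples."""
    (alo, ahi), (blo, bhi) = a, b
    return all(alo[i] <= bhi[i] and blo[i] <= ahi[i] for i in range(3))


def traverse_double_buffered(root_a, root_b):
    """Level-by-level tandem traversal using an input and an output list."""
    current = [(root_a, root_b)]   # input buffer for this pass
    nxt = []                       # output buffer, filled during the pass
    hits = []
    passes = 0
    while current:
        for a, b in current:
            if not boxes_overlap(a["box"], b["box"]):
                continue
            if not a["children"] and not b["children"]:
                hits.append((a["name"], b["name"]))
                continue
            # A leaf stands in for itself when only the other node is internal.
            kids_a = a["children"] or [a]
            kids_b = b["children"] or [b]
            nxt.extend((ca, cb) for ca in kids_a for cb in kids_b)
        # Swap the buffers: this pass's output is the next pass's input.
        current, nxt = nxt, current
        nxt.clear()
        passes += 1
    return hits, passes
```

Unlike a stack, neither list is read and written in the same pass, so no pair depends on another pair processed in that pass.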
Memory Trick Needed
Need Imaginary Nodes: Fewer than 4 children ⇒ fill with imaginary nodes. They fill up space ⇒ they take part of the calculation time ⇒ use sparingly.
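One possible way to pad a block is shown below. The slides do not say how an imaginary node is represented. The inverted box used here, which can never pass an overlap test, is an assumption made for this sketch, as are the names `IMAGINARY` and `pad_block`.

```python
import math

INF = math.inf
# Assumed representation: an inverted AABB (lo > hi) that overlaps nothing.
IMAGINARY = ((INF, INF, INF), (-INF, -INF, -INF))


def boxes_overlap(a, b):
    """a and b are (lo, hi) AABBs; lo and hi are 3-tuples."""
    (alo, ahi), (blo, bhi) = a, b
    return all(alo[i] <= bhi[i] and blo[i] <= ahi[i] for i in range(3))


def pad_block(children, width=4):
    """Pad a list of child boxes with imaginary nodes up to the block width."""
    if len(children) > width:
        raise ValueError("too many children for one block")
    return list(children) + [IMAGINARY] * (width - len(children))


def imaginary_fraction(blocks):
    """Fraction of all block slots occupied by imaginary nodes."""
    slots = sum(len(block) for block in blocks)
    wasted = sum(1 for block in blocks for box in block if box is IMAGINARY)
    return wasted / slots if slots else 0.0
```

Every imaginary slot is still stored and tested, so the fraction returned by `imaginary_fraction` is a rough measure of the wasted space and work.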
Blocks with Mixed Internal or Leaf Nodes: Not allowed ⇒ simpler code.
Internal Block versus Leaf Block:
if collide(a, k) ⇒ push(e, k)
if collide(a, l) ⇒ push(e, k)
if collide(a, m) ⇒ push(e, k)
if collide(a, n) ⇒ push(e, k)
Redundant results ⇒ add extra check to code.
The Test Setup: Three different configuration types: structured stack, unstructured pile, and rock slide.
The Test Setup (Cont’d): For each configuration type, the number of triangles per object and the number of objects are increased. Tests are run against Rapid. Rapid uses OBBs; we use AABBs. No optimization of imaginary nodes in the BVHs (up to 33%).
Results: [Figure: six plots of time in seconds against triangles per object and number of objects. Top row, Rapid on an Intel Quad CPU using one core: Stack, Pile, Rockslide. Bottom row, Cuda on a GeForce 9800 GX2 using one core: Stack, Pile, Rockslide.] Stack (5-8), Pile (3-7), Slide (2).
Thanks. Questions?
