gpu activities at fi muni and their results
play

GPU activities at FI MUNI and their results Ji Matela, Ji Filipovi - PowerPoint PPT Presentation

GPU activities at FI MUNI and their results Ji Matela, Ji Filipovi <matela@ics.muni.cz> , <fila@ics.muni.cz> Laboratory of Advanced Network Technologies MetaCentrum CESNET Grid Computing Seminar 2010 Praha,


  1. GPU activities at FI MUNI and their results Jiří Matela, Jiří Filipovič <matela@ics.muni.cz> , <fila@ics.muni.cz> Laboratory of Advanced Network Technologies MetaCentrum CESNET Grid Computing Seminar 2010 Praha, 2010–10–15 1/11

  2. AutoGrid • Potential maps generation for molecular docking • The most computationally expensive parts accelerated on GPU • CPU part analyzed and modified 2/11

  3. AutoGrid – Speedup Accelerated design shows speedup of up to 400 × 450 disntance-dependent constant 400 350 300 speedup 250 200 150 100 50 0 0 50 100 150 200 250 300 350 grid size 3/11

  4. Discrete Wavelet Transform (DWT) • Digital signal processing technique • Application in diverse areas • digital speech recognition • multi-resolution video processing • data compression 4/11

  5. DWT – Speedup Our GPU implementation shows 68 × speedup 5/11

  6. Real-Time Video and Fast Large-Scale Image Compression • Ongoing project • Real-time compression and transmission of video in HD post-HD resolutions • Fast compression of pathological images of resolutions in order of gigapixels • GPU acceleration of JPEG2000 6/11

  7. JPEG2000 compression process Data compression Context Arithmetic DWT Modeling Encoding 7/11

  8. Context-Modeling in JPEG2000 • Serial algorithm • Redesign to fit specifics of GPUs • 12 × faster compared to JasPer CPU implementation 8/11

  9. Soft tissues simulations • Haptic surgical simulators • Simulations modelled using Finite Element Method (FEM) • FEM discretizes the modeled object as a mesh of elements • Per element computation and system of equations solving • Per element computation is complex problem so that it needs to be decomposed into several GPU functions • Not easy to choose decomposition granularity • Manual development of as small functions as possible • Automatic fusion into lager functions 9/11

  10. Preliminary performance gain of fusion 80% gain compared to non fused approach GPU SMEM/GMEM GPU GMEM 10000 CPU 1 core 8000 thousands elements/s 6000 4000 2000 0 0 10000 20000 30000 40000 50000 60000 # elements 10/11

  11. Thank you for you attention! Q?/A! <matela@ics.muni.cz> , <fila@ics.muni.cz> http://www.sitola.cz/ 11/11

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend