
PacketShader: A GPU-Accelerated Software Router (PowerPoint PPT presentation)


  1. PacketShader: A GPU-Accelerated Software Router. Some images and sentences are from the original author Sangjin Han's presentation. Presenter: Hao Lu

  2. Why? What? How? • Why use software routers? • What is a GPU? • Why use a GPU? • How to use the GPU? • What is PacketShader's design? • How is the performance? • If time permits, configuration of the system.

  3. Software Router • Not limited to IP routing • You can implement whatever you want on it • Driven by software • Flexible • Based on commodity hardware • Cheap

  4. What is a GPU? • Graphics processing unit • 15 streaming multiprocessors of 32 cores each = 480 cores (NVIDIA GTX 480)

  5. Why use a GPU? Benefits: • Higher computation power • 1–8 CPU cores vs. 480 GPU cores • Memory access latency • Massive multithreading hides the latency • A CPU core can track only a few outstanding cache misses (up to 6) • Memory bandwidth • 32 GB/s (CPU) vs. 177 GB/s (GPU) Downsides: • Thread start (kernel launch) latency • Data transfer overhead between host and GPU

  6. How to use the GPU? • The GPU is used for highly parallelizable tasks • With enough threads to hide the memory access latency (Diagram: 1. batch packets from the RX queues; 2. process them in parallel on the GPU.)
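The batching step above can be sketched in plain Python (a simplification, not the authors' CUDA code; `CHUNK` and the `kernel` callback are hypothetical names standing in for the GPU kernel launch):

```python
from typing import Callable, List

CHUNK = 256  # hypothetical chunk size; the real system tunes this per workload

def process_in_chunks(rx_queue: List[bytes],
                      kernel: Callable[[List[bytes]], List[bytes]]) -> List[bytes]:
    """Batch packets from the RX queue and hand each chunk to a data-parallel
    'kernel' (on the real system, a GPU kernel with one thread per packet)."""
    results: List[bytes] = []
    for i in range(0, len(rx_queue), CHUNK):
        # Each chunk is processed as one unit, amortizing per-launch overhead.
        results.extend(kernel(rx_queue[i:i + CHUNK]))
    return results
```

The point of the chunking is that the fixed cost of starting GPU threads is paid once per chunk rather than once per packet.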

  7. PacketShader Overview • Three stages in a pipeline: • Pre-shader • Fetches packets from the RX queues • Shader • Uses the GPU for the per-packet computation • Post-shader • Gathers the results and scatters them to the TX queues (Diagram: Pre-shader → Shader → Post-shader.)

  8. IPv4 Forwarding Example • 1. Pre-shader: extract IP addresses; checksum, TTL, and format checks; some packets go to the slow path • 2. Shader: forwarding-table lookup • 3. Post-shader: update packets with next hops and transmit
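The per-packet header update in this example (decrement TTL, refresh the IPv4 header checksum) can be sketched as follows; this is a minimal Python illustration of the standard RFC 1071 checksum, not PacketShader's code, and `pre_shader_update` is a hypothetical name:

```python
import struct

def ipv4_checksum(header: bytes) -> int:
    # One's-complement sum of 16-bit words, carries folded back in (RFC 1071).
    total = sum(struct.unpack("!%dH" % (len(header) // 2), header))
    while total > 0xFFFF:
        total = (total & 0xFFFF) + (total >> 16)
    return ~total & 0xFFFF

def pre_shader_update(header: bytearray) -> bytearray:
    # Decrement TTL (byte 8), zero the checksum field (bytes 10-11),
    # then recompute and store the checksum.
    header[8] -= 1
    header[10:12] = b"\x00\x00"
    header[10:12] = struct.pack("!H", ipv4_checksum(bytes(header)))
    return header
```

(The real implementation uses an incremental checksum update rather than a full recomputation, since only the TTL byte changed.)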

  9. Scaling with Multi-Core CPUs • Problem: • The GPU is less efficient when more than one CPU core accesses it (Diagram: worker cores run the device drivers and pre-/post-shaders; a dedicated master core runs the shader.)

  10. Another view

  11. Optimization • Chunk pipelining • Gather/scatter • Concurrent copy and execution

  12. Performance: hardware

  13. Performance: IPv4 Forwarding • Algorithm: DIR-24-8-BASIC • It requires one memory access per packet in most cases, by storing next-hop entries for every possible 24-bit prefix. • Pre-shader: • Packets requiring the slow path go to the Linux TCP/IP stack; • otherwise, update TTL and checksum.
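The single-memory-access property of DIR-24-8-BASIC comes from a table indexed by the top 24 bits of the destination address. A minimal Python sketch of that TBL24 half (the class and method names are hypothetical; the second table for prefixes longer than /24, the "8" part, is omitted, and routes must be inserted shortest-prefix-first in this simplification):

```python
from array import array

class Dir24Basic:
    """Sketch of the TBL24 half of DIR-24-8-BASIC: one next-hop entry for
    every possible 24-bit prefix, so most lookups cost one array access."""

    def __init__(self):
        # 2^24 16-bit entries (~32 MB); 0 means "default / no route".
        self.tbl24 = array("H", [0]) * (1 << 24)

    def add_route(self, prefix: int, plen: int, nexthop: int) -> None:
        assert plen <= 24, "longer prefixes need the TBLlong table (not sketched)"
        # Expand the prefix into every /24 it covers.
        start = (prefix >> (32 - plen)) << (24 - plen)
        for i in range(start, start + (1 << (24 - plen))):
            self.tbl24[i] = nexthop

    def lookup(self, addr: int) -> int:
        return self.tbl24[addr >> 8]  # one memory access for most packets
```

The trade-off is memory for speed: the full expansion makes the common-case lookup a single indexed load, which is exactly what a GPU thread per packet wants.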

  14. Performance: IPv6 Forwarding • Same idea as IPv4, but with more memory accesses per lookup

  15. Performance: OpenFlow • OpenFlow is a framework that runs experimental protocols over existing networks. Packets are processed on a per-flow basis. • The OpenFlow switch is responsible for packet forwarding driven by flow tables.

  16. Performance: IPsec • IPsec is widely used to secure VPN tunnels or communication between two end hosts. • The cryptographic operations used in IPsec are highly compute-intensive
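As an illustration of the kind of per-packet cryptographic work involved, here is one of the standard IPsec integrity algorithms, HMAC-SHA1-96 (RFC 2404), in Python; the function name is hypothetical and this is not PacketShader's GPU implementation:

```python
import hashlib
import hmac

def esp_hmac_sha1_96(key: bytes, packet: bytes) -> bytes:
    # HMAC-SHA1-96 (RFC 2404): the 160-bit MAC is truncated to 96 bits
    # for the ESP authentication trailer.
    return hmac.new(key, packet, hashlib.sha1).digest()[:12]
```

Running this (plus encryption) over every packet is what makes IPsec compute-bound, and hence a good fit for offloading to the GPU's many cores.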

  17. Configuration of the System • Problems: 1. Linux network stack inefficiency 2. NUMA (non-uniform memory access) 3. Dual-IOH problem • Solutions: 1. A better driver with a huge packet buffer 2. A NUMA-aware driver 3. Still under investigation

  18. Network Stack Inefficiency 1. Frequent memory allocation/deallocation 2. skb metadata is too large (208 bytes)

  19. NUMA • Non-uniform memory access caused by RSS. • Solution: reconfigure RSS to distribute packets only to CPU cores in the same NUMA node as the NIC

  20. Dual-IOH Problem • Asymmetry in data transfer rates between the two I/O hubs. • Cause: unknown!
