heterogeneous computing

Heterogeneous Computing for a Smarter City, by Mr. Jinshui Liu (PowerPoint presentation)


  1. Heterogeneous Computing for a Smarter City. Mr. Jinshui Liu, Chief Architect for IT Hardware, Huawei IT Product Line

  2. Smart City, Many Different Faces, All for Better Living: Safe City, Smart Home, Smart Government, Smart Manufacturing, Smart Healthcare, Smart Energy, Smart Transport, Smart Building

  3. Our Smart/Safe City Mission:
     • Create a Better Life
     • Attract More Talents and Investments
     • Promote More Business Opportunities

  4. Why Heterogeneous Computing for Smart City?
     Super-fast computing is required to:
     • Train ML/DL neural networks
     • Run inference on ML/DL neural networks
     • …
     To enable smart city services:
     • Image & video analysis
     • Facial recognition & real-time facial feature query
     • NLP for voice-activated services
     • Real-time query over billions of rows of big data
     • …
     [Diagram labels: CPU/x86 and CPU nodes with DRAM, Store, and NIC; GPU, …, AI-ASIC marked as Acceleration; captions "Homogeneous Computing" and "Heterogeneous Computing".]
     Heterogeneous computing is more efficient for AI. Source: Nvidia, 2017

  5. Building the Base FFV Database
     Government agencies have many ways to collect individuals' Facial Feature Vector (FFV) data.
     Record fields per individual (Individual-1 … Individual-X): Name, ID#, Address, Phone #, Vehicle LIC #, Gender.
     FFV extraction sources: FFV1 (ID card), FFV2 (passport), FFV3 (DL), FFV4 (Social Security), FFV5 (marriage card), FFV6 (latest).
     [Diagram: the extracted FFVs populate a Base FFV DB and an Extended FFV DB.]

  6. DIY ID Card Renewal: Quick, Convenient & Money Saving
     In China, most ID cards are valid for 5, 10, or 20 years. In the past, people usually needed to go back to their hometowns and wait in long lines to renew their ID cards.
     DIY ID card renewal: Quick!
     1. Enter your ID number (as an index to the ID DB).
     2. Take a photo of yourself.
        • Your Facial Feature Vectors are generated and compared to the ID libraries in real time.
        • The new Facial Feature Vectors are also added to the system.
     3. Take your fingerprints.
        • Your fingerprints are compared to the FP DB in real time.
     4. Enter your current address and phone number.
        • The address and phone number are analyzed in real time.
     5. Confirm your information & pay the fees.
     6. Pick up your new ID card or get it by mail.
        • A lot of time & money is saved.
     ID card renewal process in the past: Long Lines!
     • Travel back to your hometown.
     • Go to the authority office & wait in a long line to get your photo & fingerprints taken & application form submitted.
     • Return to the city where you live & wait for a notice.
     • Go back to your hometown, and wait in a long line again to pick up the new ID card.
     • Return to the city where you live.
     [Diagram: a DIY Box sends facial & FP raw data over the network to the Government Authority DC (ID Application Systems). Labeled components: facial extraction (GPU), FR training (GPU cluster), 1:1 FFV matching (GPU/CPU), facial / FP / other-info verification (CPU), and the FFV DB, FP DB, and other DBs.]
     P40 GPU: ~200 facial feature extractions/s w/ CPU for image/video decoding, 10X faster than CPU.

  7. Spring Festival: Getting Home Faster
     • Traditional check-in requires Ticket-ID-Person matching.
     • Facial recognition requires only ID-Person matching.
     • 3-5 s check-in w/ facial recognition, 2X faster than manual. Source: South China Morning Post
     • ID-Person matching is 1-to-1 FFV matching (get the ID picture from the ID number & match it to the captured picture).
     [Diagram: Smart IPC and Smart Small Station, Edge DC at the railway station, Center DC at railway company HQ; network tiers AGG/Access/Core; services SUSS, IDV, TMAC, Tracking; hardware labels ASIC, ASIC/GPU, GPU/CPU, GPU.]
     Smart IPC & Smart Small Station:
     • Facial Feature Extraction (FFV)
     • ID card reading
     • Ticket reading
     Edge DC and Center DC:
     • Facial Feature Extraction (FFV): 200/P4
     • FFV matching against FFV-DB (500M-Lib)
     • Image recognition DL algorithm training
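     The 1-to-1 ID-Person matching on this slide can be sketched as a lookup followed by one vector comparison. This is only an illustration: the slides do not say which similarity metric, vector size, or decision threshold is used, so cosine similarity, the `threshold=0.6` default, and the names `verify_one_to_one` and `id_db` are all assumptions made for the example.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        raise ValueError("zero vector has no direction")
    return dot / (norm_a * norm_b)

def verify_one_to_one(id_db, id_number, captured_ffv, threshold=0.6):
    """Hypothetical 1:1 check: fetch the enrolled FFV by ID number and
    compare it to the FFV extracted from the live camera picture.
    The metric and threshold are assumptions, not values from the slides."""
    enrolled_ffv = id_db.get(id_number)
    if enrolled_ffv is None:
        return False  # unknown ID number: nothing to match against
    return cosine_similarity(enrolled_ffv, captured_ffv) >= threshold

# Toy 3-dimensional vectors; real FFVs would be much longer.
id_db = {"ID-0001": [1.0, 0.0, 0.0]}
print(verify_one_to_one(id_db, "ID-0001", [0.9, 0.1, 0.0]))
```

     Because only one enrolled vector is compared per traveller, the cost per check-in stays constant no matter how large the ID library is; the library size only affects the lookup.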

  8. Finding a Missing Kid in Minutes
     A missing kid could be found in minutes to hours w/ facial recognition & real-time FFV matching, if reported in time.
     Capture points: shopping mall, street, railway station, highway entry, subway station, airport, bus.
     Flow at the Control Center DC servers:
     • Submit report: picture → facial vector (1)
     • Capture pictures → facial vectors (N, 20K-200K/s)
     • N:1 FFV matching (GPU)
     • Locating → Alert & Dispatch → Found!
     20K-200K/s FFV matchings: GPU
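     A minimal sketch of the N:1 matching step above, in which many captured facial vectors are compared against the single reported one. The slides only state that the matching runs on GPUs at 20K-200K/s; the cosine metric, the `threshold` value, the tuple layout of `captures`, and the function name `find_sightings` are hypothetical choices for illustration, and this pure-Python loop makes no attempt at that throughput.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0

def find_sightings(target_ffv, captures, threshold=0.6):
    """Compare every captured vector with the one reported vector (N:1).

    captures: iterable of (camera_id, timestamp, ffv) tuples (assumed layout).
    Returns matching sightings as (timestamp, camera_id, score), newest first,
    so that the latest location can be used for dispatching.
    """
    hits = []
    for camera_id, timestamp, ffv in captures:
        score = cosine_similarity(target_ffv, ffv)
        if score >= threshold:
            hits.append((timestamp, camera_id, score))
    hits.sort(key=lambda hit: hit[0], reverse=True)
    return hits

# Toy 2-dimensional vectors and made-up camera names.
captures = [
    ("mall-1", 100, [1.0, 0.0]),
    ("street-7", 160, [0.0, 1.0]),
    ("subway-3", 220, [0.8, 0.2]),
]
for timestamp, camera_id, score in find_sightings([1.0, 0.1], captures):
    print(timestamp, camera_id, round(score, 3))
```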

  9. Catching a Known Suspect in Minutes
     Capture points: shopping mall, street, railway station, highway entry, subway station, airport, bus.
     Flow at the Control Center DC servers:
     • Known suspect: picture → facial vector (1)
     • Capture pictures → facial vectors (20K-200K/s)
     • N:1 FFV matching (GPU)
     • Alert & Dispatch
     20K-200K/s FFV matchings: GPU
     Source: Some pictures from BBC News

  10. Catching Red Light Violations: Reducing Traffic Jams
     Identifying red light violators is a time-consuming many (violations)-to-many (DB) FFV matching process. Violation detection & ID recognition must run in real time.
     FFV matching against the FFV-DB at Edge DCs:
     • 20M-people city & 2M for each Edge DC
     • 15M FFV records per Edge DC, others at the Center DC
     • 20K HD cameras & 1 violation/s/camera at peak
       • 20K violations/s at peak
       • 20K x 15M FFV matchings/s = 300G FFVMs/s
       • 300G/s x 2KB = 600TB/s raw memory bandwidth!!
     • About 1000 V100 GPUs' HBM2 bandwidth!! Many-to-large FFV DB matching requests!
     Abbreviations: SUSS: Suspect Surveillance; IDV: ID Verification; TMAC: Traffic Monitoring Analysis & Control.
     [Diagram: Smart IPC and Smart Small Station, Edge DC, Center DC, Video Cloud; network tiers AGG/Access/Core; services SUSS, IDV, TMAC, Tracking; hardware labels ASIC, ASIC/GPU, GPU.]
     Smart IPC & Smart Small Station:
     • Violation detection
     • Small Image Extraction from Large (SIEL)
     • Facial Feature Extraction (FFV)
     Edge DC:
     • Violation detection
     • Small Image Extraction from Large (SIEL)
     • Facial Feature Extraction (FFV): 200/P4
     • FFV matching against FFV-DB (15M local)
     Center DC:
     • FFV matching against FFV-DB (500M-Nation)
     • History activity tracking
     • Image recognition DL algorithm training
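     The bandwidth estimate on this slide can be reproduced with a few lines of arithmetic. The camera count, violation rate, record count, and 2KB vector size come from the slide; the 900 GB/s peak HBM2 bandwidth per V100 is a vendor figure that does not appear on the slide, and the function and constant names are made up for this sketch.

```python
def red_light_matching_load(cameras=20_000,
                            violations_per_camera_per_s=1,
                            records_per_edge_dc=15_000_000,
                            bytes_per_ffv=2_000):
    """Brute-force matching load using the slide's peak-hour figures.

    Every violation is compared with every FFV record held at the Edge DC,
    and every comparison reads one full FFV record from memory.
    """
    violations_per_s = cameras * violations_per_camera_per_s
    matchings_per_s = violations_per_s * records_per_edge_dc
    bytes_per_s = matchings_per_s * bytes_per_ffv
    return violations_per_s, matchings_per_s, bytes_per_s

# Assumption (not on the slide): peak HBM2 bandwidth of one V100 in bytes/s.
V100_HBM2_PEAK_BYTES_PER_S = 900 * 10**9

violations, matchings, bandwidth = red_light_matching_load()
gpus_at_peak = bandwidth / V100_HBM2_PEAK_BYTES_PER_S

print(f"violations/s:   {violations:,}")
print(f"FFV matchings/s: {matchings / 1e9:.0f} G")
print(f"memory traffic:  {bandwidth / 1e12:.0f} TB/s")
print(f"V100s at peak HBM2 bandwidth: {gpus_at_peak:.0f}")
```

     At the assumed peak bandwidth the count comes out below the slide's "about 1000"; the slide's figure would be consistent with a lower sustained bandwidth per GPU, though the slide does not state which value it used.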

  11. Huawei Heterogeneous Computing for Smarter Cities
     Rich HC product portfolio for smart city video surveillance & intelligent analysis: G5500/G2500/G1500
     • G5500: Modular design for quick upgrade & maintenance & system reliability & availability
     • G5500: High performance & scalable for 350 W GPU & 255 W x86 CPU & 2S+32-DIMM CPU node
     • G5500: Zero-touch topology change & large NVMe SSD or HDD storage w/o need for external NAS
     [Diagram: Center DC: G5500 (training / DB query / inferencing); Edge DC: G2500 (inferencing); Smart IPC / street box: G1500, Atlas (inferencing). GPU labels: V100-SXM2, XXXX, V100, P4; "Up to 8 GPU Cards", "Max 32 Cards".]
     Chassis labels:
     • G5500: NIC, PSU, SMM, FAN; 2S/4S CPU node w/ 6X NVMe SSD; dual 2S CPU node w/ 2X NVMe SSD
     • G2500: 2S-X86 + 16*P4 + 24x 3.5-in. HDD
     • G1500: AI SoC + SSD/HDD

  12. Shenzhen Smart Transport Powered by Huawei Atlas+GPU
     Safer Cities: Creating a Better Life, Attracting More Talents & Investments, Promoting More Businesses
     Challenges:
     • Legacy devices are prone to fail and difficult to maintain.
     • No sudden event detection function is available.
     • More than 100M images per day are uploaded and must be analyzed in time.
     • Complex services, multiple algorithms & applications to support.
     Use cases pictured: Sudden Event Quick Detection; Redlight Violator Detection; Traffic Volume & Direction D&A; LIC Recognition & Fake LIC Detection.
     Huawei Solution:
     • G5500 w/ high-performance GPU for 100M-picture analysis & real-time structured analysis of 500 HD video streams per day per 4U chassis.
     • Container-based deployment for traffic volume detection, red light violation & sudden event detection on the same platform, with resource pooling and removal of resource silos.
     Project Summary:
     • 3.3M vehicles, 2430 intersections & 480 vehicles per km² in Shenzhen; intelligent transportation control & management is required to ensure smooth traffic.
     • The "Huawei Atlas+GPU+FusionInsight" solution was awarded due to high performance, high density, modular design, standardization & openness.
     Customer Benefits:
     • Resource pooling for multiple AI algorithms on the same hardware, reduced CAPEX & human resource investment.
     • Signal optimization adjustment cycle shortened from 3 months to 7 days.
     • Vehicle speed up by 9% for critical road segments.
     • Traffic jam wait time reduced by 24% in rush hours.
