BenchCouncil AIBench --- A Datacenter AI Benchmark Suite Wanling - PowerPoint PPT Presentation

BenchCouncil AIBench --- A Datacenter AI Benchmark Suite Wanling Gao, Fei Tang, Jianfeng Zhan http://www.benchcouncil.org/AIBench/index.html INSTITUTE O BenchCouncil OF C Bench’19, Denver, Colorado, USA COMPUTING T TECHNOLOGY

Why Datacenter AI? § AI is widely employed to augment Internet services processing images, video, speech, and audio n n There is an urgent need for datacenter AI benchmarks AIBench Bench’19

Challenge 1# Isolation n Confidential issues of workloads and datasets n Isolation ！ n There is no publicly available industry-scale Internet service benchmark AIBench Bench’19

Challenge 2# Microservice based Architecture n A collection of loosely coupled services n Various modules and complex execution path n massive scale and complex hierarchy of infrastructure Component 1 … Request st Resp sponse se Splitted Merged suboperations results Component n End-to-end benchmark that models the critical paths and primary modules is needed AIBench Bench’19

Challenge 3# Diversity of workloads and models Bianco, S., Cadene, R., Celona, L., and Napoletano, P. Benchmark analysis of representativ e deep neural network architectures . IEEE Access, 6:64270– 64277, 2018. AIBench Bench’19

Challenge 4# Domain-specific metrics n Time-to-accuracy n State-of-the-art accuracy n Throughput n Latency n Tail latency AIBench Bench’19

Challenge 5# More vs. Less n Workload characterization n SPECCPU 2017 (43), PARSEC3.0 (30), TPC-DS (99) n Performance ranking (Benchmarketing) n top500 AIBench Bench’19

Challenge 6# Inconsistency n Inconsistent benchmarking requirements Micro or Application Portability Reality benchmark ? Component ? Earlier stage of Later stage of architecture research architecture research AIBench Bench’19

Requirements n Industry-scale n critical paths and primary modules of business AI scenario • AI-related and non AI-related components n A modular framework design n Collectively as a whole end-to-end application n Individually as a micro or component benchmark n Representativeness and coverage n Diverse AI problem domains and datasets are needed AIBench Bench’19

Outline n AIBench Overview n Tasks, Models, Datasets, Metrics n How to use AIBench n Preliminary Results n Conclusion AIBench Bench’19

BenchCouncil AIBench n A Datacenter AI Benchmark Suite n Contributors: many companies and top universities • Alibaba, Microsoft, Paypal, Tencent, etc http://www.benchcouncil.org/AIBench/index.html Wanling Gao, Fei Tang, Lei Wang, Jianfeng Zhan, Chunxin Lan, Chunjie Luo, et al. AIBench: An Industry Standard Internet Service AI Benchmark Suite. Technical Report 2019. arXiv preprint arXiv:1908.08998. AIBench Bench’19

AIBench Overview The First end-to-end industry- n standard AI benchmark suite Industry-scale Internet services n • critical paths and primary modules • AI-related and non AI-related A highly extensible, configurable, and n flexible benchmark framework 16 prominent AI problem domains n Multiple loosely coupled modules n • Individually – Micro/Component benchmarks • Collectively – Application benchmarks AIBench Bench’19

Sixteen AI Problem Domains n Text Processing (4) n Text-to-Text translation, Text summarization, Learning to rank, Recommendation n Image Processing (8) n Image classification, Image generation, Image-to-text, Image-to-Image, Face embedding, Object detection, Image compression, Spatial transformer n Audio Processing (1) n Speech recognition n Video Processing (1) n Video prediction n 3D Data Processing (2) n 3D face recognition, 3D object reconstruction AIBench Bench’19

End-to-End: E-commerce Search Query generator ： simulate concurrent users and send query requests n Online Module ： personalized searching and recommendations n Offline Module ： a training stage to generate a learning model n Data storage module ： data storage, e.g., user database, product database n AIBench Bench’19

Component Benchmark (16) AIBench Bench’19

BenchCouncil International Competitions n DC-AI-C1 Image classification n DC-AI-C8 3D face recognition n DC-AI-C10 Recommendation n Competition papers are available soon! AIBench Bench’19

Image Classification n Extract different thematic classes within an image n a supervised learning problem to define a set of target classes and train a model to recognize n ResNet neural network, Dataset ： ImageNet2012, 100GB+ AIBench Bench’19

Image Generation n Mimic the distribution of data and generate image data n Dataset ： LSUN ， about million labelled image data n Model: WGAN algorithm AIBench Bench’19

Text-to-Text Translation n Translates text from one language to another n Model: Transformer n Dataset ： WMT English-German (4.5MB training text data) AIBench Bench’19

Image-to-Image n Converts an image from one representation of a specific scene to another scene or representation n Model: cycle- GAN algorithm n Datasets ： Cityscapes from 50+ cities （ 300MB ） AIBench Bench’19

Speech-to-Text n Recognizes and translates the spoken language to text n Model ： deep speech 2 n Dataset ： LibriSpeech, 1000+ hours‘ speech data AIBench Bench’19

Object Detection n Detects the objects within an image n Model ： Faster R-CNN algorithm n Dataset ： MSCOCO2014 • 82783 training samples, 40504 Validation samples, 40775 test samples （ 20GB+ ） AIBench Bench’19

Image-to-Text n Generates the description of an image automatically n Model ： Neural Image Caption model n Dataset ： MSCOCO2014 AIBench Bench’19

Face Embedding n Transforms a facial image to a vector in embedding space n Model ： FaceNet algorithm n Dataset ： VGGFace2 • 36GB training data ， 1.9GB test data AIBench Bench’19

3D Face Recognition n Recognize the 3D facial information from an image n Model ： 3D face models n Dataset ： Intellifusion data set ， 77,715 samples from 253 face IDs AIBench Bench’19

Video Prediction n Predicts the future video through predicting previous frames transformation n Model ： motion-focused predictive models n Dataset ： Robot pushing dataset • 59000 samples, 100GB+ AIBench Bench’19

Image Compression n Full-resolution lossy image compression n Model ： recurrent neural networks n Dataset ： ImageNet2012 ， 100GB+ AIBench Bench’19

Recommendation n Collaborative filtering-based movie recommendations n Model ： Collaborative filtering algorithm n Dataset ： MovieLens • 20,000,000 movie ratings data AIBench Bench’19

3D Object Reconstruction n Predicts and reconstructs 3D objects n Model ： a convolutional encoder-decoder network n Dataset ： ShapeNet • 51,3000 different 3D data covering 5 object categories AIBench Bench’19

Text Summarization n Generate the text summary n Model ： sequence-to-sequence model n Dataset ： Gigaword • 10,000,000 text data, Four billion words AIBench Bench’19

Spatial Transformer n Performs spatial transformations n Model ： spatial transformer networks n Dataset ： MNIST • 60000 training samples, 10000 test samples AIBench Bench’19

Learning to Rank n Machine-learned ranking for recommender system n Model: ranking distillation n Dataset: Gowalla • Social network data: 196,591 nodes and 950,327 edges AIBench Bench’19

Micro Benchmark (12) AIBench Bench’19

AIBench Inference Specification n Inference System under Test Datasets Query Result Generator Outputs System under Test Concurrency Accuracy Ø Ø Ø Arriving rate Ø Latency Distribution Tail Latency Ø Ø Thinking time Throughput Ø Ø Monitoring Tools AIBench Bench’19

Inference Metrics n Online Inference ( Accuracy-ensured ) n Latency, Tail latency n Latency-bounded throughput n Offline Inference ( Accuracy-ensured ) n Throughput, Energy consumption n Accuracy-ensured: n Accuracy deviation with target accuracy is within 2% AIBench Bench’19

AIBench Training Specification n Training System under Test Datasets Result Outputs System under Test Accuracy Ø Ø Latency Tail Latency Ø Throughput Ø Monitoring Tools AIBench Bench’19

Training Metrics n Offline Training n Time-to-accuracy n Energy-to-accuracy n Throughput • Running 1000 epochs • Hyper parameter settings should be able to achieve target accuracy AIBench Bench’19

Benchmark Guideline n Online Server n Each component can be distributed deployed on a large cluster AIBench Bench’19

Benchmark Guideline n Offline Analytics n Single GPU, Multi GPUs, Distributed versions • TensorFlow implementation • PyTorch implementation n Example distributed training setting AIBench Bench’19

Outline n AIBench Overview n Tasks, Models, Datasets, Metrics n How to use AIBench n Preliminary Results n Conclusion AIBench Bench’19

BenchCouncil AIBench --- A Datacenter AI Benchmark Suite Wanling - PowerPoint PPT Presentation

BenchCouncil AIBench --- A Datacenter AI Benchmark Suite Wanling Gao, Fei Tang, Jianfeng Zhan http://www.benchcouncil.org/AIBench/index.html INSTITUTE O BenchCouncil OF C Bench19, Denver, Colorado, USA COMPUTING T TECHNOLOGY Why

BenchCouncil: Present and Future Prof. Dr. Jianfeng Zhan BenchCouncil

Implications of Integration of Deep Learning and HPC for Benchmarking Geoffrey Fox, Shantenu Jha,

SSH-Backed API Performance Case Study Anagha Jamthe, Mike Packard, Joe Stubbs, Gilbert Curbelo

How to Use HPC AI500 Zihan Jiang, Xingwang Xiong, Tianshu Hao, and Jianfeng Zhan INSTITUTE O

XDN: Towards Efficient Inference of Residual Neural Networks on Cambricon Chips 2019

New opportunities in optical testing given by photochromic materials Giorgio Pariani, Martino

Scientific Programming: Part B Lecture 2 Luca Bianco - Academic Year 2019-20

Scientific Programming Lecture A05 - Designing programs Andrea Passerini Universit degli Studi

ATT&CKing the Castle Chip Greene Conrad Layne Introductions Chip Greene Conrad Layne

Changing Network Detection Using Bro and Distributed Computing Concepts Mike Reeves @TOoSmOotH

Chart 1 U.S. Personal Saving Rate and Household Debt (consumer plus mortgage) as a % of

STELLA: A Domain Specific Language for Stencil Computations Carlos Osuna, Center for Climate

Women on Boards in Italy Magda Bianco, Angela Ciavarella, Rossella Signoretti Banca dItalia,

On Model Checking Techniques for Randomized Distributed Systems Christel Baier Technische

JRA2 Task 24.7 Testing JRA Face to Face Meeting ESRF Ivan Andrian 2018.05.24 2 Data

New observable for CP asymmetry in charm decays Hsiang-nan Li ( ) Academia Sinica,

Tensor Methods: A New Paradigm for Probabilistic Models and Feature Learning Anima Anandkumar

inference-making Cat Davies, Michelle McGillion, Danielle Matthews This session will 1. Review

Scheduling Jobs with Unknown Duration in Clouds Siva Theja Maguluri, Student Member, IEEE, and R.

Abelian, nilpotent and solvable quandles David Stanovsk y jointly with M. Bonatto, P. Jedli

4TH WORKSHOP ON RECONFIGURABLE COMPUTING FOR MACHINE LEARNING Organisers: Christos Bouganis,

Topological Phases of Matter Out of Equilibrium Nigel Cooper T.C.M. Group, Cavendish Laboratory,

Schema.org Update Guha Outline of talk The context How did we end up where we are with the

Maximum Entropy & Subjective Interestingness Jill illes V s Vreeken 26 June une 2015

Sambuz

Useful Links

Newsletter

Mail Us

BenchCouncil AIBench --- A Datacenter AI Benchmark Suite Wanling - PowerPoint PPT Presentation

BenchCouncil AIBench --- A Datacenter AI Benchmark Suite Wanling Gao, Fei Tang, Jianfeng Zhan http://www.benchcouncil.org/AIBench/index.html INSTITUTE O BenchCouncil OF C Bench19, Denver, Colorado, USA COMPUTING T TECHNOLOGY Why

BenchCouncil: Present and Future Prof. Dr. Jianfeng Zhan BenchCouncil

Implications of Integration of Deep Learning and HPC for Benchmarking Geoffrey Fox, Shantenu Jha,

SSH-Backed API Performance Case Study Anagha Jamthe, Mike Packard, Joe Stubbs, Gilbert Curbelo

How to Use HPC AI500 Zihan Jiang, Xingwang Xiong, Tianshu Hao, and Jianfeng Zhan INSTITUTE O

XDN: Towards Efficient Inference of Residual Neural Networks on Cambricon Chips 2019

New opportunities in optical testing given by photochromic materials Giorgio Pariani, Martino

Scientific Programming: Part B Lecture 2 Luca Bianco - Academic Year 2019-20

Scientific Programming Lecture A05 - Designing programs Andrea Passerini Universit degli Studi

ATT&amp;CKing the Castle Chip Greene Conrad Layne Introductions Chip Greene Conrad Layne

Changing Network Detection Using Bro and Distributed Computing Concepts Mike Reeves @TOoSmOotH

Chart 1 U.S. Personal Saving Rate and Household Debt (consumer plus mortgage) as a % of

STELLA: A Domain Specific Language for Stencil Computations Carlos Osuna, Center for Climate

Women on Boards in Italy Magda Bianco, Angela Ciavarella, Rossella Signoretti Banca dItalia,

On Model Checking Techniques for Randomized Distributed Systems Christel Baier Technische

JRA2 Task 24.7 Testing JRA Face to Face Meeting ESRF Ivan Andrian 2018.05.24 2 Data

New observable for CP asymmetry in charm decays Hsiang-nan Li ( ) Academia Sinica,

Tensor Methods: A New Paradigm for Probabilistic Models and Feature Learning Anima Anandkumar

inference-making Cat Davies, Michelle McGillion, Danielle Matthews This session will 1. Review

Scheduling Jobs with Unknown Duration in Clouds Siva Theja Maguluri, Student Member, IEEE, and R.

Abelian, nilpotent and solvable quandles David Stanovsk y jointly with M. Bonatto, P. Jedli

4TH WORKSHOP ON RECONFIGURABLE COMPUTING FOR MACHINE LEARNING Organisers: Christos Bouganis,

Topological Phases of Matter Out of Equilibrium Nigel Cooper T.C.M. Group, Cavendish Laboratory,

Schema.org Update Guha Outline of talk The context How did we end up where we are with the

Maximum Entropy &amp; Subjective Interestingness Jill illes V s Vreeken 26 June une 2015

Sambuz

Useful Links

Newsletter

Mail Us

ATT&CKing the Castle Chip Greene Conrad Layne Introductions Chip Greene Conrad Layne

Maximum Entropy & Subjective Interestingness Jill illes V s Vreeken 26 June une 2015