benchcouncil aibench
play

BenchCouncil AIBench --- A Datacenter AI Benchmark Suite Wanling - PowerPoint PPT Presentation

BenchCouncil AIBench --- A Datacenter AI Benchmark Suite Wanling Gao, Fei Tang, Jianfeng Zhan http://www.benchcouncil.org/AIBench/index.html INSTITUTE O BenchCouncil OF C Bench19, Denver, Colorado, USA COMPUTING T TECHNOLOGY Why


  1. BenchCouncil AIBench --- A Datacenter AI Benchmark Suite Wanling Gao, Fei Tang, Jianfeng Zhan http://www.benchcouncil.org/AIBench/index.html INSTITUTE O BenchCouncil OF C Bench’19, Denver, Colorado, USA COMPUTING T TECHNOLOGY

  2. Why Datacenter AI? § AI is widely employed to augment Internet services processing images, video, speech, and audio n n There is an urgent need for datacenter AI benchmarks AIBench Bench’19

  3. Challenge 1# Isolation n Confidential issues of workloads and datasets n Isolation ! n There is no publicly available industry-scale Internet service benchmark AIBench Bench’19

  4. Challenge 2# Microservice based Architecture n A collection of loosely coupled services n Various modules and complex execution path n massive scale and complex hierarchy of infrastructure Component 1 … Request st Resp sponse se Splitted Merged suboperations results Component n End-to-end benchmark that models the critical paths and primary modules is needed AIBench Bench’19

  5. Challenge 3# Diversity of workloads and models Bianco, S., Cadene, R., Celona, L., and Napoletano, P. Benchmark analysis of representativ e deep neural network architectures . IEEE Access, 6:64270– 64277, 2018. AIBench Bench’19

  6. Challenge 4# Domain-specific metrics n Time-to-accuracy n State-of-the-art accuracy n Throughput n Latency n Tail latency AIBench Bench’19

  7. Challenge 5# More vs. Less n Workload characterization n SPECCPU 2017 (43), PARSEC3.0 (30), TPC-DS (99) n Performance ranking (Benchmarketing) n top500 AIBench Bench’19

  8. Challenge 6# Inconsistency n Inconsistent benchmarking requirements Micro or Application Portability Reality benchmark ? Component ? Earlier stage of Later stage of architecture research architecture research AIBench Bench’19

  9. Requirements n Industry-scale n critical paths and primary modules of business AI scenario • AI-related and non AI-related components n A modular framework design n Collectively as a whole end-to-end application n Individually as a micro or component benchmark n Representativeness and coverage n Diverse AI problem domains and datasets are needed AIBench Bench’19

  10. Outline n AIBench Overview n Tasks, Models, Datasets, Metrics n How to use AIBench n Preliminary Results n Conclusion AIBench Bench’19

  11. BenchCouncil AIBench n A Datacenter AI Benchmark Suite n Contributors: many companies and top universities • Alibaba, Microsoft, Paypal, Tencent, etc http://www.benchcouncil.org/AIBench/index.html Wanling Gao, Fei Tang, Lei Wang, Jianfeng Zhan, Chunxin Lan, Chunjie Luo, et al. AIBench: An Industry Standard Internet Service AI Benchmark Suite. Technical Report 2019. arXiv preprint arXiv:1908.08998. AIBench Bench’19

  12. AIBench Overview The First end-to-end industry- n standard AI benchmark suite Industry-scale Internet services n • critical paths and primary modules • AI-related and non AI-related A highly extensible, configurable, and n flexible benchmark framework 16 prominent AI problem domains n Multiple loosely coupled modules n • Individually – Micro/Component benchmarks • Collectively – Application benchmarks AIBench Bench’19

  13. Sixteen AI Problem Domains n Text Processing (4) n Text-to-Text translation, Text summarization, Learning to rank, Recommendation n Image Processing (8) n Image classification, Image generation, Image-to-text, Image-to-Image, Face embedding, Object detection, Image compression, Spatial transformer n Audio Processing (1) n Speech recognition n Video Processing (1) n Video prediction n 3D Data Processing (2) n 3D face recognition, 3D object reconstruction AIBench Bench’19

  14. End-to-End: E-commerce Search Query generator : simulate concurrent users and send query requests n Online Module : personalized searching and recommendations n Offline Module : a training stage to generate a learning model n Data storage module : data storage, e.g., user database, product database n AIBench Bench’19

  15. Component Benchmark (16) AIBench Bench’19

  16. BenchCouncil International Competitions n DC-AI-C1 Image classification n DC-AI-C8 3D face recognition n DC-AI-C10 Recommendation n Competition papers are available soon! AIBench Bench’19

  17. Image Classification n Extract different thematic classes within an image n a supervised learning problem to define a set of target classes and train a model to recognize n ResNet neural network, Dataset : ImageNet2012, 100GB+ AIBench Bench’19

  18. Image Generation n Mimic the distribution of data and generate image data n Dataset : LSUN , about million labelled image data n Model: WGAN algorithm AIBench Bench’19

  19. Text-to-Text Translation n Translates text from one language to another n Model: Transformer n Dataset : WMT English-German (4.5MB training text data) AIBench Bench’19

  20. Image-to-Image n Converts an image from one representation of a specific scene to another scene or representation n Model: cycle- GAN algorithm n Datasets : Cityscapes from 50+ cities ( 300MB ) AIBench Bench’19

  21. Speech-to-Text n Recognizes and translates the spoken language to text n Model : deep speech 2 n Dataset : LibriSpeech, 1000+ hours‘ speech data AIBench Bench’19

  22. Object Detection n Detects the objects within an image n Model : Faster R-CNN algorithm n Dataset : MSCOCO2014 • 82783 training samples, 40504 Validation samples, 40775 test samples ( 20GB+ ) AIBench Bench’19

  23. Image-to-Text n Generates the description of an image automatically n Model : Neural Image Caption model n Dataset : MSCOCO2014 AIBench Bench’19

  24. Face Embedding n Transforms a facial image to a vector in embedding space n Model : FaceNet algorithm n Dataset : VGGFace2 • 36GB training data , 1.9GB test data AIBench Bench’19

  25. 3D Face Recognition n Recognize the 3D facial information from an image n Model : 3D face models n Dataset : Intellifusion data set , 77,715 samples from 253 face IDs AIBench Bench’19

  26. Video Prediction n Predicts the future video through predicting previous frames transformation n Model : motion-focused predictive models n Dataset : Robot pushing dataset • 59000 samples, 100GB+ AIBench Bench’19

  27. Image Compression n Full-resolution lossy image compression n Model : recurrent neural networks n Dataset : ImageNet2012 , 100GB+ AIBench Bench’19

  28. Recommendation n Collaborative filtering-based movie recommendations n Model : Collaborative filtering algorithm n Dataset : MovieLens • 20,000,000 movie ratings data AIBench Bench’19

  29. 3D Object Reconstruction n Predicts and reconstructs 3D objects n Model : a convolutional encoder-decoder network n Dataset : ShapeNet • 51,3000 different 3D data covering 5 object categories AIBench Bench’19

  30. Text Summarization n Generate the text summary n Model : sequence-to-sequence model n Dataset : Gigaword • 10,000,000 text data, Four billion words AIBench Bench’19

  31. Spatial Transformer n Performs spatial transformations n Model : spatial transformer networks n Dataset : MNIST • 60000 training samples, 10000 test samples AIBench Bench’19

  32. Learning to Rank n Machine-learned ranking for recommender system n Model: ranking distillation n Dataset: Gowalla • Social network data: 196,591 nodes and 950,327 edges AIBench Bench’19

  33. Micro Benchmark (12) AIBench Bench’19

  34. AIBench Inference Specification n Inference System under Test Datasets Query Result Generator Outputs System under Test Concurrency Accuracy Ø Ø Ø Arriving rate Ø Latency Distribution Tail Latency Ø Ø Thinking time Throughput Ø Ø Monitoring Tools AIBench Bench’19

  35. Inference Metrics n Online Inference ( Accuracy-ensured ) n Latency, Tail latency n Latency-bounded throughput n Offline Inference ( Accuracy-ensured ) n Throughput, Energy consumption n Accuracy-ensured: n Accuracy deviation with target accuracy is within 2% AIBench Bench’19

  36. AIBench Training Specification n Training System under Test Datasets Result Outputs System under Test Accuracy Ø Ø Latency Tail Latency Ø Throughput Ø Monitoring Tools AIBench Bench’19

  37. Training Metrics n Offline Training n Time-to-accuracy n Energy-to-accuracy n Throughput • Running 1000 epochs • Hyper parameter settings should be able to achieve target accuracy AIBench Bench’19

  38. Benchmark Guideline n Online Server n Each component can be distributed deployed on a large cluster AIBench Bench’19

  39. Benchmark Guideline n Offline Analytics n Single GPU, Multi GPUs, Distributed versions • TensorFlow implementation • PyTorch implementation n Example distributed training setting AIBench Bench’19

  40. Outline n AIBench Overview n Tasks, Models, Datasets, Metrics n How to use AIBench n Preliminary Results n Conclusion AIBench Bench’19

Download Presentation
Download Policy: The content available on the website is offered to you 'AS IS' for your personal information and use only. It cannot be commercialized, licensed, or distributed on other websites without prior consent from the author. To download a presentation, simply click this link. If you encounter any difficulties during the download process, it's possible that the publisher has removed the file from their server.

Recommend


More recommend