escartes Modeling and Experimental Analysis of Virtualized Storage - PowerPoint PPT Presentation

escartes Modeling and Experimental Analysis of Virtualized Storage Performance using IBM System z as Example Diploma Thesis Presentation October 12, 2012 Dominik Bruhn Reviewers: Prof. Dr. Ralf H. Reussner, Prof. Dr. Walter F . Tichy Advisors: Qais Noorshams, Dr. Samuel Kounev CHAIR FOR SOFTWARE DESIGN AND QUALITY www.kit.edu KIT – Universit¨ at des Landes Baden-W¨ urttemberg und nationales Forschungszentrum in der Helmholtz-Gemeinschaft

Motivation Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 2/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Motivation ? ? App App App App A A’ A A System System System Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 2/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Problem & Idea & Benefit & Action Problem Complex systems with many layers Difficulty to obtain good performance prediction models Idea Derivation of storage performance models from systematic measurements using regression techniques Benefit Possibility to predict the performance Easier decisions on configurations and systems Action Creation and evaluation of performance models Evaluation of techniques and optimization possibilites Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 3/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Contribution Contribution Creation and evaluation of regression models for storage performance prediction Evaluation, analysis and comparison of regression techniques valid for storage performance prediction Repeatable process validated in a representative real-world environment Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 4/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

System Under Study IBM System z IBM DS8700 LPAR1 LPAR2 Storage Controller App. App. Volatile Non-Volatile Fibre Channel Cache Cache z/Linux z/Linux PR/SM (Hypervisor) Switched Fibre Channel Processors, Memory Harddisks (RAID) Storage-Performance-Influencing Factors Workload System Requests Locality Operating System Hardware Size Mix Pattern File System I/O Scheduler Derived from Noorshams et al. (2012) Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 5/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Modeling Regression Models Training Data Independent Dependent Variables Variable Regression Model Regression Model 1. Training 2. Prediction Black Box Model Introspection Regression Model Regression Model D C A B E Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 6/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Regression Techniques Linear Regression Models MARS 10.0 10.0 7.5 7.5 5.0 y 5.0 y 2.5 2.5 0.0 0.0 2 4 6 8 2 4 6 8 x 1 x 1 y = − 1 . 884 + 1 . 293 x 1 y = 1 . 014501 + 1 . 72866 h ( x 1 − 3 . 25 ) Parameters: None Parameters: nk , threshold Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 7/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Regression Techniques Regression Trees (CART) M5 10.0 10.0 7.5 7.5 y 5.0 y 5.0 2.5 2.5 0.0 0.0 2 4 6 8 2 4 6 8 x 1 x 1 x 1 < 4 . 5 x 1 ≤ 3 . 5 Model LM0 LM1 (Intercept) 1 -3.34 1.17 x 1 < 6 . 75 LM0 LM1 x 1 1.53 5.35 8.93 Parameter: nsplits Parameters: minsplit , cp Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 8/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Cross-Validation Samples Randomized Split into Samples k folds Training Test Data Data Test Training Data Data . . . Training Data Training Data Test Data First Second k th Training Training Training Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 9/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Experimental Setup Workload Benchmark - FFSB Existing benchmark System Parameters At application layer File system ext4 I/O scheduler CFQ, NOOP System Setup Workload Parameters Virtual Machines: z/Linux Threads 100 Virtualized by PR/SM in an File set size 1 GB, 25 GB, 50 GB, LPAR 75 GB, 100 GB Request size 4 KB, 8 KB, 12 KB, DS8700 System Storage 16 KB, 20 KB, 24 KB, with 50 GB volatile and 2GB 28 KB, 32 KB non-volatile cache. Access pattern random, sequential Read percentage 0%, 25%, 30%, 50%, 70%, 75%, 100% Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 10/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Approach Goal/Question/Metric (GQM) Setup Analysis Stability of the Results Measurements Analysis Parameter Influence Parameter Analysis Virtualization Influence Synthetic Interpolation Random Synthetic Extrapolation Random Model Analysis Reduced Training Sets Nominal Split Performance Modeling Quality Comparison Technique Analysis Parameter Tuning Tradeoff Analysis Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 11/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Measurement Analysis - Results GQM Setup Analysis Stability of the Results Measurements Analysis Parameter Influence Parameter Analysis Virtualization Influence Synthetic Interpolation Random Synthetic Extrapolation Random Model Analysis Reduced Training Sets Nominal Split Performance Modeling Quality Comparison Technique Analysis Parameter Tuning Tradeoff Analysis Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 12/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Performance Modeling - Results GQM Setup Analysis Stability of the Results Measurements Analysis Parameter Influence Parameter Analysis Virtualization Influence Synthetic Interpolation Random Synthetic Extrapolation Random Model Analysis Reduced Training Sets Nominal Split Performance Modeling Quality Comparison Technique Analysis Parameter Tuning Tradeoff Analysis Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 13/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Interpolation Using Random Samples What interpolation abilities do the regression models show when being tested using newly collected samples? Method Creation of six regression models: Linear regression model ( lm ) Linear regression model including interaction terms ( lm 5param inter ) CART tree ( cart ) MARS model without interactions ( mars ) MARS model including all interaction terms ( mars multi ) M5 model ( m5 ) Training using all measurements Validation using newly collected random samples Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 14/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Interpolation Using Random Samples Models without interactions ( lm , lm 79.34% mars ) do not perform well. 13.87% lm 5param inter With an error of ∼ 10%, M5 works 35.28% cart read well. 79.49% mars Linear regression with interactions mars multi 28.52% works surprisingly well. 9.27% m5 CART and MARS (with lm 62.28% interactions) rank in the midfield. 10.01% lm 5param inter 33.97% cart write 64.65% mars mars multi 16.64% 10.39% m5 0 25 50 75 Relative Error (%) Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 15/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

Extrapolation Using Random Samples How is the extrapolation ability of the regression models when testing using newly collected data? Again, the models without 97.27% lm interactions do not work well. 20.16% lm 5param inter CART models can not be used for 61.29% cart read extrapolation. mars 95.92% mars multi 41.01% M5 still performs well with an error 12.58% m5 of ∼ 14%. 84.33% lm 20.32% lm 5param inter 82.13% cart write 79.71% mars 26.47% mars multi m5 15.46% 0 25 50 75 100 Relative Error (%) Introduction Foundations Methodology Results Related Work Conclusion October 12, 2012 16/25 Dominik Bruhn – Modeling and Experimental Analysis of Virtualized Storage Performance

escartes Modeling and Experimental Analysis of Virtualized Storage - PowerPoint PPT Presentation

escartes Modeling and Experimental Analysis of Virtualized Storage Performance using IBM System z as Example Diploma Thesis Presentation October 12, 2012 Dominik Bruhn Reviewers: Prof. Dr. Ralf H. Reussner, Prof. Dr. Walter F . Tichy

Gibbs Sampling Biostatistics 615/815 Lecture 22: . . . . . . . . . Metropolis-Hastings

planBTV: Burlingtons Comprehensive Plan Presentation to: Planning Commission October 9, 2018

Metro Fares Work Program Stakeholder Advisory Group Meeting #1 1 Introductions and Overview

Comparison of Three Comparison of Three Wireless Based Wireless Based Technologies for

OF HEALTH DATA USING PRAPARE This project was made possible with funding from: TO REDUCE

Organic heirloom tomato sauce and heirloom cherry tomatoes Organic heirloom tomato jams Organic

MRSA EQAS 2009 Isolation identification and typing of MRSA from dust samples EURL workshop, April

Electron Expulsion of Plasmonic Nanoparticles Cooper Agar, Erfan Saydanzad, Jason Li, Uwe Thumm

Pre repare red by SM SMH Octo tober 2 2018 Market Overview Mainland China Middle

Travel Alaska in Film With the Alaska Film Archives, UAF Film Archivist Angela Schmidt March 11,

18 - 19 April 2015 INVITATION The Vitrofestival The bi-annual Vitrofestival Romont has developed

What does Solvency II mean for Buyers? Philippe Gouraud Head of Major Accounts Practice &

readiness, comfort and/or efficiency in areas of combat operations. To provide the American

The questions every coach needs to ask and (at least try to) answer Rob B Briner 1 Some of The

Contractor EH&S Management Best Practice (2007) Best Practice (2007) December 2006

GROWING THE PRODUCTION PROFILE GENERATING FREE CASH FLOW BUILDING A GROWTH PIPELINE Q4 F2009

Overview and Opportunities Subsea UK Webinar 02 July 2020 Flavia Silva de Castro BDM Oil

Local lodging as an asset to the Algarve Tourism Industry A one stop shop event for those

Nigel Garrard Rob Blackwell Managing Director Commercial Director 13 October 2005 1 October

ANNUAL RESULTS For the year ended 29 September 2019 PRESENTATION OUTLINE REVIEW OF THE YEAR 1 2

Kivori Villages PNG Kivori Poe Village Unsafe road conditions contribute to vulnerability

Common Worship Main Volume (Presentation ed) Common Worship Main Volume (Presentation ed) Book

Presented By: Greg Bartlett Student Researcher gbartlett.bu@gmail.com Backyards and Beyond

MSc Infrastructure Investment and Finance: A UCL-EIB collaboration success story Dr Aris

escartes Modeling and Experimental Analysis of Virtualized Storage - PowerPoint PPT Presentation

escartes Modeling and Experimental Analysis of Virtualized Storage Performance using IBM System z as Example Diploma Thesis Presentation October 12, 2012 Dominik Bruhn Reviewers: Prof. Dr. Ralf H. Reussner, Prof. Dr. Walter F . Tichy

Gibbs Sampling Biostatistics 615/815 Lecture 22: . . . . . . . . . Metropolis-Hastings

planBTV: Burlingtons Comprehensive Plan Presentation to: Planning Commission October 9, 2018

Metro Fares Work Program Stakeholder Advisory Group Meeting #1 1 Introductions and Overview

Comparison of Three Comparison of Three Wireless Based Wireless Based Technologies for

OF HEALTH DATA USING PRAPARE This project was made possible with funding from: TO REDUCE

Organic heirloom tomato sauce and heirloom cherry tomatoes Organic heirloom tomato jams Organic

MRSA EQAS 2009 Isolation identification and typing of MRSA from dust samples EURL workshop, April

Electron Expulsion of Plasmonic Nanoparticles Cooper Agar, Erfan Saydanzad, Jason Li, Uwe Thumm

Pre repare red by SM SMH Octo tober 2 2018 Market Overview Mainland China Middle

Travel Alaska in Film With the Alaska Film Archives, UAF Film Archivist Angela Schmidt March 11,

18 - 19 April 2015 INVITATION The Vitrofestival The bi-annual Vitrofestival Romont has developed

What does Solvency II mean for Buyers? Philippe Gouraud Head of Major Accounts Practice &amp;

readiness, comfort and/or efficiency in areas of combat operations. To provide the American

The questions every coach needs to ask and (at least try to) answer Rob B Briner 1 Some of The

Contractor EH&amp;S Management Best Practice (2007) Best Practice (2007) December 2006

GROWING THE PRODUCTION PROFILE GENERATING FREE CASH FLOW BUILDING A GROWTH PIPELINE Q4 F2009

Overview and Opportunities Subsea UK Webinar 02 July 2020 Flavia Silva de Castro BDM Oil

Local lodging as an asset to the Algarve Tourism Industry A one stop shop event for those

Nigel Garrard Rob Blackwell Managing Director Commercial Director 13 October 2005 1 October

ANNUAL RESULTS For the year ended 29 September 2019 PRESENTATION OUTLINE REVIEW OF THE YEAR 1 2

Kivori Villages PNG Kivori Poe Village Unsafe road conditions contribute to vulnerability

Common Worship Main Volume (Presentation ed) Common Worship Main Volume (Presentation ed) Book

Presented By: Greg Bartlett Student Researcher gbartlett.bu@gmail.com Backyards and Beyond

MSc Infrastructure Investment and Finance: A UCL-EIB collaboration success story Dr Aris

What does Solvency II mean for Buyers? Philippe Gouraud Head of Major Accounts Practice &

Contractor EH&S Management Best Practice (2007) Best Practice (2007) December 2006