towards system analysis with variability model metrics thorsten - PowerPoint PPT Presentation

research-in-progress report towards system analysis with variability model metrics thorsten berger, jianmei guo

how big is my system? thorsten berger, jianmei guo

simple systems simple model / simple feature modeling language Mobile phone Camera Flash Redeye reduction requires requires Fixed optics Zoom Basic flash Adaptable flash requires requires model adapted from keynote of J. Savolainen 3

systems software Feature Variability Configurator Header Header Code Definitions Source file files files define & constraints Model … symbols #DEFINE #DEFINE (Solution Space) #IFDEF F1 #DEFINE #DEFINE architecture generate configuration … (Problem Space) ‐ specific #IF defined(F4 & F6 ) features … Scripts Scripts common #ENDIF (Makefile) features … Presence Presence Generator Conditions Conditions ENDIF Mapping Core select and Resour Resour ces cesHeader Kbuild / Make compile Header optional files files source artifacts Kernel target artifacts 4

eCos ARM AT91 a systems software model 1297 features cdl_component POSIX_SIGNALS { display "POSIX signals configuration“ flavor bool default_value 1 requires KERNEL_EXCEPTIONS requires POSIX_PTHREAD requires POSIX_TIMERS implements POSIX_REALTIME_SIGNALS cdl_option HAL_RTC_PERIOD { implements ISO_SIGSETJMP display "Real-time clock period“ requires { ISO_SIGSETJMP_HEADER == "<cyg/posix/sigsetjmp.h>" } flavor data implements ISO_SIGNAL_NUMBERS cdl_option POSIX_MQUEUE_NOTIFY { legal_values 1 to 0xffff implements ISO_SIGNAL_IMPL display "Allow empty queue notification“ calculated { (CYGNUM_HAL_RTC_NUMERATOR * requires { ISO_SIGNAL_NUMBERS_HEADER == "<cyg/posix/signal.h>" } flavor bool requires { CYGBLD_ISO_SIGNAL_IMPL_HEADER == "<cyg/posix/signal.h>" } CYGNUM_HAL_ARM_AT91_CLOCK_SPEED / requires POSIX_SIGNALS description "This component provides configuration controls for the POSIX (CYGBLD_HAL_ARM_AT91_TIMER_TC ? 32 : 16) / active_if POSIX CYGNUM_HAL_RTC_DENOMINATOR / 1000000000 ) } signals.“ scalability concepts default_value POSIX_SIGNALS description compile signal.cxx except.cxx "Value to program into the RTC clock generator." description "Enabling this option adds the function mq_notify() } } derived features to the API. Without it, some code and per-message queue descriptor space is saved, as well as no longer requiring POSIX ranges realtime signal support.“ } expressive constraint languages visibility conditions feature modeling concepts defaults (als computed) hierarchy capabilities Boolean, integer, string features binding modes feature groups hierarchy manipulation cross-tree constraints ... Berger, She, Lotufo, Czarnecki, Wasowski: A Study of Variability Models and Languages in the Systems 5 Software Domain. IEEE Transactions on Software Engineering, 2013

more systems software models 129 models 108 – 8355 features languages: CDL and Kconfig system: 26K – 10.2M LOC analysis tools CDLTools LVAT (credits: S. She) abstractions configuration space in propositional logic (DIMACS) hierarchy plots 6

WHY TO MEASURE? 7

quantifying model properties but we need to build models explaining but we need to build models explaining relationships betwen measures relationships betwen measures 8

assure quality attributes but our understanding of the relationship but our understanding of the relationship between measures and quality attributes is poor between measures and quality attributes is poor 9

HOW TO MEASURE? 10

measurement using metrics compound complex metrics attribute validity? reliability? low-level metrics structural attribute understanding of low-level attributes of variability models is low! 11

metrics definition goal: define metrics for low-level characteristics 9 structural metrics reflect size, shape, hierarchy, grouping 7 feature representation metrics reflect data types (switch, none, number, string), value domain restrictions (e.g., ratio of open value domain features), capabilities 10 feature constraint metrics constrained features, ratio of constraint types (e.g., derived, visibility, default) 3 dependency metrics CTCR, density, connectivity prospective metrics hierarchy specifics, feature descriptions, feature-to-code mapping 12

examples RConstr ... ratio of features declaring any constraint RPurelyBoolConstr ... ratio of purely Boolean constraints RCon ... connectivity of an abstracted dependency graph RDen ... density of an abstracted dependency graph CDL Metamodel (excerpt) 13

preliminary experiment ANALYSES USING METRICS 14

possible analysis techniques interest in co-variance: association (correlation) analysis interest in prediction: classification and regression interest in outliers: clustering and anomaly detection 15

preliminary experiment CORRELATION ANALYSIS 16

methodology eight real-world systems with models and proper (C-based) codebases correlation test criteria (limitations) model metrics have no normal distribution low sample size compared to the number of variables (34 model metrics, 23 code metrics) Spearman correlation test significant level: p-value < 0.05 Spearman is non-parametric and can detect non-linear relationships, to account for limitations of our dataset qualitative inspection of correlations 17

selection of preliminary RESULTS 18

model metric correlation test goal: identify inherent model characteristics 19

correlations and insights model size and shape number features, number top-level features and leaf features -> equal growth at both levels; with other findings: shapes remain when models grow ratio of abstract features strongly negatively correlated with branching, but strongly correlated with defaults -> domain modeling does not produce wide trees, and the more manual effort goes into domain modeling, the more defaults are modeled mean and median branching not correlated -> many outliers (we knew before) , median is the better measure 20

correlations and insights feature constraints CTCR correlated strongly with branching and strongly negatively with maximum depth -> wider and less-deep trees have less opportunities to encode constraints in hierarchy CTCR, connectivity and density of dependency graph highly correlate -> more investigation required, but early indicator of regular, non-skewed structures 21

model and code relationship code metrics from [Liebig et al. 2010] adapted the cppstats tool, and ran it on our codebases Liebig, Apel, Lengauer, Kästner, Schulze: An analysis of the variability in forty preprocessor-based software product lines. In International Conference on Software Engineering, 2010 22

model and code relationship goal: explore potential of predicting system characteristics 23

correlations and insights sizes model size metrics and code size metrics (LOC, NOFC, LOFC) very strongly correlated size metrics very strongly correlated with code extension metrics HOM, HOHE, but not with HET granularity sizes strongly correlated with extension granularities (GRANGL, GRANFL, GRANBL, GRANSL, GRANEL, GRANML, and GRANERR) -> identification of significant system characteristics -> indications of system characteristics show that forward and reverse inference of model and code characteristics is possible 24

CONCLUSION 25

summary and conclusions contributions: defined and implemented metrics on rich languages, a tool, quantitative datasets, qualitatively inspected correlations model metrics provide insights analysis both confirms earlier findings and provides a complementary picture model and code metric analysis can potentially provide insights for instance, for reverse-engineering techniques -> further analysis required, but needs better focus 26

outlook evaluation of applicability of metrics to further languages and further real models investigate prospective model metrics and feature metrics connect to findings about computational and cognitive complexity theoretical evaluation of the metrics regarding accepted properties (e.g., additivity, triangle inequality), for instance, using the DISTANCE framework look at evolution? 27

and so? models https://bitbucket.org/tberger/variability-models https://code.google.com/p/linux-variability-analysis- tools/source/browse/?repo=extracts metrics and analysis tools VMM https://bitbucket.org/tberger/vmm LVAT (S. She) https://code.google.com/p/linux-variability-analysis-tools/ CDLTools https://bitbucket.org/tberger/cdltools read on... Berger, She, Lotufo, Czarnecki, Wasowski: A Study of Variability Models and Languages in the Systems Software Domain. IEEE Transactions on Software Engineering, 2013 She, Berger: Formal Semantics of the Kconfig Language. Technical Note, 2010 Berger, She: Formal Semantics of the CDL Language. Technical Note, 2010 28

thanks for listening! towards system analysis with variability model metrics thorsten berger, jianmei guo 29

towards system analysis with variability model metrics thorsten - PowerPoint PPT Presentation

research-in-progress report towards system analysis with variability model metrics thorsten berger, jianmei guo how big is my system? thorsten berger, jianmei guo simple systems simple model / simple feature modeling language Mobile phone

VARIABILITY OF HAWAIIAN WINTER RAINFALL VARIABILITY OF HAWAIIAN WINTER RAINFALL VARIABILITY OF

Variability of an artificial tandem repeat Ted Pak HURS 2007 Variability of an artificial tandem

Variability Extraction and Analysis Toolkit (VEXA) VEXA Introduction The Variability Extraction

What we learned from Community Metrics Agenda Why are metrics used? How metrics are used

Performance Metrics for Graph Mining Tasks 1 Outline Introduction to Performance Metrics

AGENCY OPERATIONS METRICS The Metrics of Me The Metrics of Me x 159 13,006 5 days old books

Proposal Metrics Dashboard What Gets Measured Gets Done Topics Why Keep Metrics? What

Climate Variability in South Asia V. Niranjan, M. Dinesh Kumar, and Nitin Bassi Institute for

Introduction Variability in Data Summarizing variability in a data set CS 239

Chapter 4: Variability Variability Provides a quantitative measure of the degree to which

A New Two- -Scale Mix Model: Towards Scale Mix Model: Towards a Multi a Multi- - A New Two A

Software Metrics Alex Boughton Executive Summary What are software metrics? Why are

Astheno-Khler and strong KT General results metrics Bismut connection Definition of strong KT

NDCs and metrics Andrei Marcu , Director, ERCST 1 NDCs and metrics Main issues: - Which metrics

Metrics are Pivotal A NATIONAL FARM TO INSTITUTION METRICS COLLABORATIVE WEBINAR Local

Metrics and Estimation Rahul Premraj + Andreas Zeller 1 Metrics Quantitative measures that

Associate Professor Mohamad Abdalla Presentation to Government officials 19 March 2015

FY 2013 & FY 2014 Financial Forecast Tom Tarkiewicz, City Manager Sandra Bird,

POPULATION LEVEL STORM RESILIENCE MODEL THREATS_SRM Julian Forbes Laird BA(Hons), MICFor,

European Tree of the Year Contest General instructions for the organisers Organisers of the

BUILDING AUTONOMOUS VEHICLES USING DRIVE PX 2 Shri Sundaram, May 8, 2017 Autonomous Vehicles

Using Aspects for Language Portability Lennart Kats Eelco Visser DSLs Stratego SDF Spoofax

Geoffrey Nelissen, Humberto Carvalho, David Pereira, Eduardo Tovar System Verification

IGG Agenda 4 th March 2015 10:30 10:35 Minutes from Previous IGG Meeting 10.35 11:00

towards system analysis with variability model metrics thorsten - PowerPoint PPT Presentation

research-in-progress report towards system analysis with variability model metrics thorsten berger, jianmei guo how big is my system? thorsten berger, jianmei guo simple systems simple model / simple feature modeling language Mobile phone

VARIABILITY OF HAWAIIAN WINTER RAINFALL VARIABILITY OF HAWAIIAN WINTER RAINFALL VARIABILITY OF

Variability of an artificial tandem repeat Ted Pak HURS 2007 Variability of an artificial tandem

Variability Extraction and Analysis Toolkit (VEXA) VEXA Introduction The Variability Extraction

What we learned from Community Metrics Agenda Why are metrics used? How metrics are used

Performance Metrics for Graph Mining Tasks 1 Outline Introduction to Performance Metrics

AGENCY OPERATIONS METRICS The Metrics of Me The Metrics of Me x 159 13,006 5 days old books

Proposal Metrics Dashboard What Gets Measured Gets Done Topics Why Keep Metrics? What

Climate Variability in South Asia V. Niranjan, M. Dinesh Kumar, and Nitin Bassi Institute for

Introduction Variability in Data Summarizing variability in a data set CS 239

Chapter 4: Variability Variability Provides a quantitative measure of the degree to which

A New Two- -Scale Mix Model: Towards Scale Mix Model: Towards a Multi a Multi- - A New Two A

Software Metrics Alex Boughton Executive Summary What are software metrics? Why are

Astheno-Khler and strong KT General results metrics Bismut connection Definition of strong KT

NDCs and metrics Andrei Marcu , Director, ERCST 1 NDCs and metrics Main issues: - Which metrics

Metrics are Pivotal A NATIONAL FARM TO INSTITUTION METRICS COLLABORATIVE WEBINAR Local

Metrics and Estimation Rahul Premraj + Andreas Zeller 1 Metrics Quantitative measures that

Associate Professor Mohamad Abdalla Presentation to Government officials 19 March 2015

FY 2013 &amp; FY 2014 Financial Forecast Tom Tarkiewicz, City Manager Sandra Bird,

POPULATION LEVEL STORM RESILIENCE MODEL THREATS_SRM Julian Forbes Laird BA(Hons), MICFor,

European Tree of the Year Contest General instructions for the organisers Organisers of the

BUILDING AUTONOMOUS VEHICLES USING DRIVE PX 2 Shri Sundaram, May 8, 2017 Autonomous Vehicles

Using Aspects for Language Portability Lennart Kats Eelco Visser DSLs Stratego SDF Spoofax

Geoffrey Nelissen, Humberto Carvalho, David Pereira, Eduardo Tovar System Verification

IGG Agenda 4 th March 2015 10:30 10:35 Minutes from Previous IGG Meeting 10.35 11:00

FY 2013 & FY 2014 Financial Forecast Tom Tarkiewicz, City Manager Sandra Bird,