discussion paper Domenico Amalfitano Anna Rita Fasolino Valerio - PowerPoint PPT Presentation

Reverse Engineering of Data Models from Legacy Spreadsheets-Based Systems: An Industrial Case Study discussion paper Domenico Amalfitano Anna Rita Fasolino Valerio Maggio Porfirio Tramontana Vincenzo De Simone SEBD 2014 – Castellammare di Stabia – 16/6/2014

Spreadsheet Based Information System Issues  Spreadsheets are designed only for computing purposes and commercial applications but …  … very often they are used as Information Systems ◦ Very difficult to maintain  High rate of duplicated data between different sheets and files  The first and more critical step of a migration process is the Data Reengineering SEBD 2014 – Castellammare di Stabia – 16/6/2014

Case Study  An automotive company collects the specification of the tests executed on the vehicles in form of Test Patterns ◦ Test Patterns are implemented in Excel files following a common template  We have 30,615 different Excel files with 2,700 data cells on average ◦ There is a high rate of replication data  50% of data cells recurred more than 100 times  Excel Test Patterns represent the input of an automatic test generation process SEBD 2014 – Castellammare di Stabia – 16/6/2014

Data Model Reverse Engineering  Data Model Reverse Engineering is the first step of a more general migration process towards a Web MVC architecture  An heuristic based approach to infer the Data Model was proposed.  A set of 26 heuristics were considered. ◦ 11 heuristics derived from the literature and were adapted to work in this specific context. SEBD 2014 – Castellammare di Stabia – 16/6/2014

Data Model Reverse Engineering  Heuristics can be grouped in two main classes: ◦ Structure based rules (SBRs) ◦ Information based rules (IBRs) SEBD 2014 – Castellammare di Stabia – 16/6/2014

Structure based rules (SBRs)  SBRs analyze the structure and the properties of spreadsheets and their components, such as sheets, cells, cell headers, etc. ◦ Used to abstract the set of candidate classes and their relationships; ◦ Applied to a single Excel File. SEBD 2014 – Castellammare di Stabia – 16/6/2014

Example of SBR Rule: If the spreadsheet contains more than one sheet, then it is possible to associate the spreadsheet to a class C and each component sheet to a distinct class S i , where C has a UML composition relationship with each S i . SEBD 2014 – Castellammare di Stabia – 16/6/2014

Example of SBR Rule: If a sheet S contains sets of consecutive non-empty cells (hereafter non-empty cell area ) that are well delimited from each other by means of empty cells, then it is possible to associate each non-empty cell area to a single class C i and the sheet S to a candidate class C S , where S has a UML composition relationship with each C i . SEBD 2014 – Castellammare di Stabia – 16/6/2014

Information based rules (IBRs)  IBRs analyze the informative content of the cells by looking for repeated data, synonyms, and cells containing well-defined data structures such as array strings, integer matrixes, etc. ◦ Used to infer the attributes of classes, the relationships between classes and their cardinalities. ◦ Applied to all the Excel Files SEBD 2014 – Castellammare di Stabia – 16/6/2014

Example of IBR Rule: If the header cells of the columns that discriminated the extraction of a given class A assume the same textual content in all the spreadsheets, then these values may be considered attributes of that class. SEBD 2014 – Castellammare di Stabia – 16/6/2014

Process Execution and Results  Selected groups of rules were iteratively applied to the spreadsheets.  Sets of candidate classes and relationships were automatically proposed.  The data model made by 18 classes, 27 relationships, and 95 attributes was reconstructed at the end of the process.  Candidates were submitted to domain experts who chose to accept, to refine or to reject them. ◦ Experts accepted 75% of candidates inferred by means of SBRs and 33% of candidates inferred by IBRs SEBD 2014 – Castellammare di Stabia – 16/6/2014

discussion paper Domenico Amalfitano Anna Rita Fasolino Valerio - PowerPoint PPT Presentation

Reverse Engineering of Data Models from Legacy Spreadsheets-Based Systems: An Industrial Case Study discussion paper Domenico Amalfitano Anna Rita Fasolino Valerio Maggio Porfirio Tramontana Vincenzo De Simone SEBD 2014 Castellammare di

Filter convergence and decompositions for vector lattice-valued measures Domenico Candeloro, Anna

PAPER PROJECT 1 SOURCE: http://www.printhaus.es/diferencias-entre-papel/ PAPER PROJECT 1: TYPES

PAPER PROJECT 3 SOURCE: http://www.printhaus.es/diferencias-entre-papel/ PAPER PROJECT 3: TYPES

Rita Movsesian Quartet The Armenian-Andalusian Fusion Rita Movsesian (vocal), David Santos

Calculating the eligibility rate of sampling units with unknown eligibility Rita Lima Rita

Trattografia probabilistica e fMRI nella pianificazione neurochirurgica Domenico Lizio SC Fisica

Nonbipartite regular 2factor isomorphic graphs: an update Domenico Labbate

Theunreasonable effectivenessofthe largechargeexpansion Domenico Orlando based on

Encoding Sets as Real Numbers Domenico Cantone 1 Alberto Policriti 2 Dept. of Mathematics and

The STARS Paper The Paper and the Process Part 2 The Paper Components of the Paper Abstract:

Ieee Paper Format For Paper Presentation 1 / 4 2 / 4 Ieee Paper Format For Paper Presentation 3

The STARS Paper Summer 2017 The Paper and the Process Part 2 The Paper Components of the Paper

ANNA SETTON TOUR 2019 - 2020 Brazilian singer Anna Setton has been pointed by music critics as the

Anna Kirchgater Back to School Night Thursday, August 18, 2016 Anna Kirchgater will provide a

Healthwatch Committee Meeting May 2014 Welcome and apologies Anna Bradley Minutes from last

TMS Bobcats want to say... Ron Melton Anna Palumbos Grandpa Jeanie Melton Anna Palumbos

FFCRA Leave Management Spreadsheet FFCRA Leave Management Spreadsheet Updated DOL Guidance On

T ra c king Da ta whe n yo ur ne e ds e xc e e d a spre a dshe e t Ho m e Ke e p e r Ho m e

Historical 5-Year Commercial Mortgage Spreads 425 400 375 350 325 300 Spread (bp) 275

SACOT THE STEM ADVOCACY CONFERENCE OF TEXAS What is SACOT? Organization created to help spread

NMTEACH: Accuroster 40, 80 May 2018 and 120 Day Review What is Accuroster An opportunity and

Value Creation Through Constructive Activism Special Purpose Vehicle Focused on Synacor, Inc.

2 LENDLEASE PRESENTATION TO CLSA INVESTORS FORUM 3 $55.9b Urbanisation pipeline 2 By

1 1 section page 1 Overview and Financial Highlights 3 H12016 Financial performance 2 9

Sambuz

Useful Links

Newsletter

Mail Us