Simulation Data Mining for Supporting Bridge Design Ninth - PowerPoint PPT Presentation

Simulation Data Mining for Supporting Bridge Design Ninth Australasian Data Mining Conference Steven Burrows † Benno Stein † org Frochte ‡ J¨ David Wiesner † uller † Katja M¨ † Bauhaus University Weimar ‡ Bochum University of Applied Science 1–2 December 2011 Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 1 / 16

About Me Undergrad, Honours, PhD, RMIT University, up to 2010. PostDoc, Bauhaus University Weimar, 2011 to current. “Strategies for Robust Design of Structures” project. ◮ Includes simulation data mining and civil engineering sub-projects. Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 2 / 16

Applications of Simulation Data Mining Car crashworthiness (Kuhlmann et al., 2005; Mei and Thole, 2007). Occupant restraint systems (Zhao et al., 2010). Aviation (Fayyad et al., 1996; Painter et al., 2006). Semiconductor manufacturing (Brady and Yellig, 2005). http://en.wikipedia.org/wiki/Finite element method Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 3 / 16

Interactive Bridge Design in Civil Engineering Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 4 / 16

Supporting Bridge Design Key idea: Mine patterns in pre-computed bridge simulation results. Why simulation data mining?: Faster simulations, provide diagnosis, automated design, etc. Consider models { m i ∈ M } and simulation results { y i ∈ Y } : � y 1 ⊖ y 2 � < ε ⇔ ϕ Design ( m 1 , m 2 ) ≈ 1. Develop ϕ Design to predict the similarity of two designs with regards to learned behavior. Questions we can answer: 1 Identify good ‘?’ in ϕ Design ( m 1 , ?) ≈ 1. 2 Predict the behavior of a new model m . 3 Learn cost optimization rules for any equivalence class M ′ ⊆ M . Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 5 / 16

Methodology Computation of the similarity measure in six steps: 1 Sample candidate designs. 2 Simulate the models. 3 Aggregate the simulation results. 4 Cluster the simulation results. 5 Sample the simulation results. 6 Learn a mapping from { m i ∈ M } to { y i ∈ Y } . Future work disclaimer: There are still competing alternatives in many steps to be explored. Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 6 / 16

Step 1: Sample Candidate Designs Data format: IFC (Industry Foundation Classes): An object-oriented data model for describing entities in the construction and building industries. IFC-Bridge: An extension to IFC for bridges. NURBS (Non-Uniform Rational Basis Spline): Novel extension to IFC-Bridge in the project. Data set: 14 641 geometry and material permutations of the model below. Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 7 / 16

Step 2: Simulate the Models Input: IFC-Bridge data models (Lebegue et al., 2007) with NURBS. Simulation Engine: Finite Element Method implementation (Gerold, 2010). Our “oracle”. Output: VTK (Visualization Toolkit) format (Schroeder et al., 1996). Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 8 / 16

Step 3: Aggregate the Simulation Results Original data: 12 064 points and measurements from the FEM mesh. Process: Consultation with a Numerics professor. Aggregated data (45 measurements): Five regions (below). Maximum displacement, strain, and stress. X, Y, and Z co-ordinates. Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 9 / 16

Step 4: Cluster the Simulation Results Goal: Learn similar groupings of simulated models. Clustering algorithms: K-means (Hartigan and Wong, 1979). Hierarchical Agglomerative Clustering (Gowda and Krishna, 1978). AiTools implementation (http://webis.de/research/projects/aitools). Evaluation: Expected Density measure (Stein et al., 2003). Higher quality clusterings give have higher expected density score. Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 10 / 16

Step 5: Sample the Simulation Results Example (350 items): � � 100(100 − 1) Cluster A: 100 items. = 4 950 positive pairs. 2 � � 120(120 − 1) Cluster B: 120 items. = 7 140 positive pairs. 2 � � 130(130 − 1) Cluster C: 130 items. = 8 385 positive pairs. 2 = 20 475 positive pairs. + 40 600 negative pairs. = 61 075 total pairs. Sampling strategy (from approximately 10 8 pairs): Class balance. Equal sampling from each cluster of positive pairs. Random sampling for negative pairs. Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 11 / 16

Step 6: Machine Learning Training data: Duples in the form � m k ⊖ m l , c j � . Learning: Ten-fold cross validation. Naive Bayes and Maximum Entropy classifiers (Burrows et al., 2011). Outcome: Class probability estimates [0, 1] for evaluating ϕ Design . Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 12 / 16

Clustering Results Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 13 / 16

Accuracy Results K-means (12) HAC (37) Data set size Naive bayes Entropy Naive bayes Entropy 100 94.0 94.0 94.0 96.0 200 92.5 93.0 90.0 91.0 500 85.4 90.6 89.8 90.8 1 000 91.4 94.3 90.8 91.2 2 000 88.6 92.4 89.8 91.0 5 000 89.4 92.7 89.3 90.5 10 000 89.6 92.4 88.5 89.4 20 000 89.8 92.3 89.6 90.3 50 000 89.9 92.7 89.1 90.1 100 000 89.7 92.4 89.1 89.7 200 000 89.8 92.5 89.1 89.8 all 89.8 92.5 89.1 89.7 Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 14 / 16

Future Work Use of a rank correlation co-efficient such as Spearman’s rho , Pearson’s r , or Kendall’s tau to compare the correlation of the ranks of ϕ Design with the ranks of the cosine similarity taken from the simulation space. Apply clustering instead of ranking for the evaluation, and compare the coverage of the clusterings (F-measure). Apply domain decomposition as a parallelization technique for solving partial differential equations in FEM analysis. Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 15 / 16

Summary Mine patterns in pre-computed bridge simulation results for knowledge discovery. Six step methodology for computing ϕ Design so that new questions can be answered. Initial results are promising, but more remains for future work. Thankyou! Steven Burrows steven.burrows@uni-weimar.de www.webis.de Steven Burrows (Bauhaus University) Simulation Data Mining 1–2 December 2011 16 / 16

Simulation Data Mining for Supporting Bridge Design Ninth - PowerPoint PPT Presentation

Simulation Data Mining for Supporting Bridge Design Ninth Australasian Data Mining Conference Steven Burrows Benno Stein org Frochte J David Wiesner uller Katja M Bauhaus University Weimar Bochum University of

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

Bridge Design Introduction 13.02.2020 ETH Zurich | Chair of Concrete Structures and Bridge

Introduction What is data mining? to Data Mining: On what kind of data? Data Mining

RISK ASSESSEMENT supporting TEST supporting supporting supporting supporting REAGENTS RISK

Web Mining Web Mining Web mining is the use of data mining techniques to automatically

Indi diana na Bridge L Load R Rating Jeremy Hunter INDOT Bridge Design Manager Indiana

Introduction What is data mining? to Data mining functionalities Data Mining Major

Data mining Machine Intelligence Thomas D. Nielsen September 2008 Data mining September 2008

DATA MINING LECTURE 2 What is data? The data mining pipeline What is Data Mining? Data

Outline Narcisse Ngada DESY, MKK 1) What is simulation ? 14.05.2014 2) Why simulation ? 3)

Data Mining 2020 Frequent Pattern Mining (2) Ad Feelders Universiteit Utrecht October 2, 2020

Web MINING Web MINING Overview Overview Dr Ahmed Rafea Rafea Dr Ahmed 1 Web Mining Outline

LECTURE 1: INTRODUCTION TO DATA MINING Dr. Dhaval Patel CSE, IIT-Roorkee What is data mining?

Grid simulation (AliEn) Outline GRID simulation Simulation tool Ptolemy (Berkeley)

Bridge Rehabilitation/Replacement Needs Redbud Trail Bridge Barton Springs Road Bridge Presentation

5 th Street SE Bridge Overview Bridge is functionally obsolete New bridge funded with

WEBINAR WEDNESDAY Negotiation and Navigation in a Post-Naviance World March 7, 2018 Todays

Tor and circumvention: Lessons learned Roger Dingledine The Tor Project https://torproject.org/

Work-based Learning Why Now and Where to? Work rk-based Le Learnin ing: : Why Now? Interest

1 ILP Ferrara sept 2018 Games 2 ILP Ferrara sept 2018 Interest of games for AI Excellent

Establishing Time for Professional Learning Transforming

Hardware accelerating Linux network functions Roopa Prabhu, Wilson Kok Proceedings of netdev

> > > > < > SANDALS CHURCH CAMPUS LEADERSHIP TRAINING > > >

Other defenses Threat model (beyond TLS) TLS = confidentiality, integrity, authenticity

Simulation Data Mining for Supporting Bridge Design Ninth - PowerPoint PPT Presentation

Simulation Data Mining for Supporting Bridge Design Ninth Australasian Data Mining Conference Steven Burrows Benno Stein org Frochte J David Wiesner uller Katja M Bauhaus University Weimar Bochum University of

Web Mining Web Mining Web Mining Web Mining Web mining is the use of data mining techniques

Bridge Design Introduction 13.02.2020 ETH Zurich | Chair of Concrete Structures and Bridge

Introduction What is data mining? to Data Mining: On what kind of data? Data Mining

RISK ASSESSEMENT supporting TEST supporting supporting supporting supporting REAGENTS RISK

Web Mining Web Mining Web mining is the use of data mining techniques to automatically

Indi diana na Bridge L Load R Rating Jeremy Hunter INDOT Bridge Design Manager Indiana

Introduction What is data mining? to Data mining functionalities Data Mining Major

Data mining Machine Intelligence Thomas D. Nielsen September 2008 Data mining September 2008

DATA MINING LECTURE 2 What is data? The data mining pipeline What is Data Mining? Data

Outline Narcisse Ngada DESY, MKK 1) What is simulation ? 14.05.2014 2) Why simulation ? 3)

Data Mining 2020 Frequent Pattern Mining (2) Ad Feelders Universiteit Utrecht October 2, 2020

Web MINING Web MINING Overview Overview Dr Ahmed Rafea Rafea Dr Ahmed 1 Web Mining Outline

LECTURE 1: INTRODUCTION TO DATA MINING Dr. Dhaval Patel CSE, IIT-Roorkee What is data mining?

Grid simulation (AliEn) Outline GRID simulation Simulation tool Ptolemy (Berkeley)

Bridge Rehabilitation/Replacement Needs Redbud Trail Bridge Barton Springs Road Bridge Presentation

5 th Street SE Bridge Overview Bridge is functionally obsolete New bridge funded with

WEBINAR WEDNESDAY Negotiation and Navigation in a Post-Naviance World March 7, 2018 Todays

Tor and circumvention: Lessons learned Roger Dingledine The Tor Project https://torproject.org/

Work-based Learning Why Now and Where to? Work rk-based Le Learnin ing: : Why Now? Interest

1 ILP Ferrara sept 2018 Games 2 ILP Ferrara sept 2018 Interest of games for AI Excellent

Establishing Time for Professional Learning Transforming

Hardware accelerating Linux network functions Roopa Prabhu, Wilson Kok Proceedings of netdev

&gt; &gt; &gt; &gt; &lt; &gt; SANDALS CHURCH CAMPUS LEADERSHIP TRAINING &gt; &gt; &gt;

Other defenses Threat model (beyond TLS) TLS = confidentiality, integrity, authenticity

> > > > < > SANDALS CHURCH CAMPUS LEADERSHIP TRAINING > > >