CS626 Data Analysis and Simulation Instructor: Peter Kemper R 104A, - - PowerPoint PPT Presentation

cs626 data analysis and simulation
SMART_READER_LITE
LIVE PREVIEW

CS626 Data Analysis and Simulation Instructor: Peter Kemper R 104A, - - PowerPoint PPT Presentation

CS626 Data Analysis and Simulation Instructor: Peter Kemper R 104A, phone 221-3462, email:kemper@cs.wm.edu Office hours: Monday,Wednesday 2-4 pm Today: Overview & Introduction 1 Organizational Issues Class materials


slide-1
SLIDE 1

1

CS626 Data Analysis and Simulation

Today: Overview & Introduction Instructor: Peter Kemper

R 104A, phone 221-3462, email:kemper@cs.wm.edu Office hours: Monday,Wednesday 2-4 pm

slide-2
SLIDE 2

2

Organizational Issues Class materials http://www.cs.wm.edu/~kemper

 PDF files of slides  Homework, projects, …  Supplementary material: tutorials, references, …

Schedule

 MWF: 10.00 – 10.50 am  Office hours: Monday, Wednesday 2pm-4pm and by appointment  No class February 18

  • gives extra time for assignments
  • due to DSN PC Meeting

Just ask if you are interested what I am doing there, I am happy to tell you more …

slide-3
SLIDE 3

3

References

We will use texts from multiple sources! A lot of documents/books are available online in the SWEM library! Data Analysis: Bertholt, Borgelt, Hoeppner, Klawonn, Guide to Intelligent Data Analysis NIST/SEMATECH e-Handbook of Statistical Methods, http://www.itl.nist.gov/div898/ handbook/

  • P. Dalgaard, Introductory statistics with R,

Springer 2002, online at SWEM

  • B. Everitt, A handbook of statistical analyses

using R, CRC Press 2010, online at SWEM Simulation: Law/Kelton, Simulation Modeling and Analysis, McGrawHill

slide-4
SLIDE 4

4

Big Picture: Model-based Analysis of Systems

portion/facet real world formal / computer aided analysis solution, rewards, qualitative and quantitative properties probability model, stochastic process transformation presentation transfer decision description perception solution to real world problem real world problem formal model

slide-5
SLIDE 5

5

CS 626 – Focus: Stochastic Models

Stochastic models rely on probability theory

What is probability? A much beloved topic for students A particular type of functions f : S -> [0,1] A mathematical mean to process something we are not sure about:

slide-6
SLIDE 6

6

Calculating with something we are not sure about ? What to give as input ?

 Separate “system” from “environment”, clarify on interactions  Separate “subsystems” and their dependencies within “system”  Quantify likelihood of relevant elementary “events” to happen

What to know ?

 How to calculate with probabilities?  How to handle dependencies?

What to gain ?

 Information on likelihood of overall behavior,

quantified information on expected behavior to evaluate a single system or to compare systems

slide-7
SLIDE 7

7

Grading: How to get an „A“ for CS 626

Class participation: 0%

Seems useless but is key to meet the other criteria

It will be more fun if you actively participate

 Ask questions, make suggestions, contribute …

Homework: 30%

About 5-6

How to get an A? Just do it, hand in on time, present your results …

Projects: 20%

Will require some effort, time and creativity

Adheres to „learning by doing“ approach

How to get an A? Start early, get things done, hand in on time, reflect what you are doing, present your results …

In-class Exams: 50%

Midterm: 20%

Final: 30%

How to get an A? The usual game …

slide-8
SLIDE 8

8

Overview – this is the plan Probability Theory Primer & Statistics Concepts Stochastic Input Modeling

 Different types of stochastic workloads  Relies heavily on data analysis

Simulation Models

 Static  Dynamic

 Discrete Event Dynamic Systems  Continuous, ODE models

Output Analysis

 Data analysis strikes back again!

Verification, Validation, Testing of Simulation Models Data Analysis Classics:

 Preparation,  Finding Patterns, Explanations, and Predictors

Tools:

  • Mobius
  • BioPEPA
  • AnyLogic
  • R
  • KNIME
slide-9
SLIDE 9

9

Probability Theory

has its origins in an interest in games

Cards, roulette, dices,

 Gerolamo Cardano, 1501-1576

From Wikipedia: … notoriously short of money and kept himself solvent by being an accomplished gambler and chess player. His book about games of chance, Liber de ludo aleae, written in the 1560s but published

  • nly in 1663 after his death, contains the first systematic treatment of

probability, as well as a section on effective cheating methods.

Pierre-Simon Laplace, 1749-1827

  • "It is remarkable that a science which began with the consideration of

games of chance should have become the most important object of human knowledge." Theorie Analytique des Probabilite, 1812.

  • Other VIPs of probability theory:

Andrey Kolmogorov Andrey Markov

slide-10
SLIDE 10

10

So where to start? With an application example!

My choice: Dependability of a LEO satellite network

Reference

  • E. Athanasopoulou, P. Thakker, W.H. Sanders.

Evaluating the dependability of a LEO satellite network for scientific applications. In Proc. 2nd int. Conf. Quantitative Evaluation of Systems, pp 95-104, IEEE, 2005.

What to learn from this:

An impression on what can be achieved with stochastic models

Some terminology, techniques and tools we need to give a closer look

Please keep in mind:

LEO satellite modeling is one application among many

Dependability modeling is one application area among many

The terms and techniques are a subset of what is known

=> there is more to learn here, and it is interesting!

slide-11
SLIDE 11

11

A Low Earth Orbit Satellite Network The story:

 University of Illinois student-developed satellite network  Based on Illinois Observing Nanosatellite (ION)  Purpose: collect scientific data

 E.g. natural disaster monitoring, earthquake monitoring, mapping of

Earth’s magnetic field, measuring radiation flux for space weather …

 In particular: measurement of light emissions from oxygen chemistry

in the atmosphere

 Several mission objectives:

 Testing new thrusters, a new processor for small satellites, a new

CMOS camera, demonstration of attitude control on a CubeSat

Some issues from the list of challenges:

 What minimum radiation shielding is necessary ?  What level of redundancy is necessary ?

… to achieve a 6 mth target lifetime with COTS components …

 How does sharing resources through a network could improve

communication with the ground ?

slide-12
SLIDE 12

12

Dependability Assessment

Properties of interest, goals of study

Reliability of network R(0,t), probability of no permanent critical system failures during time interval [0,t]

Interval availability A(0,t), fraction of time system delivers proper service during time interval [0,t]

System vs Environment

System:

 Ground station, 45° inclination, northern hemis., repairable failures  4 satellites

 7 critical subsystems (5V/9V regulators, battery, solar panel, comm

hardware, processor, telemetry) + experiment hardware with temp and permanent failures

 Orbits:

 Sat 1-3: 90 min period, Sat 4: 720 min period  Inclination: Sat 1 90° , Sat 2 90° orth, Sat 3 45° , Sat 4 0°

Environment: “the rest” with a foreseen influence based on

 Lightning storms etc cause preamp failures at ground station  Radiation causes failures for satellite processors and CMOS devices

slide-13
SLIDE 13

13

Communication

Communication

Satellite - satellite: if within communication range

Satellite - ground station: if within footprint

We discretize orbits, identify matching periods

Intersatellitelink (ISL) Gateway Link (GWL) GWL

Elevation: Angle wrt to center of radiation cone and earth surface

ε

Footprint

slide-14
SLIDE 14

14

Clarification: Inclination

Inclination δ δ Satellite orbit Closest point to earth Plane of satellite orbit Equatorial plane

slide-15
SLIDE 15

15

More input data

Radiation dose, shielding and its mapping to failure rates,

0.007 failures per year for processor and CMOS components

0.0001 failures per year for discrete components Scale factor r=1 for 1mm shielding for higher orbit Consider several model configurations!

Communication:

Data collection rate: 2 GB/yr while memory available all data lost at failure

Uplink communication considered negligible

Simple routing mechanism

ISL communication rate: 115 kbps with 50% overhead, 226665 MB/yr

GWL rate: double ISL rate

ISL with commercial satellite networks

slide-16
SLIDE 16

16

Where are the probabilities ?

Dependability study:

Ground station: rates of failure / repair actions

Satellite subsystems: rates of failure / repair actions are modeled with a random variable that follows a negative exponential distribution with a given rate. Rate 5.0 means on average 5 events per time unit.

Total Ionizing Dose (TID): is taken into account by a scaling factor r towards failure rates of components.

What is then analyzed

Reliability and availability

For different levels of radiation shielding

For different levels of redundancy of components

What type of analysis is used

Transient analysis of Markov chains

 For single satellite design

Discrete event simulation of stochastic models

 valid alternative, used for evaluation of overall network

slide-17
SLIDE 17

17

Outcome of calculations Satellite reliability in time interval [0,t] with t in months for different levels of radiation shielding

 Base radiation rate r = 1

(matches 1 mm of shielding)

 Increased shielding: r = 0.4, r = 0.2  Reduced shielding: r = 2, r = 5

Probability of no permanent critical system failures during [0,t]

slide-18
SLIDE 18

18

Some more results wrt reliability Different levels of redundancy for

 Batteries

Note, failures are unrelated to radiation Two batteries seem ok

 Regulators

Redundancy does not improve TID immunity since processor or communication system fail due to TID much earlier

slide-19
SLIDE 19

19

Results wrt to Performance and Dependability Baseline results for r = 1 A set of simulations experiments are performed for

  • Different levels of r
  • Different protocols
  • Configurations
  • Buffer capacities
  • Data collection rates
  • Communication with
  • ther commercial networks
slide-20
SLIDE 20

20

What does this study illustrate?

portion/facet real world formal / computer aided analysis solution, rewards, functional properties probability model, stochastic process transformation presentation transfer decision description perception solution to real world problem real world problem formal model

slide-21
SLIDE 21

Memo

21

Rene Magritte, The Treachery of Images, 1928-29

slide-22
SLIDE 22

What does this study illustrate? Model vs System

 largely simplified formal/mathematical/stochastic model

implemented in software in a fully controlled environment

 set of physical devices interacting in space-time in an largely

uncontrolled, not fully understood environment

Model

 includes some of the rules how the system operates, excludes

  • thers

 includes some aspects of the real world as random variables,

ignores others or assumes them as constant

 is parameterized with respect to certain design variables

Study

 has an objective, a clear question  delivers values that are probabilities like R(0,t)

 Interpretation?

 evaluates effects of different design choices

22

slide-23
SLIDE 23

23

Open Issues: How does the stochastic model look like ?

 Modeling notation:

 compositional stochastic activity network

 Modeling framework:

 Mobius: multi-paradigm multi-solution framework

from W.H. Sanders, UIUC

How to obtain numerical values used in modeling?

 Data analysis

How do the calculations work ?

 Analysis of continuous time Markov chains  Discrete event simulation of stochastic models

How interpret results and how to decide if computed values are trustworthy?

 Output analysis, Verification and Validation

And what happened with the satellites ?

slide-24
SLIDE 24

24

How the story ended:

“A launch attempt with the Illinois Observing Nanosatellite (ION) was made on July 26, 2006. Unfortunately, ION and the 17 other satellites on the launch did not make it into orbit due to a launch failure in the second minute. ION was to be deployed in the fifteenth minute. The launch provider is carefully evaluating the root cause of the failure,and …”

  • P. Thakker, http://www.amsat.org/amsat/archive/amsat-bb/200607/msg00444.html

So success is not easily earned with space technology, at least no human was killed in that mission. Obviously a dependability analysis of the carrier rocket system would give us an interesting topic as well.

slide-25
SLIDE 25

25

Summary Today:

 Example case study

Future:

Probability Theory Primer, Statistics Concepts Stochastic Input Modeling Different types of stochastic workloads Simulation Models (static, dynamic) Output Analysis Verification, Validation, Testing Data Analysis Classics:

 Preparation,  Finding Patterns, Explanations, and

Predictors