Model Checking Contest Report for 2012



slide-1
SLIDE 1

Model Checking Contest

Report for 2012

Fabrice Kordon - LIP6/MoVe, UPMC, France
Alban Linard - CUI/SMV, Univ. Genève, Switzerland
Franck Pommereau - IBISC, Univ. Evry Val d’Essonne

slide-2
SLIDE 2
  • F. Kordon - LIP6/MoVe - UPMC

SUMo 2011 - Model Checking Contest report - June 26, 2012

Contents

  • Objectives
  • Evaluation procedure
  • The models
  • Participating tools
  • Analysis of the results
  • Concluding remarks

slide-3
SLIDE 3
Special thanks to those who helped to organize this MCC, in particular Nicolas Gibelin (cluster), Lom Hillah (PNML), Emmanuel Paviot-Adet (models)

slide-4
SLIDE 4

Objectives

slide-5
SLIDE 5
When It Comes to Dealing with Large and Complex Systems...

Lots of questions are raised...

  • To verify highly concurrent systems, should we use a symmetry-based or a partial order-based model checker?
  • For models with large variable domains, should we use a decision diagram-based or a symmetry-based model checker?
  • Can we combine structural reduction techniques with partial-order or symmetry-based ones?
  • ...

A large variety of model checking techniques and their potential combinations
A large variety of model categories
A challenge with large-scale specifications
A need to evaluate current MC implementations in the fairest way

slide-6
SLIDE 6
The Objectives...

MCC is intended to:

  • Exchange experience between tool programmers
  • Imagine combinations of techniques, and thus better tools
  • Stimulate the development of tools
  • Provide visibility to these tools

MCC can also be of great help for the PN community (and users):

  • Define a common set of models for benchmarks
  • Identify experimentally classes of problems (in models), and the techniques able to cope with a given class of problems
  • Improve communication between tools (and PNML ;-) )
  • Provide raw data for comparison

This is the second edition

We hope for more editions, for an enhanced analysis and evaluation of tools

slide-7
SLIDE 7

Evaluation Procedure

slide-8
SLIDE 8
What is to be measured?

The «enemies» of model checking

  • Memory consumption
  • CPU consumption

«Examinations» to be processed

  • State space generation
  • Formula evaluation: structural formulas, reachability formulas, CTL formulas, LTL formulas

Another 2012 innovation

  • Models proposed by the community («call for models»)
  • 7 models in 2011
  • 19 models in 2012 (including the 7 from 2011)

slide-10
SLIDE 10
Special thanks to the community, who provided interesting models: 12 new models coming from 5 institutions

  • Univ. Evry Val d’Essonne, France
  • Univ. Geneva, Switzerland
  • Univ. P. & M. Curie, France
  • Univ. Paris 13, France
  • Univ. Rostock, Germany
slide-11
SLIDE 11
Evaluation procedure

Execution on a dedicated cluster (23 nodes)

  • PowerEdge R410 (6 gigabit ports) with 1.5 TB local disks
  • 8 GB memory (DDR3, 1333)
  • Intel Xeon E5645 @ 2.40 GHz (6 cores, 12 threads)
  • Cache: L1 = 192 kB, L2 = 1536 kB, L3 = 12288 kB

Run = execution of a tool for one examination on one model/scale

  • A run is executed in a virtual machine
  • We process runs until one fails (to check how far a tool goes)

A benchmark script launches all runs

  • With time confinement: 3600 s per run
  • With memory confinement: 4 GB per run
  • Time and memory measures
  • CPU and memory evolution
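The confinement and the "process runs until one fails" rule can be illustrated with a small sketch. This is not the contest's benchmark script: the contest ran each tool inside a virtual machine, whereas this sketch swaps in plain operating-system resource limits on a Unix host. The function names `run_confined` and `run_until_failure` are hypothetical; only the 3600 s and 4 GB limits come from the slide.

```python
import resource
import subprocess
import sys

TIME_LIMIT_S = 3600            # time confinement per run (from the slide)
MEMORY_LIMIT_B = 4 * 1024**3   # memory confinement per run (4 GB, from the slide)


def _limit_memory(limit_bytes):
    """Build a hook that caps the address space of the child process (Unix only)."""
    def apply():
        soft, hard = resource.getrlimit(resource.RLIMIT_AS)
        # Never ask for more than the hard limit already imposed on us.
        new_soft = limit_bytes if hard == resource.RLIM_INFINITY else min(limit_bytes, hard)
        resource.setrlimit(resource.RLIMIT_AS, (new_soft, hard))
    return apply


def run_confined(cmd, time_limit_s=TIME_LIMIT_S, memory_limit_b=MEMORY_LIMIT_B):
    """Execute one run (hypothetical helper); return 'ok', 'timeout' or 'failed'."""
    try:
        done = subprocess.run(
            cmd,
            timeout=time_limit_s,
            preexec_fn=_limit_memory(memory_limit_b),
            stdout=subprocess.DEVNULL,
            stderr=subprocess.DEVNULL,
        )
    except subprocess.TimeoutExpired:
        return "timeout"
    return "ok" if done.returncode == 0 else "failed"


def run_until_failure(scales, run_one):
    """Process increasing scales until one run fails; return the scales reached."""
    reached = []
    for scale in scales:
        if run_one(scale) != "ok":
            break
        reached.append(scale)
    return reached
```

A caller would pass `run_until_failure` the list of scaling parameters of a model and a function that builds the tool's command line for one scale and hands it to `run_confined`; the last element of the result is then the largest scale the tool handled within the limits.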

slide-16
SLIDE 16
2419 runs processed!

  • State space: 639
  • Formulas: 1780

VM deployment: 6 h!

Optimized technique compared to 2011: runs are dispatched all over the cluster

slide-17
SLIDE 17
Difficulties

The cluster

  • Was delivered later than expected
  • Old nodes could not run virtualization

The formulas

  • Last year's solution was not satisfactory: based on invariants, too «easy» formulas, one set per model
  • This year's solution: one set per run; two formats, XML and textual (update of the grammar)
  • But... a nightmare

Other technical difficulties

  • Fighting with qemu
  • Change of structure for formulas
  • Providing a PNML form for submitted models

slide-18
SLIDE 18

The Models

slide-19
SLIDE 19
Presentation of the Models

Model name      Model type     Safe  Deadlock  Free choice  State machine  Event graph  Reversible
cs_repetitions  colored + P/T  !     !         ?            ?              ?            ?
rwmutex         colored + P/T  "     !         !            !              !            "
echo            colored + P/T  "     "         !            !              !            !
eratosthenes    colored + P/T  "     "         ?            ?              ?            ?
galloc_res      colored + P/T  !     !         ?            ?              ?            ?
lamport_fmea    colored + P/T  "     !         !            !              !            !
neoelection     colored + P/T  "     "         !            !              !            "
philo_dyn       colored + P/T  "     "         ?            ?              ?            ?
planning        colored + P/T  !     ?         !            !              ?            ?
railroad        colored + P/T  !     !         ?            ?              ?            ?
ring            colored + P/T  "     ?         !            !              ?            ?
simple_lbs      colored + P/T  "     !         !            !              !            !
FMS             P/T            !     ?         ?            ?              ?            ?
Kanban          P/T            !     ?         ?            ?              ?            ?
MAPK            P/T            !     ?         ?            ?              ?            ?
Peterson        colored        "     ?         ?            ?              ?            ?
Philosophers    colored        "     ?         ?            ?              ?            ?
SharedMemory    colored        "     !         ?            ?              ?            ?
TokenRing       colored        "     ?         ?            ?              ?            ?

Legend: models from 2011; models proposed for 2012

slide-20
SLIDE 20
Diversification of characteristics
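Three of the characteristics listed for the models (free choice, state machine, event graph) are purely structural and can be checked on the net graph alone. The sketch below uses the standard textbook definitions for ordinary P/T nets (arc weights ignored); the net encoding as sets of place names, transition names and arcs is an assumption made for illustration, not the PNML format used by the contest.

```python
def presets_postsets(places, transitions, arcs):
    """Compute the input (pre) and output (post) node sets of every node."""
    pre = {n: set() for n in places | transitions}
    post = {n: set() for n in places | transitions}
    for src, dst in arcs:
        post[src].add(dst)
        pre[dst].add(src)
    return pre, post


def is_state_machine(places, transitions, arcs):
    """Every transition has exactly one input place and one output place."""
    pre, post = presets_postsets(places, transitions, arcs)
    return all(len(pre[t]) == 1 and len(post[t]) == 1 for t in transitions)


def is_event_graph(places, transitions, arcs):
    """Every place has exactly one input transition and one output transition."""
    pre, post = presets_postsets(places, transitions, arcs)
    return all(len(pre[p]) == 1 and len(post[p]) == 1 for p in places)


def is_free_choice(places, transitions, arcs):
    """For every arc (p, t): t is the only output of p, or p is the only input of t."""
    pre, post = presets_postsets(places, transitions, arcs)
    return all(post[p] == {t} or pre[t] == {p} for p in places for t in post[p])


# A two-place cycle: p1 -> t1 -> p2 -> t2 -> p1
cycle = ({"p1", "p2"}, {"t1", "t2"},
         {("p1", "t1"), ("t1", "p2"), ("p2", "t2"), ("t2", "p1")})

# A conflict that is not free: p feeds t1 and t2, but t2 also needs q
conflict = ({"p", "q"}, {"t1", "t2"},
            {("p", "t1"), ("p", "t2"), ("q", "t2")})
```

The cycle satisfies all three properties, while the conflict net satisfies none of them. Properties such as safeness, deadlock and reversibility depend on the reachable markings and cannot be decided this way in general.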

slide-21
SLIDE 21

Participating Tools

slide-22
SLIDE 22
The Submissions (order of arrival ;-)

#   Tool           From
1   lola-binstore  Univ. Rostock
2   lola-bloom     Univ. Rostock
3   sara           Univ. Rostock
4   neco           Univ. Evry Val d'Essonne
5   pnxdd          Univ. P. & M. Curie
6   marcie         Univ. Cottbus
7   helena         Univ. Paris 13
8   AlPiNA         Univ. Geneva
9   crocodile      Univ. P. & M. Curie
10  ITS-tools      Univ. P. & M. Curie

slide-24
SLIDE 24
Provided in their VM

Nice hotline too ;o)
slide-25
SLIDE 25
Participating tools: Supported techniques

State space generation

  • Decision diagrams (any kind)
  • Explicit representation of the state space
  • Exploitation of the system symmetries
  • Use of «topological» information (siphons, traps, invariants, etc.)

Formula evaluation

  • Abstractions (on-the-fly state elimination)
  • Decision diagrams (any kind)
  • Explicit representation of the state space
  • Use of a constraint solver (SAT, SMT)
  • Use of structural reductions (Berthelot’s, Haddad’s, etc.)
  • Use of partial order techniques

slide-26
SLIDE 26
Also combinations of such techniques

State space

  • ITS-Tools: decision diagrams + symmetries
  • PNXDD, ITS-Tools: decision diagrams + topological

Formula evaluation

  • LoLA*: explicit + partial orders + topological
  • Sara: abstraction + SAT/SMT + decision diagrams

slide-27
SLIDE 27
Models & examinations processed by tools (State Space)

LoLA* and Sara did not participate in the state space generation

[Chart: AlPiNA, state space generation; legend: participated, max value reached; x-axis: models (FMS, Kanban, MAPK, echo, rwmutex, planning, ring, Peterson, Philosophers, SharedMemory, TokenRing, cs_repetitions, galloc_res, lamport_fmea, philo_dyn, neo-election, simple_lbs, eratosthenes, railroad)]

slide-28
SLIDE 28
Models & examinations processed by tools (State Space)

[Chart: Crocodile, state space generation; legend: participated, max value reached; x-axis: the 19 models]

slide-29
SLIDE 29
Models & examinations processed by tools (State Space)

[Chart: Helena, state space generation; legend: participated, max value reached; x-axis: the 19 models]

slide-30
SLIDE 30
Models & examinations processed by tools (State Space)

[Chart: ITS-Tools, state space generation; legend: participated, max value reached; x-axis: the 19 models]

slide-31
SLIDE 31
Models & examinations processed by tools (State Space)

[Chart: Marcie, state space generation; legend: participated, max value reached; x-axis: the 19 models]

slide-32
SLIDE 32
Models & examinations processed by tools (State Space)

[Chart: Neco, state space generation; legend: participated, max value reached; x-axis: the 19 models]

slide-34
SLIDE 34
Models & examinations processed by tools (State Space)

[Chart: PNXDD, state space generation; legend: participated, max value reached; x-axis: the 19 models]

Three «killing models»: cs_repetitions, planning, neo-election

slide-35
SLIDE 35
Models & examinations processed by tools (Reachability)

Only AlPiNA, Helena, LoLA* and Sara participated

[Chart: AlPiNA, reachability formulas; legend: participated, max value reached, % processed formulas; x-axis: the 19 models]

slide-36
SLIDE 36
Models & examinations processed by tools (Reachability)

[Chart: Helena, reachability formulas; legend: participated, max value reached, % processed formulas; x-axis: the 19 models]

slide-37
SLIDE 37
Models & examinations processed by tools (Reachability)

[Chart: LoLA-binstore, reachability formulas; legend: participated, max value reached, % processed formulas; x-axis: the 19 models]

slide-38
SLIDE 38
Models & examinations processed by tools (Reachability)

[Chart: LoLA-bloom, reachability formulas; legend: participated, max value reached, % processed formulas; x-axis: the 19 models]

slide-40
SLIDE 40
Models & examinations processed by tools (Reachability)

[Chart: Sara, reachability formulas; legend: participated, max value reached, % processed formulas; x-axis: the 19 models]

Three «killing models»: cs_repetitions, planning, ring

slide-41
SLIDE 41
Models & examinations processed by tools (Structural)

Only AlPiNA and Helena participated

[Chart: AlPiNA, structural formulas; legend: participated, max value reached, % processed formulas; x-axis: the 19 models]

slide-43
SLIDE 43
Models & examinations processed by tools (Structural)

[Chart: Helena, structural formulas; legend: participated, max value reached, % processed formulas; x-axis: the 19 models]

Eight «killing models»: cs_repetitions, echo, eratosthenes, galloc_res, MAPK, planning, railroad, ring

slide-44
SLIDE 44
The «surprise model»

We had a bunch of «big ones»

  • From Paris and from Rostock

Unfortunately...

  • Formula analysis ended last Sunday evening (about 23:00), due to the number of «small problems» to be solved
  • No possibility to run this examination this year

slide-45
SLIDE 45

Analysis of the Results

slide-46
SLIDE 46
Results were difficult to analyze

The execution itself was shorter than expected

  • Around 6+8 hours (both examinations)

But outcomes were much bigger (see next slide)

State space analysis

  • This is possible; «comparison» is also possible

Formula evaluation

  • This is more difficult
  • Some problems come from the original requirements
  • Comparison impossible: not all tools process the same subset of formulas, and most formulas were false

More work is needed on formulas for the next edition

slide-47
SLIDE 47
How to perform the analysis

No interest (any more than last year) in a «race»

654 charts generated

  • 358 for the state space examination: comparison of CPU, elapsed time, memory; evolution of memory and CPU; radars
  • 296 for the formulas examination (reachability and structural): comparison of CPU, elapsed time, memory (not meaningful); radars

Identification (partial) of some «surprises» discovered when the tests were processed

  • How tools scale up (P/T and colored)
  • Some observations on time and memory consumption
  • Feedback related to tools’ characteristics

slide-48
SLIDE 48
Interesting facts: initial cost of some techniques

[Chart: Memory for state space generation (FMS); x-axis: scaling parameter; y-axis: memory (MB); tools: AlPiNA, ITS-Tools, Marcie, PNXDD; generated 24/06/2012, 11:26]

slide-49
SLIDE 49
Philosophers, comparison with last year

[Chart: CPU for state space generation (Philosophers); x-axis: scaling parameter (log scale); y-axis: time (s), log scale; tools: AlPiNA, Helena, ITS-Tools, Marcie, PNXDD; generated 24/06/2012, 11:26]

Marcie (new tool) is doing quite well

slide-50
SLIDE 50
Stability of some techniques

Decision diagrams for Marcie

[Chart: Memory for state space generation (SharedMemory); x-axis: scaling parameter; y-axis: memory (MB); tools: AlPiNA, Crocodile, Helena, ITS-Tools, Marcie, PNXDD; generated 24/06/2012, 11:26]

slide-51
SLIDE 51
Eratosthenes: scalability of decision diagrams?

[Chart: Memory for state space generation (eratosthenes); x-axis: scaling parameter; y-axis: memory (MB); tools: AlPiNA, ITS-Tools, Marcie, Neco; generated 24/06/2012, 11:26]

Eratosthenes 500 = 4.13 x 10^121 states

slide-52
SLIDE 52
Sometimes, «simple» is better

Helena beats them all (philo_dyn 50 = 2.26 x 10^6 states)

[Chart: Memory for state space generation (philo-dyn); x-axis: scaling parameter (log scale); y-axis: memory (MB); tools: AlPiNA, Helena, ITS-Tools, Marcie, Neco, PNXDD; generated 24/06/2012, 11:26]

slide-53
SLIDE 53
Rwmutex

[Chart: Memory for state space generation (rwmutex); x-axis: scaling parameter (w, r=10), log scale; y-axis: memory (MB); tools: AlPiNA, ITS-Tools, Marcie, Neco; generated 24/06/2012, 11:26]

The tool that goes furthest uses a combination of techniques

slide-54
SLIDE 54
Simple_lbs

[Chart: Memory for state space generation (simple-lbs); x-axis: scaling parameter; y-axis: memory (MB); tools: AlPiNA, ITS-Tools, Marcie, Neco, PNXDD; generated 24/06/2012, 11:26]

The tools that go furthest also use a combination of techniques

slide-55
SLIDE 55
CS_repetition?

[Chart: PNXDD, CPU/memory over execution for cs-repetitions (25); x-axis: execution time (s); series: CPU, memory; generated 26/06/2012, 11:46]

slide-56
SLIDE 56
CS_repetition?

[Chart: Marcie, CPU/memory over execution for cs-repetitions (25); x-axis: execution time (s); series: CPU, memory; generated 26/06/2012, 11:46]

slide-58
SLIDE 58
CS_repetition?

[Chart: AlPiNA, CPU/memory over execution for cs-repetitions (25); x-axis: execution time (s); series: CPU, memory; generated 26/06/2012, 11:46]

Clearly a CPU challenge

slide-59
SLIDE 59
planning

[Chart: AlPiNA, CPU/memory over execution for planning (fixed); x-axis: execution time (s); series: CPU, memory; generated 26/06/2012, 11:53]

slide-61
SLIDE 61
planning

[Chart: ITS-Tools, CPU/memory over execution for planning (fixed); x-axis: execution time (s); series: CPU, memory; generated 26/06/2012, 11:53]

Another CPU challenge

slide-63
SLIDE 63
neo-election

[Chart: AlPiNA, CPU/memory over execution for neo-election (2); x-axis: execution time (s); series: CPU, memory; generated 26/06/2012, 11:56]

Still a CPU challenge

slide-64
SLIDE 64
On formulas

It is impossible to really evaluate

  • Charts have been generated, but with no real meaning
  • No execution chart (tracking bug)

On the number of evaluated reachability formulas

  • LoLA-binstore is clearly better (LoLA-bloom has more failures)
  • AlPiNA does not scale up well

On the number of evaluated structural formulas

  • AlPiNA beats Helena (more formulas and fewer failures)

Let us note that tools sometimes diverge

  • The vector of evaluated formulas is not the same ;-)
  • On the state space too, but the notion of state (symbolic, explicit) may not be the same
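One way to make the divergence between tools concrete is to restrict the comparison to the formulas that every tool evaluated, and then list the formulas on which the verdicts differ. The sketch below is only an illustration under assumed inputs: the mapping from tool name to `{formula id: verdict}` and the sample verdicts are hypothetical, not contest data or the contest's output format.

```python
def compare_verdicts(results):
    """Compare verdicts on the formulas evaluated by all tools.

    results maps a tool name to a dict {formula_id: bool}.
    Returns (common formula ids, {formula_id: {tool: verdict}} for disagreements).
    """
    tools = sorted(results)
    if not tools:
        return [], {}
    # Only formulas processed by every tool can be compared fairly.
    common = set.intersection(*(set(results[t]) for t in tools))
    disagreements = {
        f: {t: results[t][f] for t in tools}
        for f in sorted(common)
        if len({results[t][f] for t in tools}) > 1
    }
    return sorted(common), disagreements


# Hypothetical verdicts for two tools; tool_b did not evaluate formula f3.
sample = {
    "tool_a": {"f1": True, "f2": False, "f3": False},
    "tool_b": {"f1": True, "f2": True},
}
common, diverging = compare_verdicts(sample)
```

Here `common` contains `f1` and `f2`, and `diverging` reports `f2`, on which the two hypothetical tools disagree. A disagreement does not say which tool is wrong; it only flags a formula that needs a closer look.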

slide-65
SLIDE 65

Concluding Remarks

slide-66
SLIDE 66
Outcomes

Mixed results for formulas

  • Lots of difficulties... and lessons learned
  • Not much time left for analysis (and big data to go through)

MCC 2013 @ Petri Nets?

  • The team is ready to go
  • Need to provide more help to tool submitters: a way to execute a tool on the first instance of each model?

What to propose in MCC 2013 @ Petri Nets?

  • Nothing more complex (we must stabilize the procedure)
  • More memory (but 64-bit VMs then)
  • More models: models from 2012 and more?
  • A finer classification of properties: bounds + deadlocks + mixed; satisfiable + unsatisfiable
  • The «surprise model»

slide-67
SLIDE 67
Downloading Participating Tools

AlPiNA - CUI, University of Geneva - Switzerland

http://cui.unige.ch/~buchs/Site/About_Me.html

Crocodile - LIP6, Université P. & M. Curie - France

http://www.cosyverif.org (as a part of the environment)

Helena - LIPN, Université Paris 13 - France

http://www-lipn.univ-paris13.fr/~evangelista/

ITS_Tools - LIP6, Université P. & M. Curie - France

http://ddd.lip6.fr and http://www.cosyverif.org (as a part of the environment)

LoLA Binstore - University of Rostock - Germany

http://www.informatik.uni-rostock.de/tpp/lola/

LoLA Bloom - University of Rostock - Germany

http://www.informatik.uni-rostock.de/tpp/lola/

Marcie - BTU-Cottbus - Germany

http://www-dssz.informatik.tu-cottbus.de/

Neco - IBISC, Université Evry Val d'Essonne - France

www.ibisc.fr/~lfronc/

PNXDD - LIP6, Université P. & M. Curie - France

http://move.lip6.fr and http://www.cosyverif.org (as a part of the environment)

Sara - University of Rostock - Germany

http://www.informatik.uni-rostock.de/tpp/lola

slide-68
SLIDE 68

Discussion