Replication and Robust Results Jim Herbsleb School of Computer - - PowerPoint PPT Presentation

replication and robust results
SMART_READER_LITE
LIVE PREVIEW

Replication and Robust Results Jim Herbsleb School of Computer - - PowerPoint PPT Presentation

Replication and Robust Results Jim Herbsleb School of Computer Science Carnegie Mellon University jdh@cs.cmu.edu http://conway.isri.cmu.edu/~jdh/ Science Is Based on a Peculiar Logic Experimental method Relationship => hypothesis


slide-1
SLIDE 1

Replication and Robust Results

Jim Herbsleb

School of Computer Science Carnegie Mellon University jdh@cs.cmu.edu http://conway.isri.cmu.edu/~jdh/

slide-2
SLIDE 2

Science Is Based on a Peculiar Logic

  • Experimental method
  • Relationship => hypothesis
  • Hypothesis is true
  • Conclude relationship is true
  • Affirming the consequent
  • A => B
  • B is true
  • Conclude A
slide-3
SLIDE 3

Many-Layered Problem

Theory Relationship Hypothesis Measures

slide-4
SLIDE 4

Many-Layered Problem

Theory Relationship Hypothesis Measures

slide-5
SLIDE 5

Robust Results

  • Results consistent as “irrelevant” things

vary

slide-6
SLIDE 6

Multi-site Delay

10 20 30 multi site single site 12.7 4.9

Network Element A

Work Days

Last Modification - First Modification All changes for 2-year period

Modification Request (MR) interval

Herbsleb, J.D. & Mockus, A. (2003). An Empirical Study of Speed and Communication in Globally- Distributed Software Development. IEEE Transactions on Software Engineering, 29, 3, pp. 1-14.

slide-7
SLIDE 7

7

Modeling Interval

Variable Measure used in models MR interval Log of number of days, first delta to last delta Number of people Log of number of people Diffusion Log of number of modules touched by change Size Log of number of delta Time Date Severity Is high severity Fix Is fix Multi-site Set of sites of all actors has more than one element

slide-8
SLIDE 8

H1 Multi-site work just takes longer

Multi-site Size Number of People Work Interval Diffusion

H2 Multi-site MRs are larger, take longer H3 Multi-site MRs are more diffuse, take longer H4 Multi-site MRs involve more people, take longer

slide-9
SLIDE 9

Graphical model of work interval for Network Element A

slide-10
SLIDE 10

H1 Multi-site work just takes longer H2 Multi-site MRs are larger, take longer H3 Multi-site MRs are more diffuse, take longer H4 Multi-site MRs involve more people, take longer

148.9 0.25 Multi-site Size 199.7 0.27 154.1 0.24 35.9 0.12 Number of People Work Interval Diffusion

slide-11
SLIDE 11

H1 Multi-site work just takes longer H2 Multi-site MRs are larger, take longer H3 Multi-site MRs are more diffuse, take longer H4 Multi-site MRs involve more people, take longer

148.9 0.25 Multi-site Size 199.7 0.27 154.1 0.24 35.9 0.12 Number of People Work Interval Diffusion

slide-12
SLIDE 12

H1 Multi-site work just takes longer H2 Multi-site MRs are larger, take longer H3 Multi-site MRs are more diffuse, take longer H4 Multi-site MRs involve more people, take longer

148.9 0.25 Multi-site Size 199.7 0.27 154.1 0.24 35.9 0.12 Number of People Work Interval Diffusion

slide-13
SLIDE 13

H1 Multi-site work just takes longer H2 Multi-site MRs are larger, take longer H3 Multi-site MRs are more diffuse, take longer H4 Multi-site MRs involve more people, take longer

148.9 0.25 Multi-site Size 199.7 0.27 154.1 0.24 35.9 0.12 Number of People Work Interval Diffusion

slide-14
SLIDE 14

The Decision . . .

  • Published in ICSE
  • What next?
  • Declare victory and move on?
  • Replicate with different data?
  • What was different?
  • Locations
  • People
  • Product
  • Software type
slide-15
SLIDE 15

Multi-site Delay

10 20 30 multi site single site 12.7 4.9 18.1 6.9

Network Element A Network Element B

Work Days

Last Modification - First Modification All changes for 2-year period

Modification Request (MR) interval

Herbsleb, J.D. & Mockus, A. (2003). An Empirical Study of Speed and Communication in Globally- Distributed Software Development. IEEE Transactions on Software Engineering, 29, 3, pp. 1-14.

slide-16
SLIDE 16

Graphical model of work interval for Network Element A (left) and B (right)

148.9 0.25 Multi-site Size 199.7 0.27 154.1 0.24 35.9 0.12 Number of People Work Interval Diffusion

slide-17
SLIDE 17

96.2

  • 0.13

2009.7 0.55 566.8 0.25 701.7 0.34 Number of People Work Interval Diffusion Multi-site Size 148.9 0.25 Multi-site Size 199.7 0.27 154.1 0.24 35.9 0.12 Number of People Work Interval Diffusion

Graphical model of work interval for Network Element A (left) and B (right)

slide-18
SLIDE 18

Thoughts on Replication

  • Replicating the result was a bit scary
  • What do we do if the results are different?
  • But that’s science
  • How similar must results be?
slide-19
SLIDE 19

Graphical model of work interval for Network Element A (left) and B (right)

slide-20
SLIDE 20

Closer? More Differentiated?

  • Would we have learned more from a

closer replication?

  • From a more differentiated replication?
  • Differentiated how?
  • What would we have learned?
slide-21
SLIDE 21

Replication is Always about Generalization

  • Close replication
  • Generalize over concrete instances
  • Differentiated replication
  • Generalize over additional variables
  • External replication
  • Generalize over experimenters/labs
slide-22
SLIDE 22

What Do You Learn?

Closer More differentiated Amount Of Learning Same result: Robust effect Different result: Many possible causes Original result Was anomalous Effect unlikely To be anomalous

slide-23
SLIDE 23

Most of Science is Replication

Theory Relationship Hypothesis Measures

slide-24
SLIDE 24