What We Learned By Using the Rigor Metric Emily S. Patterson, PhD - - PowerPoint PPT Presentation

what we learned by using the rigor metric
SMART_READER_LITE
LIVE PREVIEW

What We Learned By Using the Rigor Metric Emily S. Patterson, PhD - - PowerPoint PPT Presentation

What We Learned By Using the Rigor Metric Emily S. Patterson, PhD (and numerous colleagues) Need for Analytic Rigor: Avoid Surprise August 7, 1998 Bombings of US Embassies in Africa 224 killed, including 12 US personnel Need for Rigor


slide-1
SLIDE 1

What We Learned By Using the Rigor Metric

Emily S. Patterson, PhD

(and numerous colleagues)

slide-2
SLIDE 2

August 7, 1998 Bombings of US Embassies in Africa 224 killed, including 12 US personnel Need for Analytic Rigor: Avoid Surprise

slide-3
SLIDE 3

NASA Columbia Accident Investigation

"Another lack of rigor cited by the panel is the widespread use of PowerPoint presentations in lieu of actual engineering data and analyses.”

Need for Rigor Transparency: Be Calibrated

slide-4
SLIDE 4

Down - Collect Conflict & Corroboration Hypothesis Exploration

How to Increase Rigor: Broadening Checks (“up” arrows)

Elm, W., Potter, S., Tittle, J., Woods, D.D., Grossman, J., Patterson, E.S. (2005). Finding decision support requirements for effective intelligence analysis tools. Proceedings of the Human Factors and Ergonomics Society 49th Annual Meeting.

slide-5
SLIDE 5

Rigor Metric

slide-6
SLIDE 6

What We Did with the Rigor Metric

  • Study 1: Plan to move troops and supplies

– 12, 3-person teams – Undergraduate students (security & intelligence specialization)

  • Study 2: Causes and impacts of Ariane 501 accident

– 2 novice, 8 expert intelligence analysts – 1 coder; entire session

  • Cases: Low-Moderate-High for 8 attributes

– 1992, 2008: Separatist movements in Georgia – 2009: Lebanese elections/pro-Western shift potential – 1999-09: Chavez manipulation of democracy to retain power – 2009: Uyghur separatist movement/regional stability

slide-7
SLIDE 7

Study 1: 3-Person Team; Logistics Planning Task

Constraints: Fastest (<2.5 hours) Cheapest (least fuel) Secure:

  • Avoid railway (enemy agents)
  • Avoid route C (most attacks)

Best answer: Trucks with supplies on route A Armored vehicles for troops on route B

slide-8
SLIDE 8

Solution Scoresheet

slide-9
SLIDE 9

Team 3 Rigor Measures

slide-10
SLIDE 10

Explanation Critiquing: Example of High and Low High (Team 3): Extensive error checking throughout task Low (Team 10): Supportive throughout: “I think that’s probably the best way to go.” Low (Team 12): Blanket agreement with dominant, intimidating leader

slide-11
SLIDE 11

Task Fidelity: No Specialist Collaboration

slide-12
SLIDE 12

Task/Participant Fidelity: No “High” Scores on 3 Attributes

slide-13
SLIDE 13

Perfect Task Score: Team 3

slide-14
SLIDE 14

Perfect Task Score: Team 10

slide-15
SLIDE 15

Perfect Task Score: Team 11

slide-16
SLIDE 16

Low Performance (32%): Team 6

slide-17
SLIDE 17

Low Performance (40%): Team 7

slide-18
SLIDE 18

Study 2: Individual Analysts; Ariane 501 Accident

  • Time:

2 hour session (avg = 55 min)

  • Task:

Causes, impacts of Ariane 501 accident

  • Participants: 10 NASIC analysts (avg = 13 yrs)
  • Tools:

Search/browse features of Pathfinder

  • Data:

“On topic” database (~2000 documents)

  • Briefing:

Verbal (video-taped)

  • Procotol:

Think aloud, semi-structured interviews

  • Analysis:

Process tracing, briefing accuracy (к = 0.84)

slide-19
SLIDE 19

› Repeated inaccurate information › Missed update that changed assessment › Inapplicable assumption

Embedded Risks for Inaccurate Statements

The monetary loss can be recovered by the insurance...

Cluster Satellite Program

program cancelled rebuild 1 rebuild all 4 lost satellites (no insurance)

slide-20
SLIDE 20

Task Fidelity: No Spec Collab or Expl Critiquing

slide-21
SLIDE 21

2 Novices vs. 8 Experts

slide-22
SLIDE 22

Rely on Weak vs. ‘High Profit’ Documents

slide-23
SLIDE 23

The Bottom Line: What We Learned Study 1

  • Reliable for 3-person team over session (K=0.92)
  • Discovers task and participant fidelity issues

Study 2

  • (Likely) Detects novice-expert differences
  • New insights from old study data

Cases

  • All attributes worked (all cases)
  • Low vs. High easy; Moderate more variable
  • Jargon (knowledge shields)
  • Context-dependent risks:
  • Linguistic barriers (information search)
  • Polarized issues (information validation)
  • Limited access (specialist collaboration)
  • Deliberate deception (stance analysis)
slide-24
SLIDE 24

Next Step: Guidance for When to Invest ‘More’