

SLIDE 1

Mining Specifications from Documentation Using a Crowd

*Peng Sun *Chris Brown ^Ivan Beschastnikh *Kathryn Stolee

* NC State University ^ University of British Columbia



SLIDE 3

Software systems and libraries usually lack up-to-date formal specifications.

  • Rapid software evolution
  • Formal specifications are non-trivial to write down

Software Specifications

SLIDE 4

Lack of Formal Specifications

Maintainability & Reliability Challenges

  • Reduced code comprehension
  • Implicit assumptions may cause bugs
  • Difficult to identify regressions

Software Specification Mining

Software Specifications

SLIDE 5

  • Many existing specification mining algorithms

Most automatically infer specs from execution traces

Finite State Automata (FSA). Examples: k-tail, CONTRACTOR++, SEKT, TEMI, Synoptic, …

Software Specification Mining

TSE 1972, ICSE 2006, ASE 2009, FSE 2011, FSE 2014, ICSE 2014, TSE 2015, ASE 2015, …


SLIDE 7

But, automation is a dimension

Entirely manual (prior to 1990s): formal methods experts

SLIDE 8

But, automation is a dimension

Entirely manual (prior to 1990s): formal methods experts. Completely automated: 1990s to present.

SLIDE 9

But, automation is a dimension

Entirely manual (prior to 1990s): formal methods experts. Completely automated: 1990s to present.

  • Expensive
  • Not scalable
  • False positives
  • Requires artifact diversity
  • Requires accurate artifacts
SLIDE 10

Our contribution: crowd spec mining from docs

Entirely manual (prior to 1990s): formal methods experts. Crowd mining (SANER 2019, this work). Completely automated: 1990s to present.

  • Expensive
  • Not scalable
  • False positives
  • Requires artifact diversity
  • Requires accurate artifacts
SLIDE 11

Entirely manual (prior to 1990s): formal methods experts. Crowd mining (SANER 2019, this work). Completely automated: 1990s to present.

RQ1: Can crowd do as well as experts?
RQ2: Can crowd improve, or replace, existing spec miners?

SLIDE 12

Crowd-sourcing in SE (not a new idea)

  • Crowd is effective at a variety of SE tasks
  • Testing [1]
  • Evaluating code smells [2]
  • Program synthesis [3]
  • Building software [4]

[1] Dolstra et al. Crowdsourcing GUI tests. ICST 2013. [2] Stolee et al. Exploring the use of crowdsourcing to support empirical studies in software engineering. ESEM 2010. [3] Cochran et al. Program boosting: Program synthesis via crowd-sourcing. SIGPLAN Not. Vol. 50 No. 1. 2015. [4] LaToza et al. Microtask programming: Building software with a crowd. UIST 2014.

SLIDE 13


  • Prior work on crowd mining HW specs [5]. We differ:
  • Use docs instead of traces, SW specs not HW
  • We use standard quality controls, not gamification
  • We improve spec miners/compare to experts

[5] Li et al. Crowdmine: Towards crowdsourced human-assisted verification. DAC 2012.

SLIDE 14

Crowd-sourcing spec mining [CrowdSpec]

Design questions to answer:

  • What kind of spec to mine?
  • What resource to mine specs from?
  • How to solicit contributions from the crowd?
  • How to combine crowd responses?
SLIDE 15

Design question/answers:

  • Type of spec? Temporal APIs
  • What resource? Documentation
  • How to solicit? MTurk microtasks
  • Combining responses? Voting

Crowd-sourcing spec mining [CrowdSpec]

SLIDE 16

Design question/answers:

  • Type of spec? Temporal APIs
  • What resource? Documentation
  • How to solicit? MTurk microtasks
  • Combining responses? Voting

  • Good for humans, if simple
  • Aligns with prior work (can compare)
  • Notoriously difficult [1]; crowd could help?

[1] Legunsen et al. How good are the specs? a study of the bug-finding effectiveness of existing java api specifications. ASE 2016.

Crowd-sourcing spec mining [CrowdSpec]

SLIDE 17

Design question/answers:

  • Type of spec? Temporal APIs
  • What resource? Documentation
  • How to solicit? MTurk microtasks
  • Combining responses? Voting

  • Great for humans (beats traces!)
  • Very few existing spec miners [1]
  • Good temporal NLP is hard

Crowd-sourcing spec mining [CrowdSpec]

[1] Pandita et al. ICON: Inferring temporal constraints from natural language API descriptions. ICSME 2016.

SLIDE 18

Design question/answers:

  • Type of spec? Temporal APIs
  • What resource? Documentation
  • How to solicit? MTurk microtasks
  • Combining responses? Voting

  • Existing platform with critical mass
  • Well-defined econ model: pay per HIT (Human Intelligence Task)

Crowd-sourcing spec mining [CrowdSpec]

SLIDE 19

Design question/answers:

  • Type of spec? Temporal APIs
  • What resource? Documentation
  • How to solicit? MTurk microtasks
  • Combining responses? Voting

  • Lots of flexibility
  • Implements reliability

Crowd-sourcing spec mining [CrowdSpec]

SLIDE 20

CrowdSpec contributions

  • CrowdSpec + SpecForge [1] can perform as well as voting experts: powerful hybrid spec mining alternatives

  • Qualitative analysis of where crowd made mistakes

[1] Le et al. Synergizing specification miners through model fissions and fusions. ASE 2015.

SLIDE 21

Approach overview

SLIDE 22

Approach overview

  • 5 participants/task
  • $0.40 for each task

Crowd Quality Control Strategies:

  • Qualification test
  • Appealing to Participants’ Integrity
  • Random Click Detection
  • Gold Standard Questions
  • Conflict Detection
  • JavaDoc Highlighting
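Per the design choices above, the responses are combined by voting; the metrics slide later states that majority rule determines the crowd's opinion. A minimal sketch of that reduction step, assuming each of the workers returns a boolean verdict per property (the class and method names are our own illustration, not the paper's tooling):

    import java.util.List;

    // Hypothetical helper: reduce one HIT's crowd verdicts to a single decision
    // by majority rule (the paper combines responses by voting).
    public class MajorityVote {

        static boolean majority(List<Boolean> verdicts) {
            long agree = verdicts.stream().filter(v -> v).count();
            return agree * 2 > verdicts.size(); // strict majority, e.g. 3 of 5
        }

        public static void main(String[] args) {
            // Five workers judging whether a proposed property holds.
            System.out.println(majority(List.of(true, true, false, true, false)));  // true
            System.out.println(majority(List.of(true, false, false, true, false))); // false
        }
    }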
SLIDE 23

The crowd must be controlled

Qualification test:

One question from the Qualification Test.

“Where there is power, there is resistance.” -- Foucault

SLIDE 24

Study Design

Task Design:

SLIDE 25

Study Design

Task Design:

HIT with one temporal property (Always Followed By) for clear() and clone(): SpecForge

SLIDE 26

Temporal Constraint Types

  • AF(a,b): a is always followed by b
  • NF(a,b): a is never followed by b
  • AP(b,a): b always precedes a

Example event traces:
  a b a b c b b b a b b a c a a a b b a a a c a a a b b a c b a b
  b b a a c b b b a b b b c a a b
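As an illustration of how one of these constraints can be checked against a single trace (our own sketch, not the paper's mining pipeline), AF(a,b) is violated exactly when some occurrence of a has no later b, so checking the last occurrence of a suffices:

    import java.util.Arrays;
    import java.util.List;

    // Illustrative sketch: check AF(a, b) -- "a is always followed by b" -- on one trace.
    public class AlwaysFollowedBy {

        static boolean af(List<String> trace, String a, String b) {
            int lastA = -1, lastB = -1;
            for (int i = 0; i < trace.size(); i++) {
                if (trace.get(i).equals(a)) lastA = i;
                if (trace.get(i).equals(b)) lastB = i;
            }
            // Violated only if some a has no later b; the last a is the decisive one.
            return lastA == -1 || lastB > lastA;
        }

        public static void main(String[] args) {
            List<String> trace = Arrays.asList("a", "b", "a", "b", "c");
            System.out.println(af(trace, "a", "b")); // true: every a is eventually followed by b
            System.out.println(af(trace, "b", "a")); // false: the final b is never followed by a
        }
    }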


SLIDE 29

The Immediate Temporal Constraints

  • AIF(a,b): a is always immediately followed by b
  • NIF(a,b): a is never immediately followed by b
  • AIP(a,b): a always immediately precedes b

AIF, NIF, and AIP are extensions of AF, NF, and AP

[1] Dwyer et al. Patterns in Property Specifications for Finite-state Verification, ICSE 1999 [2] Yang et al. Perracotta: Mining temporal API rules from imperfect traces. ICSE 2006.
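A companion sketch (again our own illustration, not the paper's tooling) for the immediate variant: AIF(a,b) requires the very next event after every a to be b:

    import java.util.Arrays;
    import java.util.List;

    // Illustrative sketch of AIF(a, b): every occurrence of a must be
    // immediately followed by b in the trace.
    public class AlwaysImmediatelyFollowedBy {

        static boolean aif(List<String> trace, String a, String b) {
            for (int i = 0; i < trace.size(); i++) {
                if (trace.get(i).equals(a)
                        && (i + 1 >= trace.size() || !trace.get(i + 1).equals(b))) {
                    return false; // a at the end of the trace, or followed by something other than b
                }
            }
            return true;
        }

        public static void main(String[] args) {
            System.out.println(aif(Arrays.asList("a", "b", "a", "b"), "a", "b")); // true
            System.out.println(aif(Arrays.asList("a", "c", "a", "b"), "a", "b")); // false: first a is followed by c
        }
    }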

SLIDE 30

Temporal specification

True property: a property is considered true if a program that uses the API and does not follow the property may trigger a Java exception, or if violating the property is impossible in the Java language. Examples: HashSet() always precedes size(); clear() is always followed by size().
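A minimal, hypothetical Java usage sketch (not taken from the paper) exercising the two example properties on java.util.HashSet:

    import java.util.HashSet;

    public class TruePropertyExamples {
        public static void main(String[] args) {
            // HashSet() always precedes size(): size() cannot be called before
            // the constructor has produced the object.
            HashSet<String> names = new HashSet<>();
            names.add("alice");
            System.out.println(names.size()); // 1

            // clear() is always followed by size(): in this usage, every clear()
            // is followed by a size() call that observes the emptied set.
            names.clear();
            System.out.println(names.size()); // 0
        }
    }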

SLIDE 31

Evaluation: ground truth specs

  • Three paper authors manually labeled property instances
  • Targeted 3 Java APIs
  • HashSet
  • StringTokenizer
  • StackAr

API              Instances  Agreement (inter-rater kappa)  % True
HashSet          1,014      0.82                           6% (56)
StringTokenizer  384        0.76                           9% (35)
StackAr          600        0.76                           7% (43)

SLIDE 32

CrowdSpec v. SpecForge

Study         Accuracy  FP     FN
HashSet A     98.03%    0.00%  1.97%
HashSet B     98.03%    0.49%  1.48%
SpecForge HS  97.04%    0.00%  2.96%
StringToken   93.49%    2.34%  4.17%
SpecForge ST  91.15%    3.39%  5.47%
StackAr       98.50%    1.00%  0.50%
SpecForge SA  98.50%    0.00%  1.50%


  • Outperforms SpecForge
SLIDE 34

Results for different property types

           HashSet                        StringTokenizer                StackAr
Property   Accuracy  Precision  Recall    Accuracy  Precision  Recall    Accuracy  Precision  Recall
AF         100.00%   0.00%      0.00%     98.44%    0.00%      0.00%     100.00%   0.00%      0.00%
NF         97.63%    95.46%     73.08%    85.94%    44.44%     50.00%    98.00%    90.00%     90.00%
AP         98.82%    100.00%    85.71%    93.75%    80.00%     57.14%    98.00%    100.00%    81.82%
AIP        100.00%   0.00%      0.00%     100.00%   0.00%      0.00%     100.00%   0.00%      0.00%
AIF        100.00%   0.00%      0.00%     100.00%   0.00%      0.00%     100.00%   0.00%      0.00%
NIF        91.72%    91.30%     58.62%    82.81%    84.62%     55.00%    95.00%    81.48%     100.00%


  • Crowd isn’t great at “never” property types
SLIDE 36

Accuracy comparison

API          SpecForge  CrowdSpec  Expert 1  Expert 2  Expert 3  Experts (voting)  Experts (discussing)
HashSet      97.04%     98.03%     99.61%    98.32%    98.22%    98.42%            100%
StTokenizer  91.15%     93.49%     97.14%    97.92%    98.44%    100.00%           100%
StackAr      98.50%     98.50%     98.17%    96.50%    98.67%    98.67%            100%


  • CrowdSpec improves SpecForge

  • Combo gets close to voting experts

  • But, discussing experts are unbeatable
SLIDE 40

Crowd errors

API Doc. Error
  APIa (Method relation): "These are opposite, unrelated operations." Misunderstood relationship between StackAr methods in the property [push(Object o) AP pop()].
  APIb (Constructor usage): "In HashSet libray, when using ADD, it is acceptable to use HASHSET IMMEDIATELY afterward."
  APIc (Overlooked certain method): "[A] stack cannot be full after its been made logically empty." For the property [makeEmpty() AF isFull() = true], the user overlooks that elements can be added between these calls.
  APId (Method return value): "Returns the same value as the hasMoreTokens method." Confusion about the return value in the property [hasMoreTokens() = true NF countTokens()].
  APIe (Parameter): "if remove(Object o) returns false it means that o is not contained into the set, and an immediate call to remove(Object o) will return false not true."

True Spec Error
  TSa (LTL/true spec definition): "Once all elements are cleared [then] the set is empty." Misunderstood method order in the property [isEmpty() = true AIF clear()].
  TSb (Bad practice): "Bad programming practice, but you can still do it."
  TSc (Single instance requirement): "Well if you wanted to create a second token for a different sting you might call it again." Confused about the task specifying one object instance.

Study Design Error
  SDa (Misunderstanding what to agree/disagree with, or wrong click): "I see no reason why you could not use counttokens right after setting up the tokens." The machine's answer for [StringTokenizer(String str) NIF countTokens()] is false; the user's reasoning is correct, but the user's property response indicates the opposite.
  SDb (Incorrect knowledge transfer): "No, based on response on 1 and 2, it is not recommended to to so." The user's explanation is based on previous questions.

Unclear
  Ua (Nonsense response): "I THINK THIS IS THE CORRECT ANSWER."
  Ub (Unsure): "there may be changes made in between the two calls though I do not see a way to make these changes within StringTokenizer so I am quite unsure but am guessing that this is not [false] because a false measurement means there is nothing left to return a true."


Class               Code   % of errors (count)
API Doc. Error              22% (127)
                    APIa     9% (50)
                    APIb     5% (28)
                    APIc     4% (24)
                    APId     2% (13)
                    APIe     2% (12)
True Spec Error             22% (127)
                    TSa     15% (90)
                    TSb      4% (24)
                    TSc      2% (13)
Study Design Error          19% (113)
                    SDa     18% (107)
                    SDb      1% (6)
Unclear                     37% (215)
                    Ua      36% (209)
                    Ub       1% (6)
Total                      100% (582)

SLIDE 42

CrowdSpec take-aways

Lightweight and scalable approach to mine temporal specs from JavaDoc with a Crowd

  • Improves existing spec-miners
  • Approaches expert-level spec quality

More generally, re-consider:

  • The automation dimension in your work
  • SE research assumptions you can disrupt!

Our evaluation results are online: https://bestchai.bitbucket.io/crowdspecmine-eval/

SLIDE 43

Metrics

Majority rule to determine the crowd’s opinion. We measure:

  • Precision: the percentage of properties that are actually true, of those that are reported to be true.
  • Recall: the percentage of the true properties that are reported to be true.
  • Accuracy: the percentage of correctly mined properties, true and false, in the ground truth.


MEASURES USED IN OUR EVALUATION

                         Ground Truth: True     Ground Truth: False
Crowd Decision: True     True Positive (tp)     False Positive (fp)
Crowd Decision: False    False Negative (fn)    True Negative (tn)
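Written in terms of the cells above, the three metrics reduce to the standard definitions (a restatement of the bullets above, not new results):

    Precision = tp / (tp + fp)
    Recall    = tp / (tp + fn)
    Accuracy  = (tp + tn) / (tp + fp + fn + tn)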

SLIDE 44

Distribution of true instances

Property  HashSet   StringTokenizer  StackAr
AF        0% (0)    0% (0)           0% (0)
NF        8% (13)   13% (8)          10% (10)
AP        8% (14)   11% (7)          11% (11)
AIP       0% (0)    0% (0)           0% (0)
AIF       0% (0)    0% (0)           0% (0)
NIF       17% (29)  31% (20)         22% (22)

SLIDE 45

Study characteristics

Study       HashSet_A  HashSet_B  StToken  StAr
Total cost  $473.75    $473.73    $138.68  $218.05
Duration    2 days     4 days     30 days  17 days

SLIDE 46

Study specifics

STUDY AND PARTICIPANT CHARACTERISTICS

Study Features       HashSet A  HashSet B  StringToken  StackAr
People per task      5          5          3/4/5        3/4/5
Payment              $0.40      $0.40      $0.40        $0.40
Total cost           $473.75    $473.73    $138.68      $218.05
Valid responses      845        845        246          388
Duration             2 days     4 days     30 days      17 days

Quality Control      HashSet A  HashSet B  StringToken  StackAr
Qualification test   yes        yes        yes          yes
# questions          7          7          7            7
Conflict detection   yes        yes        yes          yes
Gold standard        yes        yes        yes          yes
Random click         yes        yes        yes          yes

Participants         HashSet A  HashSet B  StringToken  StackAr
Total participants   39         38         66           55
Male/female/unk      30/9/0     28/8/2     51/15/0      32/23/0
Avg. age             30         31         33           34
% CS degree          74%        74%        68%          60%
Java familiarity     3.87       3.95       3.64         3.51