Bu Budget-aw awar are Ran andom om Testin ing wit ith T3 T3 - PowerPoint PPT Presentation

Bu Budget-aw awar are Ran andom om Testin ing wit ith T3 T3 Bench chmarking at at the SBST2016 Testin ing Tool ool Con ontest Wishnu Prasetya, Utrecht University http://www.cs.uu.nl/~wishnu https://git.science.uu.nl/prase101/t3/wikis/home

T3 • Random testing tool for Java Class • Provide convenient way for user to specify custom test data/generator • Typical use case: • to fastly generate large amount of test sequences • Test suites can be generated interactively • combined interactively: suite = suite1 + suite2 • interactive query • analyzed, e.g. to infer invariants 2

Querying test suite • H = hoare ( {s → s.arg[0] ≤ s.tobj.cutOff ()}, ”calculateTax”, {s → s.retval == 0}) • ltlquery (suite). with (always(H)).valid() • filter (suite). with (eventually(H.antecedent())) 3

Budget aware suite generation • Use case: running automated testing on a whole project, with an overall budget e.g. 1 hour. • Current implementation: pre-calculated fixed budget per class, e.g. 1 minute. • Class-level budgeting: • over inner classes • over test goals per target class 4

Test Goal • Test goal: a public/protected method of CUT. à generate a test suite for it. • All TGs are put in a worklist , to be processed in some order • Process TG m: generate/refine its suite. If not done , put m back in the worklist. • There is a limit on the max. number of this put-backs (in the competition: set to 8) • Repeat until either worklist is empty, or we run out of budget. 5

Refining suites • Let m be a TG. We maintain a test suite S m , generated for m so far. Generate newset of test sequences, each of the form: 𝜏 ++ o.m(...) ++ 𝜐 • Only add a new sequence to S m if it improves coverage. • Keeping in mind: proportionality. 6

Generating prefixes • For efficiency, prefixes are generated collectively and incrementally over all TGs • Maintain a set P of prefixes so we have so far, and only grow it incrementally : • If all TGs of generation k are processed, and worklist is not empty, we grow P by generating K fresh prefixes, but only adding those than can refine P. • Refinement: also keep track “unique” object structures • project object structures to trees • project primitive values to logarithmic representations 7

Processing order policy of the TGs • Random? • Used policy: • when budget is still ok (0.5 B), we just pick the next TG randomly • after that “easier” TG is favored. • linear over generations, to enforce fairness 8

Overall budget policy • CUT-level dynamic budget allocation: • Given a CUT and time budget B0, determine the set of classes in CUT to target. Each C gets is allocated a fragment of B0, proportional to its complexity. • When we are done with C, budget allocation is re-calculated based on remaining time at that moment. • T3 is tuned to use budget considerately, and not aggresively trying to exhaust all budget. 9

Result 60s 120s 240s 480s C M T C M T C M T C M T RAN 54.0 64.1 1439 57.2 67.2 2785 59.7 68.8 5493 62.3 70.5 11181 T3 59.2 74.4 1062 63.6 76.9 1579 64.8 77.9 2052 65.5 78.0 2780 EVO 44.1 63.1 1410 50.2 69.5 2601 60.6 80.0 4870 65.5 83.4 8805 JT 63.5 72.5 1653 68.1 79.9 2832 69.3 79.5 5143 70.8 84.4 9435 On subset of 22 CUTs of the original 80 CUTs in the SBST2016 benchmark, on which no tools crash, and on which the benchmarking tool itself has no issue. 10

Productivity 60s 120s 240s 480s RAN 0.14 (7) 0.06 (18) 0.03 (36) T3 0.51 (2) 0.15 (7) 0.06 (17) EVO 0.31 (3) 0.28 (4) 0.07 (13) JT 0.23 (4) 0.03 (32) 0.02 (48) productivity = additional % coverage gained per additional minute spent. 11

Conclusion & future work • When budget efficiency matters, enforcing a budget control algorithm makes sense. • On big budget, T3’s BCA is justified to stop its effort. • On low budget, T3’s BCA stops too early. Future work: smarter BCA. • Future work: project-level BCA. 12

Bu Budget-aw awar are Ran andom om Testin ing wit ith T3 T3 - PowerPoint PPT Presentation

Bu Budget-aw awar are Ran andom om Testin ing wit ith T3 T3 Bench chmarking at at the SBST2016 Testin ing Tool ool Con ontest Wishnu Prasetya, Utrecht University http://www.cs.uu.nl/~wishnu

Spelling, Punctuation and Grammar Suffixes -ing Year One SPaG | Suffixes -ing Suffixes Suffixes

A r ran andom omize ized, d, double ble-bli blinde nded, d, phas ase e III II study dy

Retrosp specti ctive: Feedback ack-directed Ran andom Test Ge Generation on Carlos Pacheco,

SIG IG1510: Power Your Material Editing wit ith Substance Designer, MDL and Ir Iray Sebastien

Water and Commun unit ity: : A Public ic Forum m on HABs Testin ing for Toxins Assessing

Documentatio ion, , testin ing and debugg gging docstring defensive programming

Sahar hara a Be Beach ch Sahara ara Beach ch Perfec fect place e to connec ect wit ith

A Structured Learnin ing Approach wit ith Neural Condit itio ional Random Fie ield lds for

Takin ing the next xt step: Fosterin ing self lf- determination in in young adult lts wit

MOS OSAIC: : Mas ask k Opt ptimiz imizing ing Solut olution ion Wit With h Proces

Steven Saunders, Director, HIV Prevention, NJDOH errol.saunders@doh.state.nj.us Wit ithout t

PARK OPERATIONS 1 COUNTY BUDGET 2 COUNTY BUDGET 3 COUNTY BUDGET 4 COUNTY BUDGET 5 COUNTY

Enhanc Enhancing ing S S&OP P Per erformanc mance w e wit ith h An Analytics cs

PE PERSIS ISTIN ING G DATA WIT ITH REALM MOBIL ILE DATABASE Adam Fisher Application

Managin ing Email ils wit ith SharePoin int A Practical How To Guide Richard Poole

Characterizing Ext xtragalactic Pre-Main- Sequence Stars wit ith Machine and Deep Learnin ing

Translating the Cantor set by a random Randall Dougherty Jack Lutz R. Daniel Mauldin Jason

Random Regular Graphs and Differential Equations Nick Wormald University of Waterloo Minicourse

Advanced Algorithms (II) Shanghai Jiao Tong University Chihao Zhang March 9th, 2020 Random

Spline approximation of a random process with singularity Konrad Abramowicz Department of

Liouville Quantum gravity and KPZ Scott Sheffield Scaling limits of random planar maps Central

Fuzzy Systems Are Universal . . . Universal Approximators Often, We Can Only . . . Main Idea:

Growing and Shrinking Polygons for Random Testing of Computational Geometry Algorithms

Liouville Quantum Gravity as a Mating of Trees Bertrand Duplantier, Jason Miller, and Scott

Bu Budget-aw awar are Ran andom om Testin ing wit ith T3 T3 - PowerPoint PPT Presentation

Bu Budget-aw awar are Ran andom om Testin ing wit ith T3 T3 Bench chmarking at at the SBST2016 Testin ing Tool ool Con ontest Wishnu Prasetya, Utrecht University http://www.cs.uu.nl/~wishnu

Spelling, Punctuation and Grammar Suffixes -ing Year One SPaG | Suffixes -ing Suffixes Suffixes

A r ran andom omize ized, d, double ble-bli blinde nded, d, phas ase e III II study dy

Retrosp specti ctive: Feedback ack-directed Ran andom Test Ge Generation on Carlos Pacheco,

SIG IG1510: Power Your Material Editing wit ith Substance Designer, MDL and Ir Iray Sebastien

Water and Commun unit ity: : A Public ic Forum m on HABs Testin ing for Toxins Assessing

Documentatio ion, , testin ing and debugg gging docstring defensive programming

Sahar hara a Be Beach ch Sahara ara Beach ch Perfec fect place e to connec ect wit ith

A Structured Learnin ing Approach wit ith Neural Condit itio ional Random Fie ield lds for

Takin ing the next xt step: Fosterin ing self lf- determination in in young adult lts wit

MOS OSAIC: : Mas ask k Opt ptimiz imizing ing Solut olution ion Wit With h Proces

Steven Saunders, Director, HIV Prevention, NJDOH errol.saunders@doh.state.nj.us Wit ithout t

PARK OPERATIONS 1 COUNTY BUDGET 2 COUNTY BUDGET 3 COUNTY BUDGET 4 COUNTY BUDGET 5 COUNTY

Enhanc Enhancing ing S S&amp;OP P Per erformanc mance w e wit ith h An Analytics cs

PE PERSIS ISTIN ING G DATA WIT ITH REALM MOBIL ILE DATABASE Adam Fisher Application

Managin ing Email ils wit ith SharePoin int A Practical How To Guide Richard Poole

Characterizing Ext xtragalactic Pre-Main- Sequence Stars wit ith Machine and Deep Learnin ing

Translating the Cantor set by a random Randall Dougherty Jack Lutz R. Daniel Mauldin Jason

Random Regular Graphs and Differential Equations Nick Wormald University of Waterloo Minicourse

Advanced Algorithms (II) Shanghai Jiao Tong University Chihao Zhang March 9th, 2020 Random

Spline approximation of a random process with singularity Konrad Abramowicz Department of

Liouville Quantum gravity and KPZ Scott Sheffield Scaling limits of random planar maps Central

Fuzzy Systems Are Universal . . . Universal Approximators Often, We Can Only . . . Main Idea:

Growing and Shrinking Polygons for Random Testing of Computational Geometry Algorithms

Liouville Quantum Gravity as a Mating of Trees Bertrand Duplantier, Jason Miller, and Scott

Enhanc Enhancing ing S S&OP P Per erformanc mance w e wit ith h An Analytics cs