SLIDE 1

Optimizing unit test execution in large software programs using dependency analysis

Taesoo Kim, Ramesh Chandra and Nickolai Zeldovich

MIT CSAIL

SLIDE 2

Running unit tests takes too long

It’s our policy to make sure all tests pass at all times.

  • Large software programs often require running the full unit tests for each commit

  • But, unit tests take about 10 min in Django
  • With our work, it can be done within 2 sec!
SLIDE 3

Current approaches for shortening testing time

  • Modular unit tests (e.g., testsuite)
    – Run a certain set of unit tests that might be affected
  • Test bot (e.g., gtest, autotest)
    – Run unit tests remotely and get the results back

SLIDE 4

Problem: current approaches are very limited

  • Manual effort is involved
    – Maintaining multiple test suites
  • Overall testing still takes too long
    – Waiting for the test bot to complete full unit testing

SLIDE 5

Research: regression test selection (RTS)

  • Goal: run only the necessary tests instead of the full test suite
    – Identify test cases whose results might change due to the current code modification
    – Step 1: analyze test cases (e.g., execution traces)
    – Step 2: syntactically analyze code changes
    – Step 3: output the affected test cases (sketched below)

[Diagram: code changes and test cases go into the RTS box, which outputs the affected test cases]
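Once steps 1 and 2 are done, the selection step reduces to a set intersection. A minimal sketch in Python, with hypothetical names (dep_info, changed) rather than any specific RTS tool's API:

    def select_affected_tests(dep_info, changed):
        """dep_info: map from test case name to the set of code
        entities it was observed to depend on (step 1).
        changed: set of entities modified by the commit (step 2).
        Returns the test cases whose results might change (step 3)."""
        return {test for test, deps in dep_info.items() if deps & changed}

    # Example: only test_b depends on an entity that changed.
    dep_info = {"test_a": {"foo"}, "test_b": {"foo", "bar"}}
    print(select_affected_tests(dep_info, {"bar"}))   # {'test_b'}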

SLIDE 6

Problem: RTS techniques are never adopted in practice

  • The “soundness” of RTS techniques kills adoption
    – Soundness means no false negatives
    – Sound techniques impose non-negligible perf. overheads (analysis/runtime)
    – They select lots of test cases (particularly in dynamic languages)
    – e.g., a change to a global variable → rerun all test cases

SLIDE 7

Goal: make RTS practical

  • Idea 1: trade off soundness for performance
    – Keep track of function-level dependencies / changes
    – Fewer tests selected, but possibly with false negatives
  • Idea 2: integrate test optimization into the dev. cycle
    – Maintain dependency information in the code repository

SLIDE 8

Current development cycle

[Diagram: the programmer's computer checks out code (<HEAD>) from the repository server into a local repo. and source tree; the development cycle then loops: make changes → run unit tests → get test results.]

SLIDE 11

New development cycle

[Diagram: as before, the programmer checks out code and makes changes; now a diff of the changes, together with per-test-case information, feeds a dependency-analysis step that outputs the affected test cases; unit testing runs only those, producing the test results.]

SLIDE 13

Identifying the test cases affected by a code modification

  • Plan: track which tests execute which functions
    – Step 1: generate function-level dependency info. (see the sketch below)
      • Map: test case ↔ invoked functions
      • Construct the map by running all unit tests once
    – Step 2: identify the modified functions, given the code changes
    – Step 3: identify the tests that ran the modified functions
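A minimal sketch of step 1, built on Python's sys.settrace() (the same interface TAO's prototype uses, per the implementation slide); trace_test and the key format are illustrative, not TAO's actual code:

    import sys

    def trace_test(test_func):
        """Run one test case and record every function invoked
        while it executes, keyed as 'filename:function'."""
        invoked = set()
        def tracer(frame, event, arg):
            if event == "call":                 # a new function frame was entered
                code = frame.f_code
                invoked.add("%s:%s" % (code.co_filename, code.co_name))
            return tracer
        sys.settrace(tracer)
        try:
            test_func()
        finally:
            sys.settrace(None)                  # always stop tracing
        return invoked

    # Bootstrap: run the full suite once to build the map
    # (test case <-> invoked functions); all_test_cases is hypothetical.
    # dep_info = {t.id(): trace_test(t) for t in all_test_cases}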


SLIDE 15

Bootstrapping dependency info.

[Diagram: the dependency info. is bootstrapped by running the full unit tests once; it is stored on a dependency server and checked out (<HEAD>) alongside the code from the repository server.]

SLIDE 17

Update dependency information

[Diagram: for each new commit (e.g., <0xac0ffee>), running the selected unit tests also produces incremental dep. info, which is pushed to the dependency server so it tracks the repository's <HEAD>.]

SLIDE 19

Problem: false negatives

  • Function-level tracking can miss some dependencies and cause false negatives
    – i.e., it fails to identify some test cases that are actually affected
  • We identified five types of missing dependencies:
    – Inter-class dependency
    – Non-determinism
    – Class variable
    – Global scope
    – Lexical dependency


SLIDE 21

Example: inter-class dep. in Python

class A:
    def foo(self):
        return 1

class B(A):
    pass

def testcase():
    assertEqual(B().foo(), 1)

Dependency info:
    testcase() → B.__init__(), A.foo()

Code change:

 class B(A):
-    pass
+    def foo(self):
+        return 2

Modified functions: B.foo()
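The miss is easy to reproduce: while the test runs, the executing frame belongs to A.foo's code object (B inherits it), so the dependency map never mentions B.foo. A small illustration of the lookup, separate from TAO's actual bookkeeping:

    class A:
        def foo(self):
            return 1

    class B(A):
        pass

    # B has no foo of its own; attribute lookup falls through to A,
    # so a call tracer records A.foo, never "B.foo".
    assert B().foo.__func__ is A.foo

    # After the commit adds B.foo(), the modified function is "B.foo",
    # which matches nothing in testcase()'s recorded dependencies,
    # so testcase() is (wrongly) not selected: a false negative.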

SLIDE 24

Example: missing dep. because of non-determinism in Python

def foo():
-    return 1
+    return 2

def testcase():
    if rand() % 2:
        assertEqual(foo(), 1)

Dependency info (varies across runs):
    testcase() → rand(), foo()
    testcase() → rand()

Modified functions: foo()

If the bootstrap run happened to skip foo(), the recorded dependency is only rand(), so the change to foo() does not select testcase().
SLIDE 26

Example: class-var. dep. in Python

class C:
-    a = 1
+    a = 2

def foo():
    return C.a

def testcase():
    assertEqual(foo(), 1)

Dependency info:
    testcase() → foo()

Modified functions: N/A
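Here the AST diff reports no modified function at all: "a = 2" is an assignment at class scope, not a function body. One possible mitigation, sketched below and not part of TAO as presented, is to also diff class-level assignments so a changed class variable can be flagged like a modified function:

    import ast

    def class_assignments(source):
        """Map (class name, attribute) to the AST dump of the assigned
        value, for assignments made at class scope."""
        out = {}
        for node in ast.walk(ast.parse(source)):
            if isinstance(node, ast.ClassDef):
                for stmt in node.body:
                    if isinstance(stmt, ast.Assign):
                        for target in stmt.targets:
                            if isinstance(target, ast.Name):
                                out[(node.name, target.id)] = ast.dump(stmt.value)
        return out

    old = class_assignments("class C:\n    a = 1\n")
    new = class_assignments("class C:\n    a = 2\n")
    print({k for k in old if old[k] != new.get(k)})   # {('C', 'a')}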

SLIDE 27

Solution: a test server runs all tests asynchronously

[Diagram: a test server also checks out each change and runs the full unit tests asynchronously, off the programmer's critical path, generating incremental dep. info for the dependency server.]

SLIDE 28

Test server also verifies dep. info

[Diagram: in addition to full unit testing, the test server runs unit testing on each change to verify the incremental dep. info before it reaches the dependency server.]
SLIDE 29

TAO: a prototype for PyUnit

[Diagram: the complete TAO architecture: repository server, dependency server, and test server, integrated with the programmer's development cycle of changes → dependency analysis → unit testing → test results.]

SLIDE 30

Implementation

  • TAO: a prototype for PyUnit
    – Extends the standard python-unittest library
    – Patch analysis: using the ast/diff Python modules (see the sketch below)
    – Dependency tracking: using the settrace() interface
    – 800 lines of code in Python
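A minimal sketch of the patch-analysis step, assuming the before/after file contents are available; the helpers are illustrative rather than TAO's actual implementation:

    import ast

    def function_dumps(source):
        """Map a dotted function name ('Class.method' or 'func')
        to the AST dump of its definition."""
        out = {}
        def visit(node, prefix):
            for child in ast.iter_child_nodes(node):
                if isinstance(child, ast.FunctionDef):
                    out[prefix + child.name] = ast.dump(child)
                    visit(child, prefix + child.name + ".")
                elif isinstance(child, ast.ClassDef):
                    visit(child, prefix + child.name + ".")
                else:
                    visit(child, prefix)
        visit(ast.parse(source), "")
        return out

    def modified_functions(old_src, new_src):
        """Step 2: the functions whose definitions differ between
        the old and new versions of a file."""
        old, new = function_dumps(old_src), function_dumps(new_src)
        return {f for f in old.keys() | new.keys() if old.get(f) != new.get(f)}

    print(modified_functions("def f():\n    return 1\n",
                             "def f():\n    return 2\n"))   # {'f'}

Comparing ast.dump() output (which omits line/column attributes) makes the check insensitive to pure movement of code, so whitespace-only or reordering commits select no tests.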

SLIDE 31

Evaluation

  • How many functions are modified in each commit in large software programs?
  • How much testing time can be saved as a result?
  • How many false negatives does TAO incur?
  • What is the overall runtime overhead of TAO?
SLIDE 32

Experiment setup

  • Two popular projects: Django and Twisted
    – Django: a web application framework
    – Twisted: a network protocol engine
    – Use the existing unit tests of both projects
    – Integrate TAO into both projects
    – Analyze the latest 100 commits of each project

SLIDE 33

A small number of functions are modified in each commit

  • Django: 50.8 / 13k functions (0.3%) per commit on average
  • Twisted: 18.2 / 23k functions (0.07%) per commit on average

[Plot: modified functions per commit, Django and Twisted, recent 100 commits]


SLIDE 37

A small number of test cases need to be rerun

  • Django: 50.4 / 5k test cases (1.0%) per commit on average
  • Twisted: 28.7 / 7k test cases (0.4%) per commit on average

[Plot: affected test cases per commit, Django and Twisted, recent 100 commits]


SLIDE 39

Trend 1: #affected test cases is correlated with #modified functions

[Plot: Django, recent 100 commits]

SLIDE 40

Trend 2: many modified functions, few affected test cases

  • Refactoring (maintenance) commits: e.g., unittest2()

[Plot: Django, recent 100 commits]

SLIDE 42

Trend 3: few modified functions, many affected test cases

  • Changes in “hot” funcs: e.g., WSGIRequest()

[Plot: Django, recent 100 commits]

SLIDE 44

TAO can improve the overall execution time for unit testing

Project   #Test cases (All / TAO)   Execution time (All / TAO)
Django    5,166 / 50.8              520.3s / 1.7s
Twisted   7,150 / 28.7              72.1s / 2.2s

  • Django: 520.3s → 1.7s (5k → 50.8 test cases)
  • Twisted: 72.1s → 2.2s (7k → 28.7 test cases)

SLIDE 45

TAO has few false negatives (FN)

Project   FN/I (inter-class)   FN/N (non-det.)   FN/G (global scope)   FN/C (class var.)   FN/L (lexical dep.)
Django    0/0                  0/0               2/8                   1/3                 1/23
Twisted   1/2                  0/0               1/20                  1/17                0/11

  • We manually identified the types of missing dependencies and the false negatives in each commit
  • Django: 3 false negatives (one commit is counted in both G and L)
  • Twisted: 3 false negatives
  • Reading FN/C, for example: among the class-variable deps we identified, how many actually end up causing false negatives?

SLIDE 47

Example: not all missing deps cause false negatives

class DecimalField(IntegerField):
    default_error_messages = {
        ...
-       'max_digits': _(msg),
+       'max_digits': ungettext_lazy(msg),
        ...
    }
    def __init__(...):
        ...
-       raise ValidationError(oldmsg)
+       raise ValidationError(newmsg)

The change to the class variable (default_error_messages) is a missing dep., but the same commit also modifies __init__(), which the function-level dependency does track, so the affected tests are still selected.

SLIDE 48

Dependency tracking imposes performance overheads

Project   Runtime (no TAO / TAO)   Storage (full / incremental)
Django    520.3s / 1,129.1s        9.9MB / 270KB
Twisted   72.1s / 115.6s           1.3MB / 280KB

  • Django: ~10 min (+117%) to generate the dep. info (9.9MB)
  • Twisted: <1 min (+60%) to generate the dep. info (1.3MB)
  • Performance could be improved by implementing function-level tracing natively, instead of using the settrace() interface.

SLIDE 49

Incremental dependency information is small

  • Django: 270KB of incremental dep. info per commit
  • Twisted: 280KB of incremental dep. info per commit
SLIDE 50

Related work

  • Regression test selection:
    – RTS survey [Biswas '11]: overview of available RTS techniques
    → Simple function-level dependency is effective in practice
    → TAO can be integrated into the programmer's workflow
  • Dependency tracking:
    – Poirot [Kim '12]: intrusion recovery
    – TaintDroid [Enck '12]: privacy monitoring
    → Dependency tracking can optimize unit test execution

SLIDE 51

Summary

TAO: a system that optimizes unit test execution using dependency analysis

  – Tracks the function-level dependencies of each unit test
  – Analyzes code changes to find the affected test cases
  – Runs only the affected test cases (with few false negatives)
  – Integrated into the programmer's development cycle