Learning objectives Understand how automated program analysis - PowerPoint PPT Presentation

Learning objectives • � Understand how automated program analysis complements testing and manual inspection – � Most useful for properties that are difficult to test Program Analysis • � Understand fundamental approaches of a few representative techniques – � Lockset analysis, pointer analysis, symbolic testing, dynamic model extraction: A sample of contemporary techniques across a broad spectrum – � Recognize the same basic approaches and design trade-offs in other program analysis techniques (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 1 (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 2 Why Analysis Why automated analysis • � Exhaustively check properties that are difficult • � Manual program inspection to test – � effective in finding faults difficult to detect with testing – � Faults that cause failures – � But humans are not good at • � rarely • � under conditions difficult to control • � repetitive and tedious tasks – � Examples • � maintaining large amounts of detail • � Automated analysis • � race conditions • � faulty memory accesses – � replace human inspection for some class of faults • � Extract and summarize information for – � support inspection by inspection and test design • � automating extracting and summarizing information • � navigating through relevant information (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 3 (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 4

Static vs dynamic analysis Concurrency faults • � Concurrency faults • � Static analysis – � deadlocks: threads blocked waiting each other on a lock – � examine program source code – � data races: concurrent access to modify shared resources • � examine the complete execution space • � Difficult to reveal and reproduce • � but may lead to false alarms – � nondeterministic nature does not guarantee repeatibility • � Dynamic analysis • � Prevention – � Programming styles – � examine program execution traces • � eliminate concurrency faults by restricting program constructs • � no infeasible path problem • � examples • � but cannot examine the execution space exhaustively – � do not allow more than one thread to write to a shared item – � provide programming constructs that enable simple static checks (e.g., Java synchronized) • � Some constructs are difficult to check statically • � example – � C and C++ libraries that implement locks (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 5 (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 6 Memory faults Example • � Dynamic memory access and allocation faults } else if (c == '%') { int digit_high = Hex_Values[*(++eptr)]; – � null pointer dereference int digit_low = Hex_Values[*(++eptr)]; – � illegal access • � fault – � memory leaks – � input string terminated by an hexadecimal digit • � Common faults – � scan beyond the end of the input string and corrupt – � buffer overflow in C programs memory – � access through dangling pointers – � failure may occur much after the execution of the faulty statement – � slow leakage of memory • � hard to detect • � Faults difficult to reveal through testing – � memory corruption may occur rarely – � no immediate or certain failure – � lead to failure more rarely (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 7 (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 8

Memory Access Failures Symbolic Testing (explicit deallocation of memory - C,C++) • � Summarize values of variables with few • � Dangling pointers: deallocating memory accessible through pointers symbolic values • � Memory leak: failing to deallocate memory not accessible any more – � example: analysis of pointers misuse – � no immediate failure – � may lead to memory exhaustion after long periods of execution • � Values of pointer variables: null, notnull, invalid, unknown • � escape unit testing • � other variables represented by constraints • � show up only in integration, system test, actual use • � Use symbolic execution to evaluate conditional • � can be prevented by using – � program constructs statements • � saferC (dialect of C used in avionics applications) limited use of dynamic memory allocation -> eliminates dangling pointers and memory leaks • � Do not follow all paths, but (restriction principle) – � analysis tools – � explore paths to a limited depth • � Java dynamic checks for out-of-bounds indexing and null pointer dereferences – � prune exploration by some criterion (sensitivity principle) – � Automatic storage deallocation (garbage collection) (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 9 (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 10 Path Sensitive Analysis Summarizing Execution Paths • � Different symbolic states from paths to the same location • � Find all program faults of a certain kind • � Partly context sensitive – � no prune exploration of certain program paths (depends on procedure call and return sequences) (symbolic testing) • � Strength of symbolic testing – � abstract enough to fold the state space down to a combine path and context sensitivity size that can be exhaustively explored • � detailed description of how a particular execution sequence leads to • � Example: a potential failure analyses based on finite state machines (FSM) • � very costly • � reduce costs by memoizing entry and exit conditions – � data values by states – � limited effect of passed values on execution – � operations by state transitions – � explore a new path only when the entry condition differs from previous ones (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 11 (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 12

Pointer Analysis Merging States • � Pointer variable represented by a machine with three • � Flow analysis states: merge states obtained along different execution paths – � invalid value – � conventional data flow analysis: merge all states encountered – � possibly null value at a particular program location – � definitely not null value – � FSM: summarize states reachable along all paths with a set of • � Deallocation triggers transition from non-null to invalid states • � Finite state verification techniques • � Conditional branches may trigger transitions never merge states (path sensitive) – � E.g., testing a pointer for non-null triggers a transition from possibly null to definitely non-null – � procedure call and return: • � Potential misuse • � complete path- and context-sensitive analysis � too expensive • � throwing away all context information � too many false alarms – � Deallocation in possibly null state • � symbolic testing: cache and reuse (entry, exit) state pairs – � Dereference in possibly null – � Dereference in invalid states (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 13 (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 14 Buffer Overflow Dynamic Memory Analysis (with Purify) … [I] Starting main int main (int argc, char *argv[]) { [E] ABR: Array bounds read in printf {1 occurrence} char sentinel_pre[] = "2B2B2B2B2B"; Reading 11 bytes from 0x00e74af8 (1 byte at 0x00e74b02 illegal) char subject[] = "AndPlus+%26%2B+%0D%"; Address 0x00e74af8 is at the beginning of a 10 byte block Address 0x00e74af8 points to a malloc'd block in heap 0x00e70000 char sentinel_post[] = "26262626"; Output parameter Thread ID: 0xd64 char *outbuf = (char *) malloc(10); ... of fixed length int return_code; [E] ABR: Array bounds read in printf {1 occurrence} Reading 11 bytes from 0x00e74af8 (1 byte at 0x00e74b02 illegal) Can overrun the Address 0x00e74af8 is at the beginning of a 10 byte block printf("First test, subject into outbuf\n"); Address 0x00e74af8 points to a malloc'd block in heap 0x00e70000 return_code = cgi_decode(subject, outbuf); output buffer Thread ID: 0xd64 printf("Original: %s\n", subject); ... [E] ABWL: Late detect array bounds write {1 occurrence} printf("Decoded: %s\n", outbuf); Memory corruption detected , 14 bytes at 0x00e74b02 printf("Return code: %d\n", return_code); Address 0x00e74b02 is 1 byte past the end of a 10 byte block at 0x00e74af8 Address 0x00e74b02 points to a malloc'd block in heap 0x00e70000 63 memory operations and 3 seconds since last-known good heap state printf("Second test, argv[1] into outbuf\n"); Detection location - error occurred before the following function call printf("Argc is %d\n", argc); printf [MSVCRT.dll] Identifies assert(argc == 2); ... return_code = cgi_decode(argv[1], outbuf); Allocation location the problem malloc [MSVCRT.dll] printf("Original: %s\n", argv[1]); ... printf("Decoded: %s\n", outbuf); [I] Summary of all memory leaks... {482 bytes, 5 blocks} printf("Return code: %d\n", return_code); ... [I] Exiting with code 0 (0x00000000) }… Process time: 50 milliseconds [I] Program terminated ... (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 15 (c) 2007 Mauro Pezzè & Michal Young Ch 19, slide 16

Learning objectives Understand how automated program analysis - PowerPoint PPT Presentation

Learning objectives Understand how automated program analysis complements testing and manual inspection Most useful for properties that are difficult to test Program Analysis Understand fundamental approaches of a few

Objectives Objectives Objectives Objectives Learning Learning Learning Learning

The Learning Tree Workshop: The Learning Tree Workshop: Experience-based Learning Series on

PVMD Delft University of Technology Learning objectives Improved light calibration Learning

Machine Learning 11 AI Slides (6e) c Lin Zuoquan@PKU 1998-2020 11 1 11 Machine Learning

What is mobile learning, mobile learning policies and technologies Dr. Mohamed Ally Learning

Standards for Professional Learning Standards for Professional Learning Learning objectives

Designs Learning designs Learning objectives Learners will be able to Provide a

Trigeminal Autonomic Cephalalgias Learning Objectives Learning Objectives At the conclusion

Learning Objectives Learning Objectives Overuse Injuries in Overuse Injuries in Endurance

Rate vs temporal code about synchrony Learning objectives: Learning objectives: To

Testing Object Oriented Software Chapter 15 p Learning objectives Learning objectives

PVMD Ren van Swaaij Delft University of Technology Learning objectives Why use a

Year 7 Learning Evening 2017 W elcome! Year 7 Learning Evening 2017 Year 7 Learning Evening

Learning is a never-ending process Tasks come and go, but learning is forever Learn more e ff

Welcome to the Machine Learning Toolbox! Machine Learning Toolbox Supervised learning caret

A Gentle Introduction to Machine Learning Supervised learning, unsupervised learning (very

21 st Century Office Showcase new ways of working enabled by technology

Partiality and Transparent Intensional Logic Logika: systmov rmec rozvoje oboru v R a

HMO Licensing The Oxford Way Adrian Chowns HMO Enforcement Team Manager

Tackling the need for affordable housing through a local housing company Louise Strongitharm

Reconstructing proprietary video streaming algorithms Maximilian Grner , Melissa Licciardello,

ARIES: Using Annular-Ring Embedded Resistors to Set Capacitor ESR in Power Distribution Networks

Video Delivery in Hybrid Network draft-huang-dispatch-hybrid-video-delivery-00 ietf99@Prague

Building a large scale CDN with Apache Trafficserver Jan