Putting the "Science" in Computer Science

  1. Putting the “Science” in Computer Science What makes for a good program, and how can we measure / evaluate programs for “goodness”? Write as many definitions of “good” as you can, and describe how you would measure each one. Firstname Lastname Th. 10 / 13 (Your response)

  2. Given a computational problem: Is there a solution? What is it? How good is it?

  3. Is it efficient?

  4.–13. Data: which algorithm is best? [A sequence of scatter plots of measured cost against Problem Size, annotated "Lower is better"; across the slides the Problem Size axis grows from about 2–8 on slide 4 to about 200–800 on slide 13.]

  14. Interpreting empirical data. Key take-away: it's messy and incomplete! We can measure
• a particular algorithm
• written in a particular language
• as a particular program
• compiled using a particular version of a particular compiler
• with particular settings (e.g., enabling / disabling optimizations)
• running on a particular data set, of a particular size
• on a particular computer
• with particular resources (CPUs, memory, hard drive, …)
• under a particular version of a particular operating system
• in a particular environment, e.g., with other programs running in the background
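To make the "particulars" concrete, here is a minimal sketch (not from the slides) of one such empirical measurement in Racket; the dialect and the placeholder workload sum-to are assumptions for illustration only:

    ;; A hypothetical experiment: wall-clock time for one function,
    ;; on one machine, for a few particular input sizes.
    (define (sum-to n)                         ; placeholder workload
      (if (= n 0) 0 (+ n (sum-to (- n 1)))))

    (for ([n (in-list '(1000 10000 100000))])
      (let ([start (current-inexact-milliseconds)])
        (sum-to n)
        (printf "n = ~a: ~a ms\n" n (- (current-inexact-milliseconds) start))))

Even this tiny experiment bakes in most of the particulars listed above: the language, the runtime version, the machine, and whatever else happens to be running at the same time.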

  15. Interpreting a theoretical model. Key take-away: it's lossy! A theory abstracts away certain details. A cost metric:
• corresponds to one "step"
• highlights the essence of the work, e.g., multiplications, comparisons, function calls…
• serves as a proxy for an empirical measurement
Instead of measuring time, we count steps. e.g., "This algorithm costs n² multiplications."

  16. good data + good theory = good science: we can make predictions, and we can communicate with other scientists.

  17. The spectrum from General to Specific:
• Decidability (CS 81): e.g., Can it be solved at all?
• Complexity Class (CS 140, MA 167): e.g., Can it be solved in polynomial time?
• Asymptotic Analysis, "Big O" (CS 42, CS 70): e.g., O(n) time, where n is list size
• Exact Theory / Recurrence relation: e.g., 7n + 2 multiplications, where n is list size
• Empirical Data (CS 105, HPC): e.g., This run took 17.3 seconds on this data.

  18. Asymptotic Analysis (Big O)

  19. Asymptotic analysis. We're always answering the same question: How does the cost scale (when we try larger and larger inputs)? Not:
• Exactly how many steps will it execute?
• How many seconds will it take?
• How many megabytes of memory will it need?

  20. The informal definition of "Big O": a reasonable upper bound on (an abstraction of) a problem's difficulty or a solution's performance, for reasonably large input sizes.

  21. In the limit (for VERY LARGE inputs):
• O(1): The running time is bounded, regardless of the input size.
• O(n): An input twice as big takes no more than twice as long.
• O(n²): An input twice as big takes no more than four times as long.
• O(2ⁿ): An input one bigger takes no more than twice as long.
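A quick check of the O(n²) and O(2ⁿ) lines (not from the slides): if the cost is roughly c·n², then doubling the input gives c·(2n)² = 4·c·n², four times the cost; if the cost is roughly c·2ⁿ, then one more input element gives c·2ⁿ⁺¹ = 2·c·2ⁿ, twice the cost.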

  22. If We Only Care About Scalability… what are the consequences?
• Constant factors can be ignored: n, 6n, and 200n scale identically ("linearly").
• Small summands can be ignored: n² and n² + n + 999999 are indistinguishable when n is huge.
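For instance (an illustration, not from the slides): at n = 1,000,000, n² is 10¹², while the extra n + 999,999 is only about 2×10⁶, roughly 0.0002% of the total; and doubling n doubles n, 6n, and 200n alike.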

  23. Grouping Algorithms by Scalability:
O(1): takes 6 steps; takes 1 (big) step; no more than 4000 steps; somewhere between 2 and 47 steps, depending on the input.
O(n): takes 100n + 3 steps; takes n/20 + 10,000,000 steps; anywhere between 3 and 68 steps per item, for n items.
O(n²): takes 2n² + 100n + 3 steps; takes n²/17 steps; somewhere between 1 and 40 steps per item, for n² items; anywhere between 1 and 7n steps per item, for n items.

  24. How hard is the problem?
Intractable problems (exponential): O(nⁿ), O(n!), O(2ⁿ)
Tractable problems (polynomial): O(n³), O(n²), O(n log(n)), O(n), O(√n)
No problem!: O(log(n)), O(1)
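To see why the exponential group earns the name intractable (an illustration, not from the slides): at n = 100, an O(n³) algorithm does on the order of 10⁶ steps, which is easy, while 2¹⁰⁰ is about 1.3×10³⁰ steps, far beyond any real machine.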

  25. logs aren't scary! They're our friends. log is the inverse of exponentiation: How many times can I cut N in half? Can I avoid looking at all the input?!
log2(1) = 0 // 2⁰ = 1
log2(2) = 1 // 2¹ = 2
log2(3) ≈ 1.58
log2(4) = 2 // 2² = 4
log2(5) ≈ 2.32
log2(6) ≈ 2.58
log2(7) ≈ 2.81
log2(8) = 3 // 2³ = 8
[Accompanying chart with horizontal axis 1–63; image: s-media-cache-ak0.pinimg.com/736x/5d/f7/6d/5df76d1672ccdffc74af2e2bf55330aa.jpg]
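One way to feel the scale (not from the slides): 2¹⁰ = 1024, so log2 of a million is about 20, which means repeatedly cutting a million-element input in half reaches size 1 in about 20 steps.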

  26. How hard are these problems? For each function, a cost metric and (still blank) its cost:
double: multiplications, ?
sum: additions, ?
half-count: divisions, ?

  27. How hard are these problems?
double: multiplications, O(1)
sum: additions, O(n)
half-count: divisions, O(log n)

  28. What's the cost, T, for each function?

(define (double n)
  (* n 2))

(define (sum n)
  (if (= n 0)
      0
      (+ n (sum (- n 1)))))

(define (half-count n)
  (if (= n 1)
      0
      (+ 1 (half-count (quotient n 2)))))

       double           sum        half-count
       multiplications  additions  divisions
T(0)                               n/a
T(1)
T(2)
T(3)
T(4)
…
T(n)

  29. What's the cost, T, for each function?

(define (double n)
  (* n 2))

(define (sum n)
  (if (= n 0)
      0
      (+ n (sum (- n 1)))))

(define (half-count n)
  (if (= n 1)
      0
      (+ 1 (half-count (quotient n 2)))))

       double           sum        half-count
       multiplications  additions  divisions
T(0)   1                0          n/a
T(1)   1                1          0
T(2)   1                2          1
T(3)   1                3          1
T(4)   1                4          2
…      …                …          …
T(n)   1                n          ⌊log2 n⌋
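One way to spot-check a row of this table, not from the slides: instrument the functions with counters for their cost metrics. The counted-* names and the global counters are made up for this sketch:

    ;; Count only the operations named by each cost metric.
    (define add-count 0)   ; additions performed by counted-sum
    (define div-count 0)   ; divisions performed by counted-half-count

    (define (counted-sum n)
      (if (= n 0)
          0
          (begin
            (set! add-count (+ add-count 1))            ; the + in the original sum
            (+ n (counted-sum (- n 1))))))

    (define (counted-half-count n)
      (if (= n 1)
          0
          (begin
            (set! div-count (+ div-count 1))            ; the quotient in half-count
            (+ 1 (counted-half-count (quotient n 2))))))

    (counted-sum 4)        ; add-count is now 4, matching T(4) = 4 additions
    (counted-half-count 4) ; div-count is now 2, matching T(4) = 2 divisions

The counters are global, so they would need resetting between experiments; this only confirms the T(4) row.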

  30. Recurrence Relations (translating code to math)

  31. Translating recursion to recurrence relations. For a given cost metric (here: additions):
1. Translate the base case(s), using specific input sizes: how many steps does this base case take?
2. Translate the recursive case(s), using input size N: define T(N) in terms of smaller inputs.

(define (sum n)
  (if (= n 0)               ; base case, input size 0
      0
      (+ n (sum (- n 1))))) ; recursive case, input size N

base case      → T(0) = 1
recursive case → T(N) = 3 + T(N-1)

  32. Translating recursion to recurrence relations. For a given cost metric (here: additions):
1. Translate the base case(s), using specific input sizes: how many steps does this base case take?
2. Translate the recursive case(s), using input size N: define T(N) in terms of smaller inputs.

(define (sum n)
  (if (= n 0)               ; base case, input size 0
      0
      (+ n (sum (- n 1))))) ; recursive case, input size N

base case      → T(0) = 0
recursive case → T(N) = 1 + T(N-1)

Expanding the recurrence:
T(N) = 1 + T(N-1)                   = 1·1 + T(N-1)
T(N) = 1 + 1 + T(N-2)               = 2·1 + T(N-2)
T(N) = 1 + 1 + 1 + T(N-3)           = 3·1 + T(N-3)
…
T(N) = 1 + 1 + 1 + … + 1 + T(N-N)   = N·1 + T(N-N) = N

closed form:     T(N) = N
asymptotic form: T(N) ∈ O(N)
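As a small sanity check (not from the slides), the recurrence can also be typed in directly and compared with the closed form; the name T simply reuses the slide's notation as a Racket function:

    ;; T(0) = 0, T(N) = 1 + T(N-1), computed literally.
    (define (T n)
      (if (= n 0)
          0
          (+ 1 (T (- n 1)))))

    (T 10)    ; => 10
    (T 1000)  ; => 1000, agreeing with the closed form T(N) = N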

  33. Translating recursion to recurrence relations. For a given cost metric (here: arithmetic operations and comparisons):
1. Translate the base case(s), using specific input sizes: how many steps does this base case take?
2. Translate the recursive case(s), using input size N: define T(N) in terms of smaller inputs.

(define (sum n)
  (if (= n 0)               ; base case, input size 0
      0
      (+ n (sum (- n 1))))) ; recursive case, input size N

base case      → T(0) = 1
recursive case → T(N) = 2 + T(N-1)
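Applying the same expansion as on the previous slide to the recurrence stated here (not worked out on the slides): T(N) = 2 + T(N-1) = 2·2 + T(N-2) = … = 2N + T(0) = 2N + 1, which is again in O(N).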
