4. Algorithmen und Datenstrukturen (Algorithms and Data Structures)


SLIDE 1
4. Algorithmen und Datenstrukturen

Algorithms and Data Structures, Overview [Cormen et al., Chap. 1; Ottman/Widmayer, Chap. 1.1]

53

SLIDE 2

Algorithm

A well-defined procedure to compute output data from input data.

54

SLIDE 3

Example Problem: Sorting

Input: a sequence of n numbers (comparable objects) (a1, a2, . . . , an)
Output: a permutation (a′1, a′2, . . . , a′n) of the sequence (ai)1≤i≤n such that a′1 ≤ a′2 ≤ · · · ≤ a′n

Possible inputs: (1, 7, 3), (15, 13, 12, −0.5), (999, 998, 997, 996, . . . , 2, 1), (1), (), . . .
Every example represents a problem instance.
The performance (speed) of an algorithm usually depends on the problem instance. Often there are "good" and "bad" instances.
Therefore we sometimes consider algorithms "in the average" and most often in the "worst case".
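To make the specification concrete, here is a minimal Java sketch (added here, not part of the slides) that sorts one of the example instances with the library sort; the output is a non-decreasing permutation of the input.

```java
import java.util.Arrays;

public class SortingInstance {
    public static void main(String[] args) {
        // One problem instance from the slide.
        double[] a = {15, 13, 12, -0.5};
        // Output: a permutation a' of a with a'1 <= a'2 <= ... <= a'n.
        double[] sorted = a.clone();
        Arrays.sort(sorted); // O(n log n) library sort
        System.out.println(Arrays.toString(sorted)); // [-0.5, 12.0, 13.0, 15.0]
    }
}
```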

55

SLIDE 4

Examples of algorithmic problems

Tables and statistics: sorting, selection and searching
Routing: shortest path algorithms, heap data structure
DNA matching: dynamic programming
Evaluation order: topological sorting
Autocompletion and spell-checking: dictionaries / trees
Fast lookup: hash tables
The travelling salesman: dynamic programming, minimum spanning tree, simulated annealing

56

SLIDE 5

Characteristics

Extremely large number of potential solutions
Practical applicability

57

SLIDE 6

Data Structures

A data structure is a particular way of organizing data in a computer so that they can be used efficiently (in the algorithms operating on them).
Programs = algorithms + data structures.

58

SLIDE 7

Efficiency

If computers were infinitely fast and had an infinite amount of memory ...
... then we would still need the theory of algorithms (only) for statements about correctness (and termination).
Reality: resources are bounded and not free:
Computing time → Efficiency
Storage space → Efficiency
Actually, this course is almost exclusively about efficiency.

59

SLIDE 8

Hard problems.

NP-complete problems: no efficient solution is known (the existence of such a solution is very improbable, but it has not yet been proven that there is none!).
Example: the travelling salesman problem.
This course is mostly about problems that can be solved efficiently (in polynomial time).

60

SLIDE 9
5. Efficiency of Algorithms

Efficiency of Algorithms, Random Access Machine Model, Function Growth, Asymptotics [Cormen et al., Chap. 2.2, 3, 4.2-4.4; Ottman/Widmayer, Chap. 1.1]

61

SLIDE 10

Efficiency of Algorithms

Goals:
Quantify the runtime behavior of an algorithm independently of the machine.
Compare the efficiency of algorithms.
Understand the dependence on the input size.

62

SLIDE 11

Programs and Algorithms

Technology: a program, implemented in a programming language, specified for a computer.
Abstraction: an algorithm, specified in pseudo-code, based on a computation model.

63

SLIDE 12

Technology Model

Random Access Machine (RAM) Model
Execution model: instructions are executed one after the other (on one processor core).
Memory model: constant access time (one big array).
Fundamental operations: computations (+, −, ·, ...), comparisons, assignment / copy on machine words (registers), flow control (jumps).
Unit cost model: each fundamental operation has cost 1.
Data types: fundamental types such as size-limited integers or floating point numbers.
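As a toy illustration of the unit cost model (my own example, not from the slides), the following Java sketch sums an array while counting the fundamental operations it performs; under the RAM model each of them costs 1, so the total cost grows linearly with n.

```java
public class UnitCost {
    public static void main(String[] args) {
        int[] a = {4, 1, 7, 3, 9};
        long cost = 0;   // counts fundamental operations (unit cost model)
        int sum = 0;     // assignment: cost 1
        cost++;
        for (int i = 0; i < a.length; i++) {
            cost++;      // comparison i < a.length
            sum += a[i]; // memory access + addition + assignment
            cost += 3;
            cost++;      // increment of i
        }
        cost++;          // final failing comparison
        System.out.println("sum = " + sum + ", cost = " + cost + " (about 5n + 2)");
    }
}
```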

64

SLIDE 13

For Dynamic Data Structures

Pointer Machine Model
Objects of bounded size can be dynamically allocated in constant time.
Fields (of word size) of the objects can be accessed in constant time.
[Figure: a singly linked list, top → xn → xn−1 → . . . → x1 → null]
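A minimal Java sketch of the pointer-machine picture above (illustrative, not from the slides): each node is a bounded-size object holding one value and one pointer, and following the pointers from top reaches xn, xn−1, . . . , x1 and finally null.

```java
public class PointerMachine {
    // A bounded-size object: one field for the value, one for the successor pointer.
    static final class Node {
        final int value;
        final Node next;
        Node(int value, Node next) { this.value = value; this.next = next; }
    }

    public static void main(String[] args) {
        // Build top -> x3 -> x2 -> x1 -> null; each allocation takes constant time.
        Node top = null;
        for (int i = 1; i <= 3; i++) {
            top = new Node(i, top); // x_i becomes the new head
        }
        for (Node p = top; p != null; p = p.next) {
            System.out.print("x" + p.value + " -> ");
        }
        System.out.println("null"); // prints: x3 -> x2 -> x1 -> null
    }
}
```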

65

SLIDE 14

Asymptotic behavior

The exact running time of an algorithm can normally not be predicted, even for small input data.
We consider the asymptotic behavior of the algorithm and ignore all constant factors.
An operation with cost 20 is no worse than one with cost 1.
Linear growth with gradient 5 is as good as linear growth with gradient 1.

66

SLIDE 15

Algorithms, Programs and Execution Time

Program: a concrete implementation of an algorithm.
Execution time of the program: a measurable value on a concrete machine; it can be bounded from above and below.
Example: on a 3 GHz computer, the maximal number of operations per cycle (e.g. 8) gives a lower bound on the time per operation, and a single operation certainly never takes longer than a day, which gives an upper bound.
From the perspective of the asymptotic behavior of the program, these bounds are unimportant.

67

SLIDE 16

5.2 Function growth

O, Θ, Ω [Cormen et al., Chap. 3; Ottman/Widmayer, Chap. 1.1]

68

SLIDE 17

Superficially

We use asymptotic notation to specify the execution time of algorithms. We write Θ(n²) and mean that the algorithm behaves for large n like n²: when the problem size is doubled, the execution time is multiplied by four.

69

SLIDE 18

More precisely: asymptotic upper bound

Given: a function g : ℕ → ℝ.
Definition:¹
O(g) = {f : ℕ → ℝ | ∃ c > 0, ∃ n0 ∈ ℕ : ∀ n ≥ n0 : 0 ≤ f(n) ≤ c · g(n)}
Notation: O(g(n)) := O(g(·)) = O(g).

¹ In words: the set of all functions f : ℕ → ℝ with the property that there exist some (real-valued) c > 0 and some n0 ∈ ℕ such that 0 ≤ f(n) ≤ c · g(n) for all n ≥ n0.
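A small worked example of the definition (added here; the constants are my own choice): to see that 3n² + 10n ∈ O(n²), pick c = 4 and n0 = 10.

```latex
% Claim: f(n) = 3n^2 + 10n \in O(n^2).
% Choose c = 4 and n_0 = 10. Then for all n \ge n_0 (so 10 \le n):
0 \le 3n^2 + 10n \le 3n^2 + n \cdot n = 4n^2 = c \cdot n^2 .
```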

70

SLIDE 19

Graphic

[Figure: plot of g(n) = n² together with two functions f ∈ O(g) and h ∈ O(g); the threshold n0 is marked.]

71

SLIDE 20

Converse: asymptotic lower bound

Given: a function g : ℕ → ℝ.
Definition:
Ω(g) = {f : ℕ → ℝ | ∃ c > 0, ∃ n0 ∈ ℕ : ∀ n ≥ n0 : 0 ≤ c · g(n) ≤ f(n)}

72

SLIDE 21

Example

[Figure: plot of g(n) = n together with two functions f ∈ Ω(g) and h ∈ Ω(g); the threshold n0 is marked.]

73

SLIDE 22

Asymptotic tight bound

Given: a function g : ℕ → ℝ.
Definition:
Θ(g) := Ω(g) ∩ O(g).
Simple, closed form: exercise.

74

SLIDE 23

Example

[Figure: plot of g(n) = n², a function f ∈ Θ(n²), and h(n) = 0.5 · n².]

75

SLIDE 24

Notions of Growth

O(1) bounded: array access
O(log log n) doubly logarithmic: interpolated search on sorted data
O(log n) logarithmic: binary search on sorted data
O(√n) like the square root: naive primality test
O(n) linear: naive search in unsorted data
O(n log n) superlinear / loglinear: good sorting algorithms
O(n²) quadratic: simple sorting algorithms
O(n^c) polynomial: matrix multiplication
O(2ⁿ) exponential: travelling salesman with dynamic programming
O(n!) factorial: travelling salesman naively

76

SLIDE 25

Small n

[Plot for small n (x up to about 6): curves ln n, n, n², n⁴, 2ⁿ]

77

SLIDE 26

Larger n

[Plot for larger n (n up to 20, values up to 10⁶): curves log n, n, n², n⁴, 2ⁿ]

78

SLIDE 27

“Large” n

[Plot for “large” n (n up to 100, values up to 10²⁰): curves log n, n, n², n⁴, 2ⁿ]

79

SLIDE 28

Logarithms

[Plot (n up to 50, values up to 1,000): curves n, n², n^(3/2), log n, n log n]

80

SLIDE 29

Time Consumption

Assumption: 1 operation = 1 µs.

problem size        1        100        10000          10⁶           10⁹
log₂ n              1 µs     7 µs       13 µs          20 µs         30 µs
n                   1 µs     100 µs     1/100 s        1 s           17 minutes
n log₂ n            1 µs     700 µs     13/100 s       20 s          8.5 hours
n²                  1 µs     1/100 s    1.7 minutes    11.5 days     317 centuries
2ⁿ                  1 µs     10¹⁴ centuries   ≈ ∞      ≈ ∞           ≈ ∞
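A small Java sketch (added for illustration) that reproduces entries of this table: it evaluates two of the growth functions at the given problem sizes and converts the operation counts to wall-clock time under the assumption of 1 µs per operation.

```java
public class TimeConsumption {
    // Converts an operation count into a rough human-readable duration at 1 microsecond per operation.
    static String humanTime(double operations) {
        double s = operations * 1e-6;
        if (s < 1)              return String.format("%.0f us", operations);
        if (s < 3600)           return String.format("%.1f s", s);
        if (s < 86_400)         return String.format("%.1f hours", s / 3600);
        if (s < 86_400 * 365.0) return String.format("%.1f days", s / 86_400);
        return String.format("%.1e years", s / (86_400 * 365.0));
    }

    public static void main(String[] args) {
        double[] sizes = {1, 100, 10_000, 1e6, 1e9};
        for (double n : sizes) {
            System.out.printf("n = %.0e:  n log2 n -> %s,  n^2 -> %s%n",
                    n, humanTime(n * Math.log(n) / Math.log(2)), humanTime(n * n));
        }
    }
}
```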

81

SLIDE 30

Useful Tool

Theorem 2: Let f, g : ℕ → ℝ⁺ be two functions. Then it holds that:

1. lim_{n→∞} f(n)/g(n) = 0 ⇒ f ∈ O(g), O(f) ⊊ O(g).

2. lim_{n→∞} f(n)/g(n) = C > 0 (C constant) ⇒ f ∈ Θ(g).

3. f(n)/g(n) → ∞ for n → ∞ ⇒ g ∈ O(f), O(g) ⊊ O(f).
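An added example of how the theorem is used (my own illustration), comparing n log n with n² via case 1:

```latex
\lim_{n \to \infty} \frac{n \log n}{n^2}
  = \lim_{n \to \infty} \frac{\log n}{n} = 0
  \;\Rightarrow\; n \log n \in O(n^2), \quad O(n \log n) \subsetneq O(n^2).
```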

82

SLIDE 31

About the Notation

The common casual notation f = O(g) should be read as f ∈ O(g).
Clearly it does not hold that f1 = O(g), f2 = O(g) ⇒ f1 = f2!
For example, n = O(n²) and n² = O(n²), but naturally n ≠ n².
We avoid this notation where it could lead to ambiguities.

83

SLIDE 32

Reminder: Java Collections / Maps

Interfaces: Collection, Queue, List, Set, SortedSet, Map, SortedMap.
Classes: PriorityQueue, LinkedList, ArrayList, TreeSet, LinkedHashSet, HashSet, TreeMap, LinkedHashMap, HashMap.
[Figure: hierarchy diagram relating the interfaces (top) to the implementing classes (bottom).]

84

SLIDE 33

ArrayList versus LinkedList

Run time measurements for 10000 operations each (on [code] expert), ArrayList versus LinkedList:

ArrayList    LinkedList
469 µs       1787 µs
37900 µs     761 µs
1840 µs      2050 µs
426 µs       110600 µs
31 ms        301 ms
38 ms        141 ms
228 ms       1080 ms
648 µs       757 µs
58075 µs     609 µs
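The following Java sketch (added here; the exact operations measured on [code] expert are not listed on the slide) shows how such a measurement can be taken, timing 10000 insertions at the front of an ArrayList versus a LinkedList.

```java
import java.util.ArrayList;
import java.util.LinkedList;
import java.util.List;

public class ListBenchmark {
    // Measures the time for n insertions at position 0, in microseconds.
    static long insertAtFrontMicros(List<Integer> list, int n) {
        long start = System.nanoTime();
        for (int i = 0; i < n; i++) {
            list.add(0, i); // O(n) per call for ArrayList, O(1) for LinkedList
        }
        return (System.nanoTime() - start) / 1000;
    }

    public static void main(String[] args) {
        int n = 10_000;
        System.out.println("ArrayList:  " + insertAtFrontMicros(new ArrayList<>(), n) + " us");
        System.out.println("LinkedList: " + insertAtFrontMicros(new LinkedList<>(), n) + " us");
    }
}
```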

85

SLIDE 34

Reminder: Decision

[Decision chart, reconstructed:]
Key-value pairs → order important?
  sorted: TreeMap
  ordered (insertion order): LinkedHashMap
  not important: HashMap
Values → duplicates?
  yes → random access: ArrayList; no random access: LinkedList; by priority: PriorityQueue
  no → order important?
    sorted: TreeSet
    ordered (insertion order): LinkedHashSet
    not important: HashSet
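An added usage sketch (data and variable names are my own) following the decision chart above: a key-value mapping that must stay sorted calls for a TreeMap, values with duplicates and random access for an ArrayList, and unique values without ordering requirements for a HashSet.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.SortedMap;
import java.util.TreeMap;

public class ChoosingCollections {
    public static void main(String[] args) {
        // Key-value pairs, sorted order important -> TreeMap.
        SortedMap<String, Integer> grades = new TreeMap<>();
        grades.put("Mueller", 5);
        grades.put("Aebi", 6);
        System.out.println(grades.firstKey()); // "Aebi" (keys kept in sorted order)

        // Values, duplicates allowed, random access needed -> ArrayList.
        List<Integer> samples = new ArrayList<>(List.of(3, 1, 3, 7));
        System.out.println(samples.get(2)); // 3, constant-time access by index

        // Values, no duplicates, order not important -> HashSet.
        Set<String> seen = new HashSet<>();
        seen.add("a");
        seen.add("a");
        System.out.println(seen.size()); // 1
    }
}
```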

86

SLIDE 35

Asymptotic Runtimes (Java)

With our new language (Ω, O, Θ) we can now state the behavior of the data structures and their algorithms more precisely.
Asymptotic running times (anticipation!):

Data structure   Random Access   Insert     Next       Insert After Element   Search
ArrayList        Θ(1)            Θ(1) A     Θ(1)       Θ(n)                   Θ(n)
LinkedList       Θ(n)            Θ(1)       Θ(1)       Θ(1)                   Θ(n)
TreeSet          –               Θ(log n)   Θ(log n)   –                      Θ(log n)
HashSet          –               Θ(1) P     –          –                      Θ(1) P

A = amortized, P = expected, otherwise worst case
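An added illustration of the Random Access column (my own example): list.get(i) on a LinkedList costs Θ(n) per call, so the indexed loop below is Θ(n²) overall, while iterating uses the Θ(1) "next" step and stays Θ(n). On an ArrayList both loops are Θ(n).

```java
import java.util.LinkedList;
import java.util.List;

public class RandomAccessCost {
    public static void main(String[] args) {
        List<Integer> list = new LinkedList<>();
        for (int i = 0; i < 20_000; i++) list.add(i);

        long sum = 0;
        // Theta(n^2): each get(i) walks the list from one of its ends.
        for (int i = 0; i < list.size(); i++) sum += list.get(i);

        // Theta(n): the iterator follows the "next" pointers once.
        for (int x : list) sum += x;

        System.out.println(sum);
    }
}
```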

87

SLIDE 36

Asymptotic Runtimes (Python)

Asymptotic running times

Data structure   Random Access   Insert    Iteration   Insert After Element   Search x in S
list             Θ(1)            Θ(1) A    Θ(n)        Θ(n)                   Θ(n)
set              –               Θ(1) P    Θ(n)        –                      Θ(1) P
dict             –               Θ(1) P    Θ(n)        –                      Θ(1) P

A = amortized, P = expected, otherwise worst case

88