Search Algorithms
3
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 1
Search Algorithms 3 AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 - - PowerPoint PPT Presentation
Search Algorithms 3 AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 1 3 Search Algorithms 3.1 Problem-solving agents 3.2 Basic search algorithms 3.3 Heuristic search Greedy search A search 3.4 Local search Hill-climbing
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 1
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 2
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 3
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 4
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 5
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 6
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 7
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 8
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 9
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 10
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 11
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 12
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 13
R L S S S S R L R L R L S S S S L L L L R R R R
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 14
R L S S S S R L R L R L S S S S L L L L R R R R
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 15
R L S S S S R L R L R L S S S S L L L L R R R R
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 16
R L S S S S R L R L R L S S S S L L L L R R R R
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 17
R L S S S S R L R L R L S S S S L L L L R R R R
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 18
Start State Goal State
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 19
Start State Goal State
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 20
Start State Goal State
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 21
Start State Goal State
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 22
Start State Goal State
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 23
R R R P R R
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 24
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 25
Rimnicu Vilcea
Lugoj Zerind Sibiu Arad Fagaras Oradea Timisoara Arad Arad Oradea Arad
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 26
Rimnicu Vilcea
Lugoj Arad Fagaras Oradea Arad Arad Oradea Zerind Arad Sibiu Timisoara
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 27
Lugoj Arad Arad Oradea
Rimnicu Vilcea
Zerind Arad Sibiu Arad Fagaras Oradea Timisoara
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 28
1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8
STATE PARENT ACTION = Right PATH-COST = 6
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 29
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 30
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 31
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 32
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 33
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 34
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 35
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 36
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 37
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 38
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 39
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 40
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 41
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 42
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 43
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 44
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 45
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 46
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 47
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 48
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 49
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 50
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 51
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 52
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 53
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 54
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 55
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 56
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 57
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 58
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 59
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 60
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 61
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 62
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 63
Limit = 0
A A
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 64
Limit = 1
A B C A B C A B C A B C
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 65
Limit = 2
A B C D E F G A B C D E F G A B C D E F G A B C D E F G A B C D E F G A B C D E F G A B C D E F G A B C D E F G
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 66
Limit = 3
A B C D E F G H I J K L M N O A B C D E F G H I J K L M N O A B C D E F G H I J K L M N O A B C D E F G H I J K L M N O A B C D E F G H I J K L M N O A B C D E F G H I J K L M N O A B C D E F G H I J K L M N O A B C D E F G H I J K L M N O A B C D E F G H I J K L M N O A B C D E F G H I J K L M N O A B C D E F G H J K L M N O I A B C D E F G H I J K L M N O
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 67
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 68
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 69
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 70
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 71
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 72
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 73
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 74
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 75
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 76
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 77
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 78
Bucharest Giurgiu Urziceni Hirsova Eforie Neamt Oradea Zerind Arad Timisoara Lugoj Mehadia Dobreta Craiova Sibiu Fagaras Pitesti Rimnicu Vilcea Vaslui Iasi
Straight−line distance to Bucharest 160 242 161 77 151 241 366 193 178 253 329 80 199 244 380 226 234 374 98
Giurgiu Urziceni Hirsova Eforie Neamt Oradea Zerind Arad Timisoara Lugoj Mehadia Dobreta Craiova Sibiu Fagaras Pitesti Vaslui Iasi Rimnicu Vilcea Bucharest
71 75 118 111 70 75 120 151 140 99 80 97 101 211 138 146 85 90 98 142 92 87 86
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 79
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 80
Arad 366
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 81
Zerind Arad Sibiu Timisoara 253 329 374
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 82
Rimnicu Vilcea
Zerind Arad Sibiu Arad Fagaras Oradea Timisoara 329 374 366 176 380 193
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 83
Rimnicu Vilcea
Zerind Arad Sibiu Arad Fagaras Oradea Timisoara Sibiu Bucharest 329 374 366 380 193 253
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 84
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 85
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 86
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 87
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 88
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 89
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 90
Arad 366=0+366
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 91
Zerind Arad Sibiu Timisoara 447=118+329 449=75+374 393=140+253
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 92
Zerind Arad Sibiu Arad Timisoara
Rimnicu Vilcea
Fagaras Oradea 447=118+329 449=75+374 646=280+366 413=220+193 415=239+176 671=291+380
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 93
Zerind Arad Sibiu Arad Timisoara Fagaras Oradea 447=118+329 449=75+374 646=280+366 415=239+176
Rimnicu Vilcea
Craiova Pitesti Sibiu 526=366+160 553=300+253 417=317+100 671=291+380
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 94
Zerind Arad Sibiu Arad Timisoara Sibiu Bucharest
Rimnicu Vilcea
Fagaras Oradea Craiova Pitesti Sibiu 447=118+329 449=75+374 646=280+366 591=338+253 450=450+0 526=366+160 553=300+253 417=317+100 671=291+380
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 95
Zerind Arad Sibiu Arad Timisoara Sibiu Bucharest
Rimnicu Vilcea
Fagaras Oradea Craiova Pitesti Sibiu Bucharest Craiova
Rimnicu Vilcea
418=418+0 447=118+329 449=75+374 646=280+366 591=338+253 450=450+0 526=366+160 553=300+253 615=455+160 607=414+193 671=291+380
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 96
G n G2 Start
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 97
O Z A T L M D C R F P G B U H E V I N
380 400 420
S
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 98
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 99
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 100
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 101
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 102
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 103
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 104
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 105
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 106
Start State Goal State
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 107
Start State Goal State
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 108
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 109
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 110
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 111
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 112
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 113
state space
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 114
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 115
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 116
E(x) kT
E(x∗) kT /e E(x) kT = e E(x∗)−E(x) kT
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 117
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 118
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 119
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 120
∂f
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 121
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 122
1 2 3 1 2 3
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 123
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 124
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 125
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 126
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 127
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 128
X X X X X X X X X MAX (X) MIN (O) X X O O O X O O O O O O O MAX (X) X O X O X O X X X X X X X MIN (O) X O X X O X X O X . . . . . . . . . . . . . . . . . . . . . TERMINAL X X −1 +1 Utility
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 129
MAX
A B C D 3 12 8 2 4 6 14 5 2 3 2 2 3 a1 a2 a3 b1 b2 b3 c1 c2 c3 d1 d2 d3
MIN
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 130
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 131
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 132
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 133
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 134
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 135
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 136
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 137
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 138
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 139
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 140
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 141
.. .. .. MAX MIN MAX MIN V
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 142
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 143
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 144
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 145
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 146
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 147
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 148
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 149
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 150
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 151
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 152
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 153
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 154
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 155
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 156
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 157
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 158
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 159
2010
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 160
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 161
Silver, D, et. al., A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play, Science 07 Dec 2018: Vol. 362, Issue 6419, pp. 1140-1144
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 162
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 163
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 164
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 165
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 166
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 167
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 168
8 9 2 6 6 6 8 7 6 6 7 6 6 7 6 6 7 6 7 4 2 9 3 4 2 9 3 4 2 3 4 3 4 3 6 4 8 9 2 6 6 8 7 6 6 7 6 6 7 6 6 7 7 2 9 3 2 9 3 2 3 3 3 4 4 4 4 6 6 4 8 9 2 6 6 8 7 6 6 7 6 6 7 2 9 3 2 9 3 2 3 7 3 6 4 6 6 7 3 4 4 4 6 6 7 3 4
MAX MIN MAX MIN MAX MIN
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 169
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 170
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 171
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 172
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 173
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 174
Ref: Pearl, J (1984), Heuristics: Intelligent Search Strategies for Computer Problem Solving, Addison- Wesley
AI Slides (6e) c Lin Zuoquan@PKU 2003-2020 3 175