CS 730/830: Intro AI 1 handout: slides Control Wheeler Ruml (UNH) - - PowerPoint PPT Presentation

cs 730 830 intro ai
SMART_READER_LITE
LIVE PREVIEW

CS 730/830: Intro AI 1 handout: slides Control Wheeler Ruml (UNH) - - PowerPoint PPT Presentation

CS 730/830: Intro AI 1 handout: slides Control Wheeler Ruml (UNH) Lecture 6, CS 730 1 / 12 EOLQs Control Wheeler Ruml (UNH) Lecture 6, CS 730 2 / 12 Control Problems MPC Break P Control PD Control PID Control


slide-1
SLIDE 1

CS 730/830: Intro AI

Control

Wheeler Ruml (UNH) Lecture 6, CS 730 – 1 / 12

1 handout: slides

slide-2
SLIDE 2

EOLQs

Control

Wheeler Ruml (UNH) Lecture 6, CS 730 – 2 / 12

slide-3
SLIDE 3

Control

Control ■ Problems ■ MPC ■ Break ■ P Control ■ PD Control ■ PID Control ■ Bisection Search ■ See Also ■ EOLQs

Wheeler Ruml (UNH) Lecture 6, CS 730 – 3 / 12

slide-4
SLIDE 4

Planning Problems

Control ■ Problems ■ MPC ■ Break ■ P Control ■ PD Control ■ PID Control ■ Bisection Search ■ See Also ■ EOLQs

Wheeler Ruml (UNH) Lecture 6, CS 730 – 4 / 12

Observability: complete, partial, hidden State: discrete, continuous Actions: deterministic, stochastic, discrete, continuous Nature: static, deterministic, stochastic Interaction:

  • ne decision, sequential

Time: static/off-line, on-line, discrete, continuous Percepts: discrete, continuous, uncertain Others: solo, cooperative, competitive

slide-5
SLIDE 5

Model Predictive Control

Control ■ Problems ■ MPC ■ Break ■ P Control ■ PD Control ■ PID Control ■ Bisection Search ■ See Also ■ EOLQs

Wheeler Ruml (UNH) Lecture 6, CS 730 – 5 / 12

used with ‘receeding horizon’ (≈ real-time search) simulate a bunch of controls (near nominal), pick best!

  • r steer to a bunch of states (near nominal), pick best!

flexible, dangerous

slide-6
SLIDE 6

Break

Control ■ Problems ■ MPC ■ Break ■ P Control ■ PD Control ■ PID Control ■ Bisection Search ■ See Also ■ EOLQs

Wheeler Ruml (UNH) Lecture 6, CS 730 – 6 / 12

asst3

projects

wildcard class

slide-7
SLIDE 7

P Control

Control ■ Problems ■ MPC ■ Break ■ P Control ■ PD Control ■ PID Control ■ Bisection Search ■ See Also ■ EOLQs

Wheeler Ruml (UNH) Lecture 6, CS 730 – 7 / 12

u = KP (xr − ˆ x) responsiveness vs smoothness = spring model unstable with inertia!

slide-8
SLIDE 8

PD Control

Control ■ Problems ■ MPC ■ Break ■ P Control ■ PD Control ■ PID Control ■ Bisection Search ■ See Also ■ EOLQs

Wheeler Ruml (UNH) Lecture 6, CS 730 – 8 / 12

u = KP (xr − ˆ x) + KD d(xr − ˆ x) dt dampen correction if error is changing a lot = dampened spring model! does nothing if persistent error balances P component

slide-9
SLIDE 9

PID Control

Control ■ Problems ■ MPC ■ Break ■ P Control ■ PD Control ■ PID Control ■ Bisection Search ■ See Also ■ EOLQs

Wheeler Ruml (UNH) Lecture 6, CS 730 – 9 / 12

u = KP (xr − ˆ x) + KI

  • (xr − ˆ

x)dt + KD d(xr − ˆ x) dt removes any persistent error however, ‘wind-up’ widely used. not optimal or necessarily stable. tune by hand, or Thrun says coordinate-wise bisection search

slide-10
SLIDE 10

Bisection Search

Control ■ Problems ■ MPC ■ Break ■ P Control ■ PD Control ■ PID Control ■ Bisection Search ■ See Also ■ EOLQs

Wheeler Ruml (UNH) Lecture 6, CS 730 – 10 / 12

given f and initial guesses l and r 1. bracket a local minimum (a) try guess m in middle (b) if m smallest, done! (local min between l and r) (c) if l smallest, r ← m, m ← l and move l left move l by at least original r − l (double interval) (d) if r smallest, m ← r and move r right 2. refine estimate (a) try lm between l and m. (b) if smaller than m, r ← m and m ← lm (c)

  • therwise, try mr between m and r.

(d) if smaller than m, l ← m and m ← mr (e)

  • therwise m is smallest, l ← lm and r ← mr

(f) until range small or values close

slide-11
SLIDE 11

See Also

Control ■ Problems ■ MPC ■ Break ■ P Control ■ PD Control ■ PID Control ■ Bisection Search ■ See Also ■ EOLQs

Wheeler Ruml (UNH) Lecture 6, CS 730 – 11 / 12

  • ptimal control: eg, Linear-Quadratic-Gaussian (LQG)

discrete control: eg, Markov decision processes state estimation aka filtering: eg, Kalman filter, particle filter

slide-12
SLIDE 12

EOLQs

Control ■ Problems ■ MPC ■ Break ■ P Control ■ PD Control ■ PID Control ■ Bisection Search ■ See Also ■ EOLQs

Wheeler Ruml (UNH) Lecture 6, CS 730 – 12 / 12

Please write down the most pressing question you have about the course material covered so far and put it in the box on your way out. Thanks!