Path Planning in Unknown Environment by Optimal Transport on Graph - - PowerPoint PPT Presentation

path planning in unknown environment by optimal transport
SMART_READER_LITE
LIVE PREVIEW

Path Planning in Unknown Environment by Optimal Transport on Graph - - PowerPoint PPT Presentation

Path Planning in Unknown Environment by Optimal Transport on Graph Haomin Zhou School of Mathematics, Georgia Tech Collaborators: Magnus Egerstedt (ECE, GT), Haoyan Zhai (Math, GT) Partially Supported by NSF, ONR Optimal Path In Dynamical


slide-1
SLIDE 1

Path Planning in Unknown Environment by Optimal Transport on Graph

Haomin Zhou School of Mathematics, Georgia Tech

Collaborators: Magnus Egerstedt (ECE, GT), Haoyan Zhai (Math, GT)

Partially Supported by NSF, ONR

slide-2
SLIDE 2

Optimal Path In Dynamical Environment

Method of Evolving Junctions (MEJ) (Automatica 2017, IJRR 2017, with Chow-Egerstedt-Li-Lu, ).

slide-3
SLIDE 3

Multi-Agent System

The robots have limited detection ranges. Path generated by Intermittent Diffusion (with Egerstedt-Frederick).

slide-4
SLIDE 4

Path Exploration in Unknown Environment

The robot has a limited detection range.

slide-5
SLIDE 5

Path Exploration in Unknown Environment

The robots have limited detection ranges.

slide-6
SLIDE 6

Outline Path planning in unknown environments Optimal transport on finite graphs General control with unknown constraints

slide-7
SLIDE 7

Path Planning in Unknown Environments

γ(0) = x0, γ(T) = xf , φ(γ(t)) ≥ 0 for all t ∈ [0, T], ˆ ψ(γ(t), γ, t) ≥ 0 for all t ∈ [0, T],

– Problem: Find a continuous curve γ(t) in Ω ⊂ Rd such that

<latexit sha1_base64="P9C6C2BHJDvL09eXfnjvkzepY8=">ACZXicbVHLbhMxFPVMeYSBQloQGxZckSAVqY1mwoKVQUSYkdApK0Uh+iOx0ms+jHyo1I0zE+yY8uG38BJs4CWK1k6Pvfhe47LWgrn8/xnku7cun3nbudedv/B7sNH3b39U2eCZXzMjDT2vETHpdB87IWX/Ly2HFUp+Vl58X6dP7vk1gmjv/pVzacKF1rMBUMfqVn3+9ERpVlGtRG64tpD1tDN1KaUgbcwsiaOUu1b+BALAIEZ7YUOJjhgwV5y6NMFKoUH/lUfhI7XT4ovEKgLpeMeqEK/LMvmS/ut6h8CpeACW4JfodZt5cP8k3ATVBsQY9sYzTr/qCVYUHFRZlE5yZFXvtpg9YLJnmb0eB4jewCF3wSoUbF3bTZ6GnhZWQqmBsbTxS6Yf/uaFA5t1JlrFzv7K7n1uT/cpPg58fTRug6eK7Z1UPzIMEbWFsOlbCcebmKAJkVcVdgS7TIfPyYLJpQXJd8E5wOB8XrwfDzsHfybmtHhzwjL8gBKcgbckI+khEZE0Z+JZ1kL9lPfqe76ZP06Vpmx7HpN/In3+B4/ftbM=</latexit>

ˆ ψ(x, t, γ) = ⇢ ψ(x) if d(x, γ(τ)) ≤ R for some τ ≤ t

  • therwise

—————————————————————– φ(x) ≤ 0 are the known constraints, ψ(x) ≤ 0 are the unknown obstacles.

<latexit sha1_base64="FcXLM+KRkMKsTkBY41J/TkbvQ=">ACmnicpVFNTxRBEO0ZUGH8WvDAQ4VFxNMdDOzHuCACdFEMF4wYFkZ7Op6a1lO9vTPXTXoJsNP8q/4s1/Y+/HQYGbL+nk5VW96voKq08p+nvKF5ZfDw0dp68vjJ02fPGxubZ97WTlJHWm3dRYGetDLUYcWaLipHWBazovxp1n8/JqcV9ac8qSiXomXRg2VRA5Sv/Hz3f8CAPI8yY1VZkCGIdnJq5Ha/fEGck1XkMIOoCPgEcHY2O8GpDWeHSrD/u3CHBz+fkdtFh5beEapybeg32imrXQOuEuyJWmKJU76jV/5wMq6DM1Jjd53s7Ti3hQdq1DxJslrTxXKMV5SN1CDJfnedL7aG3gdlAEMrQsvDdX/3ZMsfR+UhYhs0Qe+duxmXhfrFvzcL83VaqmYxcfDSsNbCF2Z1goBxJ1pNAUDoVegU5QoeSwzWTsITs9sh3yVm7lb1vtb+1m4cHy3WsiZfildgVmdgTh+JYnIiOkNFW9CH6HB3F2/H+Ev8dZEaR0vPC/EP4tM/Txm6FA=</latexit>

Constraints are expressed in terms of level set functions.

slide-8
SLIDE 8

Challenges

  • Local traps and

replanning,

  • Narrow pathways,
  • Collisions,
  • Communications,
  • Computational cost in

higher dimensions.

slide-9
SLIDE 9

Existing methods

  • Bug family: Bug0, Bug1, Bug2, TangentBug, DistBug, …
  • Probabilistic Road Map (PRM),
  • Rapid-growing Random Tree (RRT), RRT* (dynamical version),
  • Artificial Potential Field (APF),
  • Graph based methods (Dijkstra style): A*, D, D*, focus-D*, D*-

lite and more,

  • Genetic algorithm, Neural network, fuzzy logic, fast marching

tree, and many more.

The convergence for many of the methods, if exists, is asymptotic.

slide-10
SLIDE 10

Our Algorithm Idea: potential guided, tree based 2-layer iterations. 3 main steps:

Tree generating, Path finding, Environment updating.

Potential is used to ensure convergence, and Trees are used to control the computation cost in high dimensions.

slide-11
SLIDE 11

Properties of Our Algorithm

Proposition

There exists a unique path from initial to target configurations over the generated graph G. And if the path is denoted by {xi}q

i=0 ⊂ V with

x0 → x1 → · · · → xq = xf , where xi is the ancestor of xi+1. We use back tracing to find the path:

slide-12
SLIDE 12

Convergence Analysis

Theorem

Assuming that sup

γ∈Γ

inf

t∈[0,T] sup r≥0

{r : B(γ(t), r) ∩ O = ∅} = L > 0, and l < 2L √n, where n is the dimension of Ω and l is the step size of the graph generation, the graph generation terminates in finite iterations. The generated graph G = (V , E) is connected and has a finite number of vertices |V | < ∞ with xs, xt ∈ V .

The tree generating iteration stops in finite steps.

slide-13
SLIDE 13

Convergence Analysis

Theorem

Let {i}m

i=1 be the paths produced by the algorithm with {Ti}m i=1 being

the stopping time set. If we use the same assumptions in the previous theorem and sup

i

inf

n ✏ : B(i(Ti), ✏) \ OTi

c 6= ;

  • = q < R,

holds, then m < 1.

The outer iteration stops in finite steps. Our algorithm stops in finite steps.

slide-14
SLIDE 14

Examples A 10-robot (20 dimensional) example. The entire computation is within 1 minute in Matlab on a laptop.

slide-15
SLIDE 15

Examples A 3-robot example. Most area is not explored.

slide-16
SLIDE 16

Summary of the Properties

  • The algorithm is deterministic, stops in finite steps,
  • Guarantees to find a feasible path if there exists one,
  • If the algorithm stops without returning a path, there isn’t one

that can be identified by the step size.

  • The growing rate for the tree is linear, not exponential, w.r.t.

the dimension of the configuration space,

  • Explores only a limited part of the configuration space.

The method is inspired by optimal transport on trees with intermittent diffusion.

slide-17
SLIDE 17

Limited Exploration Region

R

Theorem

Given any known environment, the generated graph G is bounded by R, produced by evolution of Fokker-Planck equation in the same environment: G ⊂ [

x∈R

Box(x, l).

The tree generation contains 2 phases: projected gradient and diffusion.

slide-18
SLIDE 18

Limited Exploration Region

The evolution of Fokker-Planck equation on graph has intermittent diffusion (diffusion coefficient is turned on-and-off).

Theorem

Assuming that the robots only stop

  • n node points with assumptions in

previous theorems and R > L, the complete path γ generated by the algorithm in the unknown environment satisfies γ ⊂ [

x∈R

Box(x, l), where R is produced with the full knowledge of the environment.

slide-19
SLIDE 19

Optimal Transport

  • Optimal Transport: Monge (1781), Kantorovich (1942), Otto, Kinderlehrer,

Villani, McCann, Carlen, Lott, Strum, Gangbo, Jordan, Evans, Brenier, Benamou, Caffarelli, Figalli, and many many more,

  • Related to linear programming, manifold learning, image processing, game

theory, … Benamou-Brenier Formula

W2(ρ0, ρ1) = inf

v (

Z 1 Z

RN v(t, x)2ρ(t, x)dxdt)

1 2

s.t. ∂ρ ∂t + r · (vρ) = 0, ρ(0, x) = ρ0, ρ(1, x) = ρ1.

  • h

2-Wasserstein distance: the minimal cost, with respect to the square of Euclidean distance, to move from ρ0 to ρ1.

slide-20
SLIDE 20
  • Randomly perturbed gradient system:
  • Time evolution of the probability density function, the Fokker-Planck

equation:

  • Invariant distribution at steady state -- Gibbs distribution:

ρt(x, t) = ⇥ · (⇥Ψ(x)ρ(x, t)) + β∆ρ(x, t)

ρ∗(x) = 1 K e−Ψ(x)/β

K = Z

RN e−Ψ(x)/β dx

Fokker-Planck Equations

dx = rΨ(x)dt + p 2βdWt, x 2 RN

slide-21
SLIDE 21
  • Free energy
  • Potential
  • Gibbs-Boltzmann Entropy
  • Fokker-Planck equation is the gradient flow of the free energy under

2-Wasserstein metric on the manifold of probability space.

  • Gibbs distribution is the global attractor of the gradient system.

F(ρ) = U(ρ) − βS(ρ)

U(ρ) = Z

RN Ψ(x)ρ(x)dx

S(ρ) = − Z

RN ρ(x) log ρ(x)dx

Free Energy and Fokker-Planck Equations

slide-22
SLIDE 22

Optimal Transport

. . . . . .

. .

F(ρ) = U(ρ) − βS(ρ)

dx = ⇤Ψ(x)dt +

  • 2βdWt

Free Energy SDE Fokker-Planck Equation

ρt = r · (rΨρ) + β∆ρ = r · (r(Ψ + β log ρ)ρ)

slide-23
SLIDE 23
  • Our Goal: establish optimal transport on graphs with finite vertices.
  • Why on graphs: Physical space (number of sites or states) is finite, not

necessary from a spatial discretization such as a lattice.

  • Applications: game theory, RNA folding, logistic, chemical reactions,

machine learning, Markov networks, numerical schemes, ...

  • Mathematics: Graph theory, Mass transport, Dynamical systems,

Stochastic Processes, PDE’s, ...

  • Many Recent Developments: Erbar, Mielke, Mass, Gigli, Ollivier,

Villani, Tetali, Fathi, Qian, Carlen, …

Optimal Transport on Finite graphs

slide-24
SLIDE 24

Basic Set-ups

Graph with finite vertices G = (V , E), V = {1, · · · , n}, E is the edge set. Probability set P(G) = {(ρi)n

i=1 | n

X

i=1

ρi = 1, ρi ≥ 0}. Discrete free energy F(ρ) = 1 2

n

X

i=1 n

X

j=1

wijρiρj +

n

X

i=1

viρi+β

n

X

i=1

ρi log ρi. Boltzmann-Shannon entropy

slide-25
SLIDE 25
  • Common discretizations of continuous fokker-planck equations often lead to

incorrect results,

  • Graphs are not length spaces and many of the essential techniques cannot be used

anymore,

  • The notion of random perturbation (white noise) of a Markov process on discrete

spaces is not clear.

  • Nodes on graphs may have very different neighborhood structures.

Theorem: Any given linear discretization of the continuous equation can be written as dρi dt =

  • j

((

  • k

ei

jkΦk) + ci j)ρj.

Let A = {Φ ∈ RN :

  • j

((

  • k

ei

jkΦk) + ci j)e−

Φj β = 0}.

Then A is a zero measure set.

Challenges

slide-26
SLIDE 26

Optimal Transport on Graphs

Discrete 2-Wasserstein distance W2;F(ρ0, ρ1) = inf

v (

Z 1 (v, v)ρdt)

1 2

where v and ρ satisfy dρ dt + divG(ρv) = 0, ρ(0, x) = ρ0, ρ(1, x) = ρ1. For any ρ0, ρ1 ∈ P(G), define

slide-27
SLIDE 27

Vector Operators on Graphs

Vector field on a graph: v = (vij)(i,j)∈E, satisfying vij = −vji Potential Φ induced vector field : rG = (Φi Φj)(i,j)∈E divG(ρv) = −(P

j∈N(i) vijgF ij(ρ))n i=1

Divergence w. r. t. ρ Inner product (v, u)ρ = P

(i,j)∈E vijuijgF ij(ρ)

Here Fi(ρ) =

∂ ∂ρi F(ρ) and

gF

ij (ρ) =

8 > < > : ρi if Fi(ρ) > Fj(ρ), j ∈ N(i); ρj if Fi(ρ) < Fj(ρ), j ∈ N(i);

ρi+ρj 2

if Fi(ρ) = Fj(ρ), j ∈ N(i).

slide-28
SLIDE 28

Discrete Fokker-Planck Equations

Theorem 1

For a finite graph G = (V , E) and a constant β > 0. The gradient flow of discrete free energy F(ρ) = 1 2

n

X

i=1 n

X

j=1

wijρiρj +

n

X

i=1

viρi + β

n

X

i=1

ρi log ρi

  • n the metric space (Po(G), W2;F) is

dρi dt = X

j∈N(i)

ρj(Fj(ρ) − Fi(ρ))+ − X

j∈N(i)

ρi(Fi(ρ) − Fj(ρ))+ (1) for any i ∈ V . Here Fi(ρ) =

∂ ∂ρi F(ρ) and (·)+ = max{·, 0}.

ρt = r · (rΨρ) + β∆ρ = r · (r(Ψ + β log ρ)ρ)

Continuous Fokker-Planck equation

slide-29
SLIDE 29

Fokker-Planck Equation and Exploring Region

∂ρj ∂t = ⇣ X

k∈Nb(j)

(Fk(ρ, σ) Fj(ρ, σ))+ρkdjk X

k∈Nb(j)

(Fj(ρ, σ) Fk(ρ, σ))+ρjdjk ⌘ 1 (∆x)2 ,

In the path planning case, the region is determined by,

where

—————————————- Fi(ρ, σ) = p(i) + σ log ρi.

<latexit sha1_base64="+y49NWe5/l59mS7Cjxckc5F/7Cc=">ACVHicjVFNSwMxEM2uVuv6terRy2ArVNSyWw96UBAE8ahgq9AtSzZN29BsiRZoZT+SD0I/hIvHkzrHtR68EHg8d4M/OSZJxpEwRvjruwWFpaLq94q2vrG5v+1nZLy1wR2iS/WYE05E7RpmOH0MVMUpwmnD8nwauo/PFGlmRT3ZpTRTor7gvUYwcZKsT8/icAIq8SEgmulQY8KrXMYNapAbyCLN+ik+gAvIauwADgsBIi7MC2JWbUOsV8J6sEME/CglRQgdvYf4m6kuSpnUc41rodBpnpjLEyjHA68aJc0wyTIe7TtqUCp1R3xrNQJrBvlS70pLP7jtTv3eMcar1KE1sZYrNQP/2puJfXjs3vbPOmIksN1SQr0G9nIORME0YukxRYvjIEkwUs7sCGWCFibH/4NkQwt8nz5NWox6e1Bt3jcrleRFHGe2iPVRDITpFl+gG3aImIugZvTvIcZxX58NdcEtfpa5T9OygH3A3PgHfMqcm</latexit>

—————————————- p(i) is the distance to the target.

<latexit sha1_base64="+t3A8SHKHadBV9Sr0u9u5dEceR4=">ACSXicjVA9T8MwEHXKd/gqMLJYFKQyUCVlgIGhEgtjkSgNVFxnGtr4TiRfUGqv49FjY2/gMLAwgx4YOQBl4kqXnd3e6dy/KpDoeU9OaWZ2bn5hcldXldWy9vbF6aNcWjyVqb6OmAEpFLRQoITrTANLIglX0e3puH51B9qIVF3gIMwYT0luoIztFKnfHPwT1BKg8ANVCpUDAqpu0uzqtjfpcJQ7AONrVemOFBMiz8y3QOs0U654tW8AnSa+BNSIRM0O+XHIE5ntglXDJj2r6XYThkGgWXMHKD3EDG+C3rQdtSxRIw4bBIYkT3rBLTbqrtsyYL9fvEkCXGDJLIdiYM+Z3bSz+VWvn2D0Oh0JlOYLiX4u6uSyutbHa8zVwlANLGNfCeqW8zTjaMN3bQj+75OnyW95h/W6uf1SuNkEsci2SY7pEp8ckQa5Iw0SYtwck+eySt5cx6cF+fd+fhqLTmTmS3yA6WZT+zNqCo=</latexit>

—————————————- σ = 0 projected gradient, σ > 0 diffusion.

<latexit sha1_base64="0qSzeSLTN3leVBCB+E/RCvptXc0=">ACXicjVDPT8IwGO2mKEzEqQcPXhrBxIOSDQ96UEPixSMm8iNhHRdB5WuXdrOhBD+SW968V+xA0xUPiSJi/ve9/X73thyqjSnvdm2Rubha3tYsnZKe9W9tz9g4SmcSkjQUTshciRjlpK2pZqSXSoKSkJFuOLnP690XIhUV/ElPUzJI0IjTmGKkjTR09cU/ASEMAifgvKIcA2dGgwUHSXo1oM1mErxTLAmERxJFjOM/tX5673BPROM7yRepDt+rVvQXgOvFXpApWaA3d1yASOEvMXMyQUn3fS/VghqSmJG5E2SKpAhP0Ij0DeUoIWowW6Qzh6dGiWAspHlm8YX6vWOGEqWmSWicCdJj9buWi3/V+pmOrwczytNME46XH8UZg1rAPGpzsDSZsKkhCEtqdoV4jCQyOUnlmBD83yevk06j7l/WG4+NavNmFUcRHIMTcAZ8cAWa4AG0QBtg8G4Bq2Q51odsMt2ZWm1rVXPIfgB+gTCxKpew=</latexit>

Distance value is not used, only the projected gradient direction is used,

slide-30
SLIDE 30

Fokker-Planck Equation and Exploring Region A different view point:

On a generated tree, Fokker-Planck equation selects nodes

(a) The Evolved Region (b) The Generated Graph

slide-31
SLIDE 31

General Control Problems

For a complete control system (Ω, U, f ) that is completely controllable in time T, with U being a compact set of Rn and f being a Lipschitz function, we let (Ω, g) be a d dimensional compact Riemannian manifold. There exists a distance function d(·, ·) induced by g(·, ·). We want to find u ∈ U[0,T) such that ˙ γ(t) = f (γ(t), u(t)), γ(0) = x0, γ(T) = xf , φ(γ(t)) ≥ 0 for all t ∈ [0, T], ˆ ψ(γ(t), γ, t) ≥ 0 for all t ∈ [0, T], where ˆ ψ(x, t, γ) = ⇢ ψ(x) if d(x, γ(τ)) ≤ R for some τ ≤ t

  • therwise
slide-32
SLIDE 32

An Example, Path Planning on a Sphere

slide-33
SLIDE 33

Thank you for your attention!