[PPT] - Path Planning in Unknown Environment by Optimal Transport on Graph PowerPoint Presentation

SLIDE 1

Path Planning in Unknown Environment by Optimal Transport on Graph

Haomin Zhou School of Mathematics, Georgia Tech

Collaborators: Magnus Egerstedt (ECE, GT), Haoyan Zhai (Math, GT)

Partially Supported by NSF, ONR

SLIDE 2

Optimal Path In Dynamical Environment

Method of Evolving Junctions (MEJ) (Automatica 2017, IJRR 2017, with Chow-Egerstedt-Li-Lu, ).

SLIDE 3

Multi-Agent System

The robots have limited detection ranges. Path generated by Intermittent Diffusion (with Egerstedt-Frederick).

SLIDE 4

Path Exploration in Unknown Environment

The robot has a limited detection range.

SLIDE 5

Path Exploration in Unknown Environment

The robots have limited detection ranges.

SLIDE 6

Outline Path planning in unknown environments Optimal transport on finite graphs General control with unknown constraints

SLIDE 7

Path Planning in Unknown Environments

γ(0) = x0, γ(T) = xf , φ(γ(t)) ≥ 0 for all t ∈ [0, T], ˆ ψ(γ(t), γ, t) ≥ 0 for all t ∈ [0, T],

– Problem: Find a continuous curve γ(t) in Ω ⊂ Rd such that

<latexit sha1_base64="P9C6C2BHJDvL09eXfnjvkzepY8=">ACZXicbVHLbhMxFPVMeYSBQloQGxZckSAVqY1mwoKVQUSYkdApK0Uh+iOx0ms+jHyo1I0zE+yY8uG38BJs4CWK1k6Pvfhe47LWgrn8/xnku7cun3nbudedv/B7sNH3b39U2eCZXzMjDT2vETHpdB87IWX/Ly2HFUp+Vl58X6dP7vk1gmjv/pVzacKF1rMBUMfqVn3+9ERpVlGtRG64tpD1tDN1KaUgbcwsiaOUu1b+BALAIEZ7YUOJjhgwV5y6NMFKoUH/lUfhI7XT4ovEKgLpeMeqEK/LMvmS/ut6h8CpeACW4JfodZt5cP8k3ATVBsQY9sYzTr/qCVYUHFRZlE5yZFXvtpg9YLJnmb0eB4jewCF3wSoUbF3bTZ6GnhZWQqmBsbTxS6Yf/uaFA5t1JlrFzv7K7n1uT/cpPg58fTRug6eK7Z1UPzIMEbWFsOlbCcebmKAJkVcVdgS7TIfPyYLJpQXJd8E5wOB8XrwfDzsHfybmtHhzwjL8gBKcgbckI+khEZE0Z+JZ1kL9lPfqe76ZP06Vpmx7HpN/In3+B4/ftbM=</latexit>

ˆ ψ(x, t, γ) = ⇢ ψ(x) if d(x, γ(τ)) ≤ R for some τ ≤ t

therwise

—————————————————————– φ(x) ≤ 0 are the known constraints, ψ(x) ≤ 0 are the unknown obstacles.

<latexit sha1_base64="FcXLM+KRkMKsTkBY41J/TkbvQ=">ACmnicpVFNTxRBEO0ZUGH8WvDAQ4VFxNMdDOzHuCACdFEMF4wYFkZ7Op6a1lO9vTPXTXoJsNP8q/4s1/Y+/HQYGbL+nk5VW96voKq08p+nvKF5ZfDw0dp68vjJ02fPGxubZ97WTlJHWm3dRYGetDLUYcWaLipHWBazovxp1n8/JqcV9ac8qSiXomXRg2VRA5Sv/Hz3f8CAPI8yY1VZkCGIdnJq5Ha/fEGck1XkMIOoCPgEcHY2O8GpDWeHSrD/u3CHBz+fkdtFh5beEapybeg32imrXQOuEuyJWmKJU76jV/5wMq6DM1Jjd53s7Ti3hQdq1DxJslrTxXKMV5SN1CDJfnedL7aG3gdlAEMrQsvDdX/3ZMsfR+UhYhs0Qe+duxmXhfrFvzcL83VaqmYxcfDSsNbCF2Z1goBxJ1pNAUDoVegU5QoeSwzWTsITs9sh3yVm7lb1vtb+1m4cHy3WsiZfildgVmdgTh+JYnIiOkNFW9CH6HB3F2/H+Ev8dZEaR0vPC/EP4tM/Txm6FA=</latexit>

Constraints are expressed in terms of level set functions.

SLIDE 8

Challenges

Local traps and

replanning,

Narrow pathways,
Collisions,
Communications,
Computational cost in

higher dimensions.

SLIDE 9

Existing methods

Bug family: Bug0, Bug1, Bug2, TangentBug, DistBug, …
Probabilistic Road Map (PRM),
Rapid-growing Random Tree (RRT), RRT* (dynamical version),
Artificial Potential Field (APF),
Graph based methods (Dijkstra style): A*, D, D*, focus-D*, D*-

lite and more,

Genetic algorithm, Neural network, fuzzy logic, fast marching

tree, and many more.

The convergence for many of the methods, if exists, is asymptotic.

SLIDE 10

Our Algorithm Idea: potential guided, tree based 2-layer iterations. 3 main steps:

Tree generating, Path finding, Environment updating.

Potential is used to ensure convergence, and Trees are used to control the computation cost in high dimensions.

SLIDE 11

Properties of Our Algorithm

Proposition

There exists a unique path from initial to target configurations over the generated graph G. And if the path is denoted by {xi}q

i=0 ⊂ V with

x0 → x1 → · · · → xq = xf , where xi is the ancestor of xi+1. We use back tracing to find the path:

SLIDE 12

Convergence Analysis

Theorem

Assuming that sup

γ∈Γ

inf

t∈[0,T] sup r≥0

{r : B(γ(t), r) ∩ O = ∅} = L > 0, and l < 2L √n, where n is the dimension of Ω and l is the step size of the graph generation, the graph generation terminates in finite iterations. The generated graph G = (V , E) is connected and has a finite number of vertices |V | < ∞ with xs, xt ∈ V .

The tree generating iteration stops in finite steps.

SLIDE 13

Convergence Analysis

Theorem

Let {i}m

i=1 be the paths produced by the algorithm with {Ti}m i=1 being

the stopping time set. If we use the same assumptions in the previous theorem and sup

i

inf

✏

n ✏ : B(i(Ti), ✏) \ OTi

c 6= ;

= q < R,

holds, then m < 1.

The outer iteration stops in finite steps. Our algorithm stops in finite steps.

SLIDE 14

Examples A 10-robot (20 dimensional) example. The entire computation is within 1 minute in Matlab on a laptop.

SLIDE 15

Examples A 3-robot example. Most area is not explored.

SLIDE 16

Summary of the Properties

The algorithm is deterministic, stops in finite steps,
Guarantees to find a feasible path if there exists one,
If the algorithm stops without returning a path, there isn’t one

that can be identified by the step size.

The growing rate for the tree is linear, not exponential, w.r.t.

the dimension of the configuration space,

Explores only a limited part of the configuration space.

The method is inspired by optimal transport on trees with intermittent diffusion.

SLIDE 17

Limited Exploration Region

R

Theorem

Given any known environment, the generated graph G is bounded by R, produced by evolution of Fokker-Planck equation in the same environment: G ⊂ [

x∈R

Box(x, l).

The tree generation contains 2 phases: projected gradient and diffusion.

SLIDE 18

Limited Exploration Region

The evolution of Fokker-Planck equation on graph has intermittent diffusion (diffusion coefficient is turned on-and-off).

Theorem

Assuming that the robots only stop

n node points with assumptions in

previous theorems and R > L, the complete path γ generated by the algorithm in the unknown environment satisfies γ ⊂ [

x∈R

Box(x, l), where R is produced with the full knowledge of the environment.

SLIDE 19

Optimal Transport

Optimal Transport: Monge (1781), Kantorovich (1942), Otto, Kinderlehrer,

Villani, McCann, Carlen, Lott, Strum, Gangbo, Jordan, Evans, Brenier, Benamou, Caffarelli, Figalli, and many many more,

Related to linear programming, manifold learning, image processing, game

theory, … Benamou-Brenier Formula

W2(ρ0, ρ1) = inf

v (

Z 1 Z

RN v(t, x)2ρ(t, x)dxdt)

1 2

s.t. ∂ρ ∂t + r · (vρ) = 0, ρ(0, x) = ρ0, ρ(1, x) = ρ1.

h

2-Wasserstein distance: the minimal cost, with respect to the square of Euclidean distance, to move from ρ0 to ρ1.

SLIDE 20

Randomly perturbed gradient system:
Time evolution of the probability density function, the Fokker-Planck

equation:

Invariant distribution at steady state -- Gibbs distribution:

ρt(x, t) = ⇥ · (⇥Ψ(x)ρ(x, t)) + β∆ρ(x, t)

ρ∗(x) = 1 K e−Ψ(x)/β

K = Z

RN e−Ψ(x)/β dx

Fokker-Planck Equations

dx = rΨ(x)dt + p 2βdWt, x 2 RN

SLIDE 21

Free energy
Potential
Gibbs-Boltzmann Entropy
Fokker-Planck equation is the gradient flow of the free energy under

2-Wasserstein metric on the manifold of probability space.

Gibbs distribution is the global attractor of the gradient system.

F(ρ) = U(ρ) − βS(ρ)

U(ρ) = Z

RN Ψ(x)ρ(x)dx

S(ρ) = − Z

RN ρ(x) log ρ(x)dx

Free Energy and Fokker-Planck Equations

SLIDE 22

Optimal Transport

. . . . . .

. .

F(ρ) = U(ρ) − βS(ρ)

dx = ⇤Ψ(x)dt +

2βdWt

Free Energy SDE Fokker-Planck Equation

ρt = r · (rΨρ) + β∆ρ = r · (r(Ψ + β log ρ)ρ)

SLIDE 23

Our Goal: establish optimal transport on graphs with finite vertices.
Why on graphs: Physical space (number of sites or states) is finite, not

necessary from a spatial discretization such as a lattice.

Applications: game theory, RNA folding, logistic, chemical reactions,

machine learning, Markov networks, numerical schemes, ...

Mathematics: Graph theory, Mass transport, Dynamical systems,

Stochastic Processes, PDE’s, ...

Many Recent Developments: Erbar, Mielke, Mass, Gigli, Ollivier,

Villani, Tetali, Fathi, Qian, Carlen, …

Optimal Transport on Finite graphs

SLIDE 24

Basic Set-ups

Graph with finite vertices G = (V , E), V = {1, · · · , n}, E is the edge set. Probability set P(G) = {(ρi)n

i=1 | n

X

i=1

ρi = 1, ρi ≥ 0}. Discrete free energy F(ρ) = 1 2

n

X

i=1 n

X

j=1

wijρiρj +

n

X

i=1

viρi+β

n

X

i=1

ρi log ρi. Boltzmann-Shannon entropy

SLIDE 25

Common discretizations of continuous fokker-planck equations often lead to

incorrect results,

Graphs are not length spaces and many of the essential techniques cannot be used

anymore,

The notion of random perturbation (white noise) of a Markov process on discrete

spaces is not clear.

Nodes on graphs may have very different neighborhood structures.

Theorem: Any given linear discretization of the continuous equation can be written as dρi dt =

j

((

k

ei

jkΦk) + ci j)ρj.

Let A = {Φ ∈ RN :

j

((

k

ei

jkΦk) + ci j)e−

Φj β = 0}.

Then A is a zero measure set.

Challenges

SLIDE 26

Optimal Transport on Graphs

Discrete 2-Wasserstein distance W2;F(ρ0, ρ1) = inf

v (

Z 1 (v, v)ρdt)

1 2

where v and ρ satisfy dρ dt + divG(ρv) = 0, ρ(0, x) = ρ0, ρ(1, x) = ρ1. For any ρ0, ρ1 ∈ P(G), define

SLIDE 27

Vector Operators on Graphs

Vector field on a graph: v = (vij)(i,j)∈E, satisfying vij = −vji Potential Φ induced vector field : rG = (Φi Φj)(i,j)∈E divG(ρv) = −(P

j∈N(i) vijgF ij(ρ))n i=1

Divergence w. r. t. ρ Inner product (v, u)ρ = P

(i,j)∈E vijuijgF ij(ρ)

Here Fi(ρ) =

∂ ∂ρi F(ρ) and

gF

ij (ρ) =

8 > < > : ρi if Fi(ρ) > Fj(ρ), j ∈ N(i); ρj if Fi(ρ) < Fj(ρ), j ∈ N(i);

ρi+ρj 2

if Fi(ρ) = Fj(ρ), j ∈ N(i).

SLIDE 28

Discrete Fokker-Planck Equations

Theorem 1

For a finite graph G = (V , E) and a constant β > 0. The gradient flow of discrete free energy F(ρ) = 1 2

n

X

i=1 n

X

j=1

wijρiρj +

n

X

i=1

viρi + β

n

X

i=1

ρi log ρi

n the metric space (Po(G), W2;F) is

dρi dt = X

j∈N(i)

ρj(Fj(ρ) − Fi(ρ))+ − X

j∈N(i)

ρi(Fi(ρ) − Fj(ρ))+ (1) for any i ∈ V . Here Fi(ρ) =

∂ ∂ρi F(ρ) and (·)+ = max{·, 0}.

ρt = r · (rΨρ) + β∆ρ = r · (r(Ψ + β log ρ)ρ)

Continuous Fokker-Planck equation

SLIDE 29

Fokker-Planck Equation and Exploring Region

∂ρj ∂t = ⇣ X

k∈Nb(j)

(Fk(ρ, σ) Fj(ρ, σ))+ρkdjk X

k∈Nb(j)

(Fj(ρ, σ) Fk(ρ, σ))+ρjdjk ⌘ 1 (∆x)2 ,

In the path planning case, the region is determined by,

where

—————————————- Fi(ρ, σ) = p(i) + σ log ρi.

<latexit sha1_base64="+y49NWe5/l59mS7Cjxckc5F/7Cc=">ACVHicjVFNSwMxEM2uVuv6terRy2ArVNSyWw96UBAE8ahgq9AtSzZN29BsiRZoZT+SD0I/hIvHkzrHtR68EHg8d4M/OSZJxpEwRvjruwWFpaLq94q2vrG5v+1nZLy1wR2iS/WYE05E7RpmOH0MVMUpwmnD8nwauo/PFGlmRT3ZpTRTor7gvUYwcZKsT8/icAIq8SEgmulQY8KrXMYNapAbyCLN+ik+gAvIauwADgsBIi7MC2JWbUOsV8J6sEME/CglRQgdvYf4m6kuSpnUc41rodBpnpjLEyjHA68aJc0wyTIe7TtqUCp1R3xrNQJrBvlS70pLP7jtTv3eMcar1KE1sZYrNQP/2puJfXjs3vbPOmIksN1SQr0G9nIORME0YukxRYvjIEkwUs7sCGWCFibH/4NkQwt8nz5NWox6e1Bt3jcrleRFHGe2iPVRDITpFl+gG3aImIugZvTvIcZxX58NdcEtfpa5T9OygH3A3PgHfMqcm</latexit>

—————————————- p(i) is the distance to the target.

<latexit sha1_base64="+t3A8SHKHadBV9Sr0u9u5dEceR4=">ACSXicjVA9T8MwEHXKd/gqMLJYFKQyUCVlgIGhEgtjkSgNVFxnGtr4TiRfUGqv49FjY2/gMLAwgx4YOQBl4kqXnd3e6dy/KpDoeU9OaWZ2bn5hcldXldWy9vbF6aNcWjyVqb6OmAEpFLRQoITrTANLIglX0e3puH51B9qIVF3gIMwYT0luoIztFKnfHPwT1BKg8ANVCpUDAqpu0uzqtjfpcJQ7AONrVemOFBMiz8y3QOs0U654tW8AnSa+BNSIRM0O+XHIE5ntglXDJj2r6XYThkGgWXMHKD3EDG+C3rQdtSxRIw4bBIYkT3rBLTbqrtsyYL9fvEkCXGDJLIdiYM+Z3bSz+VWvn2D0Oh0JlOYLiX4u6uSyutbHa8zVwlANLGNfCeqW8zTjaMN3bQj+75OnyW95h/W6uf1SuNkEsci2SY7pEp8ckQa5Iw0SYtwck+eySt5cx6cF+fd+fhqLTmTmS3yA6WZT+zNqCo=</latexit>

—————————————- σ = 0 projected gradient, σ > 0 diffusion.

<latexit sha1_base64="0qSzeSLTN3leVBCB+E/RCvptXc0=">ACXicjVDPT8IwGO2mKEzEqQcPXhrBxIOSDQ96UEPixSMm8iNhHRdB5WuXdrOhBD+SW968V+xA0xUPiSJi/ve9/X73thyqjSnvdm2Rubha3tYsnZKe9W9tz9g4SmcSkjQUTshciRjlpK2pZqSXSoKSkJFuOLnP690XIhUV/ElPUzJI0IjTmGKkjTR09cU/ASEMAifgvKIcA2dGgwUHSXo1oM1mErxTLAmERxJFjOM/tX5673BPROM7yRepDt+rVvQXgOvFXpApWaA3d1yASOEvMXMyQUn3fS/VghqSmJG5E2SKpAhP0Ij0DeUoIWowW6Qzh6dGiWAspHlm8YX6vWOGEqWmSWicCdJj9buWi3/V+pmOrwczytNME46XH8UZg1rAPGpzsDSZsKkhCEtqdoV4jCQyOUnlmBD83yevk06j7l/WG4+NavNmFUcRHIMTcAZ8cAWa4AG0QBtg8G4Bq2Q51odsMt2ZWm1rVXPIfgB+gTCxKpew=</latexit>

Distance value is not used, only the projected gradient direction is used,

SLIDE 30

Fokker-Planck Equation and Exploring Region A different view point:

On a generated tree, Fokker-Planck equation selects nodes

(a) The Evolved Region (b) The Generated Graph

SLIDE 31

General Control Problems

For a complete control system (Ω, U, f ) that is completely controllable in time T, with U being a compact set of Rn and f being a Lipschitz function, we let (Ω, g) be a d dimensional compact Riemannian manifold. There exists a distance function d(·, ·) induced by g(·, ·). We want to find u ∈ U[0,T) such that ˙ γ(t) = f (γ(t), u(t)), γ(0) = x0, γ(T) = xf , φ(γ(t)) ≥ 0 for all t ∈ [0, T], ˆ ψ(γ(t), γ, t) ≥ 0 for all t ∈ [0, T], where ˆ ψ(x, t, γ) = ⇢ ψ(x) if d(x, γ(τ)) ≤ R for some τ ≤ t

therwise

SLIDE 32

An Example, Path Planning on a Sphere

SLIDE 33