CSC477 Introduction to Mobile Robotics

Week #4: Discrete Planning in Known Environments

Florian Shkurti

Today’s agenda

Dijkstra’s Planning Algorithm
A* Planning Algorithm
Sampling-Based Planners
- Rapidly-exploring Random Trees (RRT)
- Probabilistic Roadmaps (PRM)

Planning

So far we have been trying to compute state-dependent feedback controllers u(x) = Kx

A plan is usually “open-loop,” in the sense that it is assumed that, once computed, it can be executed perfectly.
This is not realistic because wind, drag, friction, and other external forces and unknown factors make the system diverge from the planned trajectory.

Planning does not usually take external disturbances into account.
(External, independent feedback controllers have to make sure the robot is following the path closely)

Why Bother Planning?

Sense-Plan-Act Paradigm: Planning Is Necessary

Subsumption Architecture: Planning Is Not Necessary

He means: why bother estimating state and planning? It’s too much work and could be error-prone. Why not only have a hierarchy of reactive behaviors/controllers?

One possibility: instead of u(state)=…, use u(sensory observation)=…

Planning as graph search

Graph nodes represent discrete states
Edges represent transitions/actions
Edges have weights
Potential queries:
- Shortest path from node a to node b that does not hit obstacles
- Is b reachable from a?

Typical assumptions:
- Current state is known
- Map is known
- Map is mostly static
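
The reachability query above ("Is b reachable from a?") can be sketched with a breadth-first search over a grid graph. This is a minimal illustration, not a prescribed implementation: the occupancy grid `GRID`, the 4-connected `neighbors` helper, and the `reachable` function are all hypothetical names chosen for the example, and the start cell is assumed to be free.

```python
from collections import deque

# Hypothetical occupancy grid: '#' is an obstacle cell, '.' is a free cell.
GRID = [
    "..#.",
    "..#.",
    "....",
]

def neighbors(cell, grid):
    """Yield the free 4-connected neighbors of a (row, col) cell."""
    r, c = cell
    for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1)):
        nr, nc = r + dr, c + dc
        if 0 <= nr < len(grid) and 0 <= nc < len(grid[0]) and grid[nr][nc] != "#":
            yield (nr, nc)

def reachable(a, b, grid):
    """Breadth-first search: return True if node b can be reached from node a."""
    seen = {a}
    frontier = deque([a])
    while frontier:
        v = frontier.popleft()
        if v == b:
            return True
        for u in neighbors(v, grid):
            if u not in seen:
                seen.add(u)
                frontier.append(u)
    return False

print(reachable((0, 0), (0, 3), GRID))
```

Here every edge has the same weight, so the graph is implicit in the grid; a weighted graph would need an explicit edge-cost table instead.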

Dynamic Programming

\[ D(v) = \min_{u \in N(v)} [d(v, u) + D(u)] \]

\[ D(v_{\text{dest}}) = 0 \]

Note: this should remind you of the LQR cost-to-go update

\[ \begin{aligned} J_{t+1}(\mathbf{x}) &= \min_{\mathbf{u}} [g(\mathbf{x}, \mathbf{u}) + J_t(A\mathbf{x} + B\mathbf{u})] \\ J_0(\mathbf{x}) &= \mathbf{x}^T Q\mathbf{x} \end{aligned} \]

Worst-case complexity: \(O(|V|^2)\)
In a 2D grid world: \(O(|V|)\)
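
The recursion \(D(v) = \min_{u \in N(v)} [d(v, u) + D(u)]\) with \(D(v_{\text{dest}}) = 0\) can be sketched as repeated sweeps over the nodes until no cost-to-go changes. This is one simple way to evaluate the recursion, assuming non-negative edge costs; the example graph, the node names, and the function `cost_to_go` are hypothetical.

```python
import math

# Hypothetical weighted graph: graph[v] maps each neighbor u to the edge cost d(v, u).
graph = {
    "a": {"b": 1.0, "c": 4.0},
    "b": {"c": 2.0, "dest": 6.0},
    "c": {"dest": 1.0},
    "dest": {},
}

def cost_to_go(graph, dest):
    """Apply D(v) = min_u [d(v, u) + D(u)] in sweeps until no value changes."""
    D = {v: math.inf for v in graph}
    D[dest] = 0.0  # boundary condition D(v_dest) = 0
    changed = True
    while changed:
        changed = False
        for v, edges in graph.items():
            if v == dest:
                continue
            best = min((d + D[u] for u, d in edges.items()), default=math.inf)
            if best < D[v]:
                D[v] = best
                changed = True
    return D

D = cost_to_go(graph, "dest")
print(D)
```

Once \(D\) is known, the optimal action at any node \(v\) is to move to the neighbor \(u\) that attains the minimum of \(d(v, u) + D(u)\).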

Dijkstra’s algorithm: example runs

Dijkstra’s algorithm

Let \(D(v)\) denote the length of the optimal path from the source node to node \(v\) (i.e. cost-to-come, not cost-to-go like before)
Set \(D(v) = \infty\) for all nodes except the source: \(D(v_{\text{src}}) = 0\)
Add all nodes to priority queue Q with cost-to-come as priority
While Q is not empty:
- Extract the node \(v\) with minimum cost-to-come from the queue Q
- If \(v\) is the goal then done
- Remove \(v\) from the queue
  The cost-to-come of \(v\) is final at this point. Need to check if we can reduce the cost-to-come of its neighbors.
- For \(u\) in neighborhood of \(v\):
  - If \(D(v) + d(v, u) < D(u)\) then
    - Set \(D(u) = D(v) + d(v, u)\) and update the priority of \(u\) in Q to match

With a Fibonacci heap as the priority queue, the running time is

\(O(|E|T_{\text{update priority}} + |V|T_{\text{remove min}}) = O(|E| + |V|\log|V|)\)
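
The steps of Dijkstra's algorithm can be sketched in Python as follows. Note the substitution: this sketch uses the standard-library `heapq` binary heap with lazy deletion (stale queue entries are skipped when popped) instead of a Fibonacci heap with an update-priority operation, so its running time is on the order of \(O(|E|\log|V|)\) rather than the bound above. The example graph and the function name `dijkstra` are hypothetical.

```python
import heapq
import math

# Hypothetical weighted graph: graph[v] maps each neighbor u to the edge cost d(v, u).
# Node names must be mutually comparable so that heap ties can be broken.
graph = {
    "a": {"b": 1.0, "c": 4.0},
    "b": {"c": 2.0, "dest": 6.0},
    "c": {"dest": 1.0},
    "dest": {},
}

def dijkstra(graph, src, goal):
    """Return (cost-to-come of goal, path from src to goal), or (inf, []) if unreachable."""
    D = {v: math.inf for v in graph}
    D[src] = 0.0
    parent = {src: None}
    queue = [(0.0, src)]
    finalized = set()
    while queue:
        cost, v = heapq.heappop(queue)
        if v in finalized:
            continue  # stale entry left behind by an earlier priority update
        finalized.add(v)  # the cost-to-come of v is final at this point
        if v == goal:
            path = []
            while v is not None:
                path.append(v)
                v = parent[v]
            return cost, path[::-1]
        for u, d in graph[v].items():
            if cost + d < D[u]:
                D[u] = cost + d
                parent[u] = v
                heapq.heappush(queue, (D[u], u))  # push instead of update-priority
    return math.inf, []

print(dijkstra(graph, "a", "dest"))
```

Unlike the slide's version, only the source is placed in the queue initially; nodes enter the queue when they are first reached, which is equivalent because unreached nodes have infinite priority.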

Dijkstra’s algorithm: example runs

Many nodes are explored unnecessarily. We are sure that they are not going to be part of the solution.

A* Search: Main Idea

Modifies Dijkstra’s algorithm to be more efficient
Expands fewer nodes than Dijkstra’s by using a heuristic
While Dijkstra prioritizes nodes based on cost-to-come, A* prioritizes them based on:
\(f(v) = g(v) + h(v)\), i.e. cost-to-come to \(v\) + lower bound on cost-to-go from \(v\) to \(v_{\text{dest}}\)

Optimistic search: explore the node with the smallest \(f(v)\) next

\(f(v)\) is a lower bound on the cost of a path from source to destination that passes through \(v\)

h() is called a heuristic. h() must be admissible, i.e. it must never overestimate the cost-to-go from \(v\) to the destination. h() must also be monotonic (consistent), i.e. satisfy the triangle inequality \(h(v) \le d(v, u) + h(u)\) for every edge from \(v\) to \(u\).
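
As an illustration of these two conditions, the sketch below defines two common grid heuristics and checks the triangle inequality edge by edge on a small 4-connected grid with unit edge costs. The grid size, the edge list, and the helper names `manhattan`, `euclidean`, and `is_consistent` are assumptions made for this example.

```python
import math

def manhattan(v, dest):
    """Sum of row and column offsets; a lower bound on 4-connected unit-cost grids."""
    return abs(v[0] - dest[0]) + abs(v[1] - dest[1])

def euclidean(v, dest):
    """Straight-line distance; a lower bound when edge costs are at least Euclidean lengths."""
    return math.hypot(v[0] - dest[0], v[1] - dest[1])

def is_consistent(h, edges, dest):
    """Check h(dest) == 0 and h(v) <= d(v, u) + h(u) on every edge (v, u, d)."""
    return h(dest, dest) == 0 and all(
        h(v, dest) <= d + h(u, dest) + 1e-12 for v, u, d in edges
    )

# Hypothetical 3x3 grid, 4-connected, every edge has cost 1.
edges = [
    ((r, c), (r + dr, c + dc), 1.0)
    for r in range(3)
    for c in range(3)
    for dr, dc in ((1, 0), (-1, 0), (0, 1), (0, -1))
    if 0 <= r + dr < 3 and 0 <= c + dc < 3
]
dest = (2, 2)

def inflated(v, d):
    """Twice the Manhattan distance: overestimates, so it is not admissible."""
    return 2 * manhattan(v, d)

print(is_consistent(manhattan, edges, dest))
print(is_consistent(euclidean, edges, dest))
print(is_consistent(inflated, edges, dest))
```

A heuristic that is consistent and zero at the destination is also admissible, which is why checking the triangle inequality on every edge is enough here.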


A* Search

Set \(g(v) = \infty\) for all nodes except the source: \(g(v_{\text{src}}) = 0\)
Set \(f(v) = \infty\) for all nodes except the source: \(f(v_{\text{src}}) = h(v_{\text{src}})\)
Add \(v_{\text{src}}\) to priority queue Q with priority \(f(v_{\text{src}})\)
While Q is not empty:
- Extract the node \(v\) with minimum \(f(v)\) from the queue Q
- If found goal then done. Follow the parent pointers from \(v\) to get the path.
- Remove \(v\) from the queue Q
- explored(\(v\)) = true
- For \(u\) in neighborhood of \(v\), if not explored(\(u\)):
  - If \(u\) not in Q then
    - Add \(u\) to Q with cost-to-come \(g(u) = g(v) + d(v, u)\) and priority \(f(u) = g(u) + h(u)\)
    - Set the parent of \(u\) to be \(v\)
  - Else if \(g(v) + d(v, u) < g(u)\)
    - Update the cost-to-come and the priority of \(u\) in Q
    - Set the parent of \(u\) to be \(v\)
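
The A* steps above can be sketched on a 4-connected grid with unit step costs and the Manhattan distance as \(h\). As assumptions for this example: the grid encoding ('#' for obstacles), the function name `astar`, and the use of `heapq` with lazy deletion in place of an explicit priority update are all choices made here, not part of the algorithm statement.

```python
import heapq

def astar(grid, src, dest):
    """A* on a 4-connected grid of strings; returns a list of (row, col) cells or []."""
    def h(v):
        # Manhattan distance: admissible and consistent for unit-cost 4-connected moves.
        return abs(v[0] - dest[0]) + abs(v[1] - dest[1])

    g = {src: 0}                    # best known cost-to-come
    parent = {src: None}
    queue = [(h(src), src)]         # entries are (f, node) with f = g + h
    explored = set()
    while queue:
        _, v = heapq.heappop(queue)
        if v in explored:
            continue                # stale entry from an earlier, worse priority
        if v == dest:
            path = []
            while v is not None:    # follow the parent pointers back to the source
                path.append(v)
                v = parent[v]
            return path[::-1]
        explored.add(v)
        r, c = v
        for u in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if not (0 <= u[0] < len(grid) and 0 <= u[1] < len(grid[0])):
                continue
            if grid[u[0]][u[1]] == "#" or u in explored:
                continue
            new_g = g[v] + 1        # d(v, u) = 1 for every grid move
            if new_g < g.get(u, float("inf")):
                g[u] = new_g
                parent[u] = v
                heapq.heappush(queue, (new_g + h(u), u))
    return []

GRID = [
    "..#.",
    "..#.",
    "....",
]
print(astar(GRID, (0, 0), (0, 3)))
```

Setting `h` to return 0 everywhere turns this sketch back into Dijkstra's algorithm, which makes the role of the heuristic easy to test in isolation.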

Dijkstra vs A*

A* for cars

Configuration Space

Idea: dilate obstacles to account for the ways the robot can collide with them.

Why? Instead of planning in the work space and checking whether the robot’s body collides with obstacles, plan in configuration space where you can treat the robot as a point because the obstacles are dilated.

This idea is typically not used for robots with high-dimensional states.

Configuration Space

How do we dilate obstacles?

Minkowski Sum

\(P \oplus Q = \{p + q \mid p \in P, q \in Q\}\)
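
For finite point sets the definition translates directly into code. The sketch below dilates a small obstacle on an integer grid by a robot footprint; it assumes a robot that only translates (no rotation) and a footprint that is symmetric about its reference point. For an asymmetric footprint, the set used for dilation would be the footprint reflected through the reference point. The names `minkowski_sum`, `obstacle`, and `robot` are hypothetical.

```python
def minkowski_sum(P, Q):
    """Return {p + q | p in P, q in Q} for finite sets of 2D integer points."""
    return {(p[0] + q[0], p[1] + q[1]) for p in P for q in Q}

# Hypothetical obstacle occupying two grid cells.
obstacle = {(5, 5), (5, 6)}

# Hypothetical plus-shaped robot footprint, given relative to its reference point (0, 0).
robot = {(0, 0), (1, 0), (-1, 0), (0, 1), (0, -1)}

# Cells the robot's reference point must avoid.
dilated = minkowski_sum(obstacle, robot)
print(sorted(dilated))
```

After dilation, a planner can treat the robot as a single point and test only whether that point lies in `dilated`.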

Drawbacks of grid-based planners

Grid-based planning works well for grids of up to 3-4 dimensions
State-space discretization suffers from combinatorial explosion:
If the state is \(x = [x_1, \ldots, x_D]\) and we split each dimension into \(N\) bins then we will have \(N^D\) nodes in the graph.
This is not practical for planning paths for robot arms with multiple joints, or other high-dimensional systems.
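
To make the \(N^D\) growth concrete, the short sketch below counts grid nodes for a few dimensions. The choice of 100 bins per dimension and the function name `grid_nodes` are illustrative assumptions.

```python
def grid_nodes(bins_per_dim, dims):
    """Number of nodes when each of `dims` dimensions is split into `bins_per_dim` bins."""
    return bins_per_dim ** dims

# Compare a planar position, a 3D state, and a hypothetical 6-joint arm.
for dims in (2, 3, 6):
    print(dims, grid_nodes(100, dims))
```

With 100 bins per dimension, a 2D grid has \(10^4\) nodes, while a 6-dimensional state already needs \(10^{12}\).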

Sampling the state-space

Need to find ways to reduce the continuous domain into a sparse representation: graphs, trees etc.
Today:
- Rapidly-exploring Random Tree (RRT)
- Probabilistic RoadMap (PRM)
- Visibility Planning
- Smoothing Planned Paths

RRT

Main idea: maintain a tree of reachable configurations from the root

Main steps:

Sample random state
Find the closest state (node) already in the tree
Steer the closest node towards the random state
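
The three steps listed above can be sketched for a 2D point robot as follows. This is a minimal sketch under several assumptions that are not in the slides: uniform sampling inside rectangular `bounds`, a linear-scan nearest-neighbor search, a fixed maximum `step` when steering, a collision check on the new node only (not on the connecting edge), and termination when a node lands within `goal_tol` of the goal. All function and parameter names are hypothetical.

```python
import math
import random

def rrt(start, goal, is_free, bounds, step=0.5, goal_tol=0.5, max_iters=5000, seed=0):
    """Grow a tree from start; return a path ending within goal_tol of goal, or []."""
    rng = random.Random(seed)
    (xmin, xmax), (ymin, ymax) = bounds
    parent = {start: None}  # tree stored as child -> parent
    for _ in range(max_iters):
        # 1. Sample a random state.
        x_rand = (rng.uniform(xmin, xmax), rng.uniform(ymin, ymax))
        # 2. Find the closest node already in the tree (linear scan).
        x_near = min(parent, key=lambda n: math.dist(n, x_rand))
        # 3. Steer from the closest node towards the sample, moving at most `step`.
        d = math.dist(x_near, x_rand)
        if d == 0.0:
            continue
        t = min(1.0, step / d)
        x_new = (x_near[0] + t * (x_rand[0] - x_near[0]),
                 x_near[1] + t * (x_rand[1] - x_near[1]))
        # Assumption: only the new node is collision-checked, not the whole edge.
        if x_new in parent or not is_free(x_new):
            continue
        parent[x_new] = x_near
        if math.dist(x_new, goal) <= goal_tol:
            path, n = [], x_new
            while n is not None:
                path.append(n)
                n = parent[n]
            return path[::-1]
    return []

def is_free(p):
    """Hypothetical world: one circular obstacle of radius 2 centered at (5, 5)."""
    return math.dist(p, (5.0, 5.0)) > 2.0

path = rrt((1.0, 1.0), (9.0, 9.0), is_free, ((0.0, 10.0), (0.0, 10.0)))
print(len(path))
```

The returned path is feasible at its nodes but generally far from shortest, and a practical version would also check each edge for collisions.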