Model Predictive Path Integral Controller

Source code on Github.

MPPI on Turtlebot3 demo

The MPPI Controller implements a Model Predictive Path Integral Controller. The new Nav2 MPPI Controller is a predictive controller - a successor to TEB and pure path tracking MPC controllers. It uses a sampling based approach to select optimal trajectories, optimizing between successive iterations. It contains plugin-based objective functions for customization and extension for various behaviors and behavioral attributes.

It works currently with Differential, Omnidirectional, and Ackermann robots. This controller is measured to run at 50+ Hz on a modest Intel processor (4th gen i5).

The MPPI algorithm is an MPC variant that finds a control velocity for the robot using an iterative approach. Using the previous time step’s best control solution and the robot’s current state, a set of randomly sampled perturbations from a Gaussian distribution are applied. These noised controls are forward simulated to generate a set of trajectories within the robot’s motion model. Next, these trajectories are scored using a set of plugin-based critic functions to find the best trajectory in the batch. The output scores are used to set the best control with a soft max function. This process is then repeated a number of times and returns a converged solution. This solution is then used as the basis of the next time step’s initial control.

A powerful result of this work is the ability to utilize objective functions which are not require to be convex nor differentiable, providing greater designer flexibility in behavior.

See the package’s README for more complete information.

MPPI Parameters

motion_model:

Type

Default

string

“DiffDrive”

Description

The desired motion model to use for trajectory planning. Options are DiffDrive, Omni, or Ackermann. Differential drive robots may use forward/reverse and angular velocities; Omni add in lateral motion; and Ackermann adds minimum curvature constraints.

critics:

Type

Default

string vector

N/A

Description

A vector of critic plugin functions to use, without mppi::critic:: namespace which will be automatically added on loading.

iteration_count:

Type

Default

int

1

Description

Iteration count in the MPPI algorithm. Recommended to remain as 1 and instead prefer larger batch sizes.

batch_size:

Type

Default

int

1000

Description

Count of randomly sampled candidate trajectories from current optimal control sequence in a given iteration. 1000 @ 50 Hz or 2000 @ 30 Hz seems to produce good results.

time_steps:

Type

Default

int

56

Description

Number of time steps (points) in candidate trajectories

model_dt:

Type

Default

double

0.05

Description

Length of each time step’s dt timestep, in seconds. time_steps * model_dt is the prediction horizon.

vx_std:

Type

Default

double

0.2

Description

Sampling standard deviation for Vx

vy_std:

Type

Default

double

0.2

Description

Sampling standard deviation for Vy

wz_std:

Type

Default

double

0.2

Description

Sampling standard deviation for Wz (angular velocity)

vx_max:

Type

Default

double

0.5

Description

Target maximum forward velocity (m/s).

vy_max:

Type

Default

double

0.5

Description

Target maximum lateral velocity, if using Omni motion model (m/s).

vx_min:

Type

Default

double

-0.35

Description

Maximum reverse velocity (m/s).

wz_max:

Type

Default

double

1.9

Description

Maximum rotational velocity (rad/s).

ax_max:

Type

Default

double

3.0

Description

Maximum forward acceleration (m/s^2).

ay_max:

Type

Default

double

3.0

Description

Maximum lateral acceleration in either direction, if using Omni motion model (m/s^2).

ax_min:

Type

Default

double

-3.0

Description

Maximum deceleration along the X-axis (m/s^2).

az_max:

Type

Default

double

3.5

Description

Maximum angular acceleration (rad/s^2).

temperature:

Type

Default

double

0.3

Description

Selectiveness of trajectories by their costs (The closer this value to 0, the “more” we take in consideration controls with less cost), 0 mean use control with best cost, huge value will lead to just taking mean of all trajectories without cost consideration.

gamma:

Type

Default

double

0.015

Description

A trade-off between smoothness (high) and low energy (low). This is a complex parameter that likely won’t need to be changed from the default. See Section 3D-2 in “Information Theoretic Model Predictive Control: Theory and Applications to Autonomous Driving” for detailed information.

visualize:

Type

Default

bool

false

Description

Whether to publish debugging trajectories for visualization. This can slow down the controller substantially (e.g. 1000 batches of 56 size every 30hz is a lot of data).

retry_attempt_limit:

Type

Default

int

1

Description

Number of attempts to find feasible trajectory on failure for soft-resets before reporting total failure.

reset_period:

Type

Default

double

1.0

Description

Required time of inactivity to reset optimizer (only in Humble due to backport ABI policies).

regenerate_noises:

Type

Default

bool

false

Description

Whether to regenerate noises each iteration or use single noise distribution computed on initialization and reset. Practically, this is found to work fine since the trajectories are being sampled stochastically from a normal distribution and reduces compute jittering at run-time due to thread wake-ups to resample normal distribution.

Trajectory Visualization

trajectory_step:

Type

Default

int

5

Description

The step between trajectories to visualize to downsample candidate trajectory pool.

time_step:

Type

Default

int

3

Description

The step between points on trajectories to visualize to downsample trajectory density.

Path Handler

transform_tolerance:

Type

Default

double

0.1

Description

Time tolerance for data transformations with TF (s).

prune_distance:

Type

Default

double

1.5

Description

Distance ahead of nearest point on path to robot to prune path to (m).

max_robot_pose_search_dist:

Type

Default

double

Costmap size / 2

Description

Max integrated distance ahead of robot pose to search for nearest path point in case of path looping.

enforce_path_inversion:

Type

Default

bool

false

Description

If true, it will prune paths containing cusping points for segments changing directions (e.g. path inversions) such that the controller will be forced to change directions at or very near the planner’s requested inversion point. This is targeting Smac Planner users with feasible paths who need their robots to switch directions where specifically requested.

inversion_xy_tolerance:

Type

Default

double

0.2

Description

Cartesian proximity (m) to path inversion point to be considered “achieved” to pass on the rest of the path after path inversion.

inversion_yaw_tolerance:

Type

Default

double

0.4

Description

Angular proximity (radians) to path inversion point to be considered “achieved” to pass on the rest of the path after path inversion. 0.4 rad = 23 deg.

Ackermann Motion Model

min_turning_r:

Type

Default

double

0.2

Description

The minimum turning radius possible for the vehicle platform (m).

Constraint Critic

This critic penalizes trajectories that have components outside of the set dynamic or kinematic constraints

cost_weight:

Type

Default

double

4.0

Description

Weight to apply to critic term.

cost_power:

Type

Default

int

1

Description

Power order to apply to term.

Goal Angle Critic

This critic incentivizes navigating to achieve the angle of the goal posewhen in reasonable proximity to goal

cost_weight:

Type

Default

double

3.0

Description

Weight to apply to critic term.

cost_power:

Type

Default

int

1

Description

Power order to apply to term.

threshold_to_consider:

Type

Default

double

0.5

Description

Minimal distance (m) between robot and goal above which angle goal cost considered.

Goal Critic

This critic incentivizes navigating spatially towards the goal when in reasonable proximity to goal

cost_weight:

Type

Default

double

5.0

Description

Weight to apply to critic term.

cost_power:

Type

Default

int

1

Description

Power order to apply to term.

threshold_to_consider:

Type

Default

double

1.4

Description

Minimal distance (m) between robot and goal above which goal distance cost considered. It is wise to start with this as being the same as your prediction horizon to have a clean hand-off with the path follower critic.

Obstacles Critic

This critic incentivizes navigating away from obstacles and critical collisions using either a circular robot point-check or full SE2 footprint check using distances from obstacles.

critical_weight:

Type

Default

double

20.0

Description

Weight to apply to critic for near collisions closer than collision_margin_distance to prevent near collisions only as a method of virtually inflating the footprint. This should not be used to generally influence obstacle avoidance away from critical collisions.

repulsion_weight:

Type

Default

double

1.5

Description

Weight to apply to critic for generally preferring routes in lower cost space. This is separated from the critical term to allow for fine tuning of obstacle behaviors with path alignment for dynamic scenes without impacting actions which may directly lead to near-collisions. This is applied within the inflation_radius distance from obstacles.

cost_power:

Type

Default

int

1

Description

Power order to apply to term.

consider_footprint:

Type

Default

bool

false

Description

Whether to use point cost (if robot is circular or low compute power) or compute SE2 footprint cost.

collision_cost:

Type

Default

double

100000.0

Description

Cost to apply to a true collision in a trajectory.

collision_margin_distance:

Type

Default

double

0.10

Description

Margin distance (m) from collision to apply severe penalty, similar to footprint inflation. Between 0.05-0.2 is reasonable. Note that it will highly influence the controller not to enter spaces more confined than this, so ensure this parameter is set lower than the narrowest you expect the robot to need to traverse through.

near_goal_distance:

Type

Default

double

0.50

Description

Distance (m) near goal to stop applying preferential obstacle term to allow robot to smoothly converge to goal pose in close proximity to obstacles.

cost_scaling_factor:

Type

Default

double

10.0

Description

Exponential decay factor across inflation radius. This should be the same as for your inflation layer (Humble only)

inflation_radius:

Type

Default

double

0.55

Description

Radius to inflate costmap around lethal obstacles. This should be the same as for your inflation layer (Humble only)

inflation_layer_name:

Type

Default

string

“”

Description

Name of the inflation layer. If empty, it uses the last inflation layer in the costmap. If you have multiple inflation layers, you may want to specify the name of the layer to use.

Cost Critic

This critic incentivizes navigating away from obstacles and critical collisions using either a circular robot point-check or full SE2 footprint check using the costmap values.

cost_weight:

Type

Default

double

3.81

Description

Weight to apply to critic.

cost_power:

Type

Default

int

1

Description

Power order to apply to term.

consider_footprint:

Type

Default

bool

false

Description

Whether to use point cost (if robot is circular or low compute power) or compute SE2 footprint cost.

collision_cost:

Type

Default

double

1000000.0

Description

Cost to apply to a true collision in a trajectory.

critical_cost:

Type

Default

double

300.0

Description

Cost to apply to a pose with any point in in inflated space to prefer distance from obstacles.

near_goal_distance:

Type

Default

double

0.50

Description

Distance (m) near goal to stop applying preferential obstacle term to allow robot to smoothly converge to goal pose in close proximity to obstacles.

inflation_layer_name:

Type

Default

string

“”

Description

Name of the inflation layer. If empty, it uses the last inflation layer in the costmap. If you have multiple inflation layers, you may want to specify the name of the layer to use.

trajectory_point_step:

Type

Default

int

2

Description

The step to take in trajectories for evaluating them in the critic. Since trajectories are extremely dense, its unnecessary to evaluate each point and computationally expensive.

Path Align Critic

This critic incentivizes aligning with the global path, if relevant. It does not implement path following behavior.

cost_weight:

Type

Default

double

10.0

Description

Weight to apply to critic term.

cost_power:

Type

Default

int

1

Description

Power order to apply to term.

threshold_to_consider:

Type

Default

double

0.5

Description

Distance (m) between robot and goal to stop considering path alignment and allow goal critics to take over.

offset_from_furthest:

Type

Default

int

20

Description

Checks that the candidate trajectories are sufficiently far along their way tracking the path to apply the alignment critic. This ensures that path alignment is only considered when actually tracking the path, preventing awkward initialization motions preventing the robot from leaving the path to achieve the appropriate heading.

max_path_occupancy_ratio:

Type

Default

double

0.07

Description

Maximum proportion of the path that can be occupied before this critic is not considered to allow the obstacle and path follow critics to avoid obstacles while following the path’s intent in presence of dynamic objects in the scene. Between 0-1 for 0-100%.

use_path_orientations:

Type

Default

bool

false

Description

Whether to consider path’s orientations in path alignment, which can be useful when paired with feasible smac planners to incentivize directional changes only where/when the smac planner requests them. If you want the robot to deviate and invert directions where the controller sees fit, keep as false. If your plans do not contain orientation information (e.g. navfn), keep as false.

trajectory_point_step:

Type

Default

int

4

Description

The step to take in trajectories for evaluating them in the critic. Since trajectories are extremely dense, its unnecessary to evaluate each point and computationally expensive.

Path Angle Critic

This critic penalizes trajectories at a high relative angle to the path. This helps the robot make sharp turns when necessary due to large accumulated angular errors.

cost_weight:

Type

Default

double

2.2

Description

Weight to apply to critic term.

cost_power:

Type

Default

int

1

Description

Power order to apply to term.

threshold_to_consider:

Type

Default

double

0.5

Description

Distance (m) between robot and goal to stop considering path angles and allow goal critics to take over.

offset_from_furthest:

Type

Default

int

20

Description

Number of path points after furthest one any trajectory achieves to compute path angle relative to.

max_angle_to_furthest:

Type

Default

double

0.785398

Description

Angular distance (rad) between robot and goal above which path angle cost starts being considered

mode:

Type

Default

int

0

Description

Enum type for mode of operations for the path angle critic depending on path input types and behavioral desires. 0: Forward Preference, penalizes high path angles relative to the robot’s orientation to incentivize turning towards the path. 1: No directional preference, penalizes high path angles relative to the robot’s orientation or mirrored orientation (e.g. reverse), which ever is less, when a particular direction of travel is not preferable. 2: Consider feasible path orientation, when using a feasible path whereas the path points have orientation information (e.g. Smac Planners), consider the path’s requested direction of travel to penalize path angles such that the robot will follow the path in the requested direction.

Path Follow Critic

This critic incentivizes making progress along the path. This is what drives the robot forward along the path.

cost_weight:

Type

Default

double

5.0

Description

Weight to apply to critic term.

cost_power:

Type

Default

int

1

Description

Power order to apply to term.

threshold_to_consider:

Type

Default

double

1.4

Description

Distance (m) between robot and goal to stop considering path following and allow goal critics to take over. It is wise to start with this as being the same as your prediction horizon to have a clean hand-off with the goal critic.

offset_from_furthest:

Type

Default

int

6

Description

Number of path points after furthest one any trajectory achieves to drive path tracking relative to.

Prefer Forward Critic

This critic incentivizes moving in the forward direction, rather than reversing.

cost_weight:

Type

Default

double

5.0

Description

Weight to apply to critic term.

cost_power:

Type

Default

int

1

Description

Power order to apply to term.

threshold_to_consider:

Type

Default

double

0.5

Description

Distance (m) between robot and goal to stop considering preferring forward and allow goal critics to take over.

Twirling Critic

This critic penalizes unnecessary ‘twisting’ with holonomic vehicles. It adds a constraint on the rotation angle to keep it consistent.

cost_weight:

Type

Default

double

10.0

Description

Weight to apply to critic term.

cost_power:

Type

Default

int

1

Description

Power order to apply to term.

Velocity Deadband Critic

This critic penalizes velocities that fall below the deadband threshold, helping to mitigate hardware limitations on certain platforms.

cost_weight:

Type

Default

double

35.0

Description

Weight to apply to critic term.

cost_power:

Type

Default

int

1

Description

Power order to apply to term.

deadband_velocities:

Type

Default

array of double

[0.05, 0.05, 0.05]

Description

The array of deadband velocities [vx, vz, wz]. A zero array indicates that the critic will take no action.

Example

controller_server:
  ros__parameters:
    controller_frequency: 30.0
    FollowPath:
      plugin: "nav2_mppi_controller::MPPIController"
      time_steps: 56
      model_dt: 0.05
      batch_size: 2000
      vx_std: 0.2
      vy_std: 0.2
      wz_std: 0.4
      vx_max: 0.5
      vx_min: -0.35
      vy_max: 0.5
      wz_max: 1.9
      ax_max: 3.0
      ax_min: -3.0
      ay_max: 3.0
      az_max: 3.5
      iteration_count: 1
      prune_distance: 1.7
      transform_tolerance: 0.1
      temperature: 0.3
      gamma: 0.015
      motion_model: "DiffDrive"
      visualize: false
      reset_period: 1.0 # (only in Humble)
      regenerate_noises: false
      TrajectoryVisualizer:
        trajectory_step: 5
        time_step: 3
      AckermannConstraints:
        min_turning_r: 0.2
      critics: ["ConstraintCritic", "CostCritic", "GoalCritic", "GoalAngleCritic", "PathAlignCritic", "PathFollowCritic", "PathAngleCritic", "PreferForwardCritic"]
      ConstraintCritic:
        enabled: true
        cost_power: 1
        cost_weight: 4.0
      GoalCritic:
        enabled: true
        cost_power: 1
        cost_weight: 5.0
        threshold_to_consider: 1.4
      GoalAngleCritic:
        enabled: true
        cost_power: 1
        cost_weight: 3.0
        threshold_to_consider: 0.5
      PreferForwardCritic:
        enabled: true
        cost_power: 1
        cost_weight: 5.0
        threshold_to_consider: 0.5
      # ObstaclesCritic:
      #   enabled: true
      #   cost_power: 1
      #   repulsion_weight: 1.5
      #   critical_weight: 20.0
      #   consider_footprint: false
      #   collision_cost: 10000.0
      #   collision_margin_distance: 0.1
      #   near_goal_distance: 0.5
      #   inflation_radius: 0.55 # (only in Humble)
      #   cost_scaling_factor: 10.0 # (only in Humble)
      CostCritic:
        enabled: true
        cost_power: 1
        cost_weight: 3.81
        critical_cost: 300.0
        consider_footprint: true
        collision_cost: 1000000.0
        near_goal_distance: 1.0
        trajectory_point_step: 2
      PathAlignCritic:
        enabled: true
        cost_power: 1
        cost_weight: 14.0
        max_path_occupancy_ratio: 0.05
        trajectory_point_step: 4
        threshold_to_consider: 0.5
        offset_from_furthest: 20
        use_path_orientations: false
      PathFollowCritic:
        enabled: true
        cost_power: 1
        cost_weight: 5.0
        offset_from_furthest: 5
        threshold_to_consider: 1.4
      PathAngleCritic:
        enabled: true
        cost_power: 1
        cost_weight: 2.0
        offset_from_furthest: 4
        threshold_to_consider: 0.5
        max_angle_to_furthest: 1.0
        mode: 0
      # VelocityDeadbandCritic:
      #   enabled: true
      #   cost_power: 1
      #   cost_weight: 35.0
      #   deadband_velocities: [0.05, 0.05, 0.05]
      # TwirlingCritic:
      #   enabled: true
      #   twirling_cost_power: 1
      #   twirling_cost_weight: 10.0

Notes to Users

General Words of Wisdom

The model_dt parameter generally should be set to the duration of your control frequency. So if your control frequency is 20hz, this should be 0.05. However, you may also set it lower but not larger.

Visualization of the trajectories using visualize uses compute resources to back out trajectories for visualization and therefore slows compute time. It is not suggested that this parameter is set to true during a deployed use, but is a useful debug instrument while tuning the system, but use sparingly. Visualizing 2000 batches @ 56 points at 30 hz is a lot.

The most common parameters you might want to start off changing are the velocity profiles (vx_max, vx_min, wz_max, and vy_max if holonomic) and the motion_model to correspond to your vehicle. Its wise to consider the prune_distance of the path plan in proportion to your maximum velocity and prediction horizon. The only deeper parameter that will likely need to be adjusted for your particular settings is the Obstacle critics’ repulsion_weight since the tuning of this is proprtional to your inflation layer’s radius. Higher radii should correspond to reduced repulsion_weight due to the penalty formation (e.g. inflation_radius - min_dist_to_obstacle). If this penalty is too high, the robot will slow significantly when entering cost-space from non-cost space or jitter in narrow corridors. It is noteworthy, but likely not necessary to be changed, that the Obstacle critic may use the full footprint information if consider_footprint = true, though comes at an increased compute cost.

Otherwise, the parameters have been closely pre-tuned by your friendly neighborhood navigator to give you a decent starting point that hopefully you only need to retune for your specific desired behavior lightly (if at all). Varying costmap parameters or maximum speeds are the actions which require the most attention, as described below:

Prediction Horizon, Costmap Sizing, and Offsets

As this is a predictive planner, there is some relationship between maximum speed, prediction times, and costmap size that users should keep in mind while tuning for their application. If a controller server costmap is set to 3.0m in size, that means that with the robot in the center, there is 1.5m of information on either side of the robot. When your prediction horizon (time_steps * model_dt) at maximum speed (vx_max) is larger than this, then your robot will be artificially limited in its maximum speeds and behavior by the costmap limitation. For example, if you predict forward 3 seconds (60 steps @ 0.05s per step) at 0.5m/s maximum speed, the minimum required costmap radius is 1.5m - or 3m total width.

The same applies to the Path Follow and Align offsets from furthest. In the same example if the furthest point we can consider is already at the edge of the costmap, then further offsets are thresholded because they’re unusable. So its important while selecting these parameters to make sure that the theoretical offsets can exist on the costmap settings selected with the maximum prediction horizon and velocities desired. Setting the threshold for consideration in the path follower + goal critics as the same as your prediction horizon can make sure you have clean hand-offs between them, as the path follower will otherwise attempt to slow slightly once it reaches the final goal pose as its marker.

The Path Follow critic cannot drive velocities greater than the projectable distance of that velocity on the available path on the rolling costmap. The Path Align critic offset_from_furthest represents the number of path points a trajectory passes through while tracking the path. If this is set either absurdly low (e.g. 5) it can trigger when a robot is simply trying to start path tracking causing some suboptimal behaviors and local minima while starting a task. If it is set absurdly high (e.g. 50) relative to the path resolution and costmap size, then the critic may never trigger or only do so when at full-speed. A balance here is wise. A selection of this value to be ~30% of the maximum velocity distance projected is good (e.g. if a planner produces points every 2.5cm, 60 can fit on the 1.5m local costmap radius. If the max speed is 0.5m/s with a 3s prediction time, then 20 points represents 33% of the maximum speed projected over the prediction horizon onto the path). When in doubt, prediction_horizon_s * max_speed / path_resolution / 3.0 is a good baseline.

Obstacle, Inflation Layer, and Path Following

There also exists a relationship between the costmap configurations and the Obstacle critic configurations. If the Obstacle critic is not well tuned with the costmap parameters (inflation radius, scale) it can cause the robot to wobble significantly as it attempts to take finitely lower-cost trajectories with a slightly lower cost in exchange for jerky motion. It may also perform awkward maneuvers when in free-space to try to maximize time in a small pocket of 0-cost over a more natural motion which involves moving into some low-costed region. Finally, it may generally refuse to go into costed space at all when starting in a free 0-cost space if the gain is set disproportionately higher than the Path Follow scoring to encourage the robot to move along the path. This is due to the critic cost of staying in free space becoming more attractive than entering even lightly costed space in exchange for progression along the task.

Thus, care should be taken to select weights of the obstacle critic in conjunction with the costmap inflation radius and scale so that a robot does not have such issues. How I (Steve, your friendly neighborhood navigator) tuned this was to first create the appropriate obstacle critic behavior desirable in conjunction with the inflation layer parameters. Its worth noting that the Obstacle critic converts the cost into a distance from obstacles, so the nature of the distribution of costs in the inflation isn’t overly significant. However, the inflation radius and the scale will define the cost at the end of the distribution where free-space meets the lowest cost value within the radius. So testing for quality behavior when going over that threshold should be considered.

As you increase or decrease your weights on the Obstacle, you may notice the aforementioned behaviors (e.g. won’t overcome free to non-free threshold). To overcome them, increase the FollowPath critic cost to increase the desire for the trajectory planner to continue moving towards the goal. Make sure to not overshoot this though, keep them balanced. A desirable outcome is smooth motion roughly in the center of spaces without significant close interactions with obstacles. It shouldn’t be perfectly following a path yet nor should the output velocity be wobbling jaggedly.

Once you have your obstacle avoidance behavior tuned and matched with an appropriate path following penalty, tune the Path Align critic to align with the path. If you design exact-path-alignment behavior, its possible to skip the obstacle critic step as highly tuning the system to follow the path will give it less ability to deviate to avoid obstacles (though it’ll slow and stop). Tuning the critic weight for the Obstacle critic high will do the job to avoid near-collisions but the repulsion weight is largely unnecessary to you. For others wanting more dynamic behavior, it can be beneficial to slowly lower the weight on the obstacle critic to give the path alignment critic some more room to work. If your path was generated with a cost-aware planner (like all provided by Nav2) and providing paths sufficiently far from obstacles for your satisfaction, the impact of a slightly reduced Obstacle critic with a Path Alignment critic will do you well. Not over-weighting the path align critic will allow the robot to deviate from the path to get around dynamic obstacles in the scene or other obstacles not previous considered during path planning. It is subjective as to the best behavior for your application, but it has been shown that MPPI can be an exact path tracker and/or avoid dynamic obstacles very fluidly and everywhere in between. The defaults provided are in the generally right regime for a balanced initial trade-off.