Full article: Model predictive control of legged and humanoid robots: models and algorithms

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

Model predictive control (MPC) of legged and humanoid robotic systems has been an active research topic in the past decade. While MPC for robotic systems has a long history, its paradigm such as problem formulations and algorithms has changed along with the recent drastic progress in robot hardware, computational processors, and algorithms. This survey paper reviews recent progress on MPC for legged and humanoid robotics from the following three points of view. First, we review a variety of dynamical models of robotic systems used in the MPC formulation. Second, we give an overview of MPC algorithms, particularly focusing on suitable ones for robotic problems. Finally, we introduce methods and applications of MPC for practical robotic problems from MPC based on reduced-order models to recent progress on MPC based on whole-body models.

GRAPHICAL ABSTRACT

KEYWORDS:

1. Introduction

1.1. Growing utilization of MPC in legged and humanoid robotics

Legged and humanoid robots have seen considerable technological advances in the past two decades. Two major factors have contributed to the development of legged robot research: hardware and control technology. For quadruped robots, hardware platforms such as LittleDog [Citation1], have facilitated research on rough terrain locomotion. Recently, more advanced robots, such as HyQ [Citation2], ANYmal [Citation3], MIT Cheetah [Citation4], and Mini Cheetah [Citation5], have been developed by different research institutes. For bipedal and humanoid robots, many significant research achievements using hardware platforms, such as the HRP series [Citation6–8], TORO [Citation9], Atlas [Citation10], TALOS [Citation11], E2-DR [Citation12], and Cassie [Citation13], have been reported in the literature.

Advanced control technology is crucial for drawing maximum performance from robotic hardware. However, the traditional methodology based on offline motion planning and linear control is incapable of sufficiently harnessing hardware performance and responding swiftly to a dynamically changing environment. Model predictive control (MPC), on the other hand, enables trajectory generation in nonlinear domain and full utilization of kinematic movable range and torque limits. Moreover, frequent replanning of motion reflecting the current state makes the robot more robust against disturbances. Therefore, the application of MPC to legged robotic systems has been a subject of active research in the past decade. Notably, 19 out of 28 workshop papers presented during the legged robotics workshop in ICRA2022 included the keywords ‘model predictive’ or ‘receding horizon’, of which 9 were directly focused on MPC [Citation14].

Table summarizes legged robotic systems reported in the literature that use MPC to realize certain functions. The columns of the table detail, from left to right, the year of publication, the dynamical model, the problem formulation and/or the solution method, the parameter setting, and the name of the robot. Cells containing information that could not be found in the original paper are left empty. The definitions of abbreviations are listed at the bottom of the table. In early research of legged robotics, MPC was been used for online walking pattern generation of bipedal and humanoid robots to improve responsiveness to disturbances and user commands. Although not all methods proposed during this period are referred to as MPC, we included them in the table as they are based on a similar concept. Following the DARPA Robotics Challenge held from 2013 to 2015, more studies that attempt to use MPC for whole-body control have been published. In recent years, the application of MPC to both biped and quadruped robots has seen a dramatic increase, where MPC is used for online trajectory generation as well as for whole-body control. In a broad sense, the term ‘online’ means that the planned trajectory is updated at a certain frequency inside of a feedback loop. The actual update frequency for MPC varies considerably across robotic systems and ranges from 10 Hz to 1 kHz, as shown in Table . In exceptional cases, the update is synchronized with walking steps [Citation20,Citation24,Citation42].

1.2. Focus and organization of this survey

The objective of this survey paper is to provide an extensive review of recent progress in the theory, implementation, and application of model predictive control techniques to legged and humanoid robotics. Several well-written survey papers [Citation49–52] and handbook articles [Citation53–55] on motion planning and control of legged and humanoid robots exist in the literature. Among them, [Citation54] surveyed MPC for adaptive walking pattern generation and [Citation55] discussed multi-contact MPC with a predetermined contact sequence. However, as mentioned previously, the application of MPC to legged and humanoid robots has increased rapidly ever since the publication of these surveys, and several important theoretical and technical developments have not been addressed in the existing surveys. The original contribution of this paper is twofold. First, we reviewed numerical algorithms of MPC that are particularly efficient for robotic problems, which is often discussed in research on MPC robotic applications but has not been extensively reviewed yet in the aforementioned surveys. Second, we provide an extensive survey of the recent developments in MPC for legged and humanoid robots including the most recent topics such as contact-implicit and whole-body MPC. The target period for this survey is approximately two decades, from the early 2000s to the present. To ensure reliability, we have avoided citing papers that have appeared only on pre-print servers such as arXiv.

Table 1. Summary of legged robot systems that utilizes MPC.

Display Table

The organization of the rest of this survey is as follows. In Section 2, a brief review of dynamical systems of legged robots is provided. In Section 3, the theory and algorithms for MPC are reviewed from a general perspective. In Section 4, MPC methods specific to legged and humanoid robots are reviewed. Section 5 presents concluding remarks and future prospects.

2. Dynamical system models of legged robots

2.1. Dynamical models with different levels of detail

Because legged robots have many degrees of freedom, a mathematical model that fully describes its dynamic and kinematic characteristics can be very high-dimensional and complex. It is, therefore, unrealistic to rely entirely on such complex models for implementing MPC with a long prediction horizon, for which real-time update is a strong requirement. For capturing and utilizing the dynamics of legged robots in different levels of detail, mathematical models with various degrees of abstraction and dimensionality have been proposed. The complexity of a model strongly influences the computational complexity of MPC algorithms and the variety of motions that can be generated. In the following, models widely used for MPC are briefly reviewed in the complex-to-simple order.

2.2. Whole-body dynamics

The whole-body dynamics (WBD, also known as rigid-body dynamics or multi-body dynamics) is a model that represents a robot as a floating-base mechanical system comprising multiple rigid bodies connected by joints (see Figure (a)). Its equation of motion is expressed as follows. (1) $\begin{aligned} M (q) \dot{v} + h (q, v) = S^{T} τ + J (q)^{T} λ \end{aligned}$ (1) Here, $q$ , $v$ , and $τ$ are the generalized coordinate, velocity, and torque of the system, M is the inertia matrix, h is the Coriolis-centrifugal term, S is the selection matrix that specifies actuated joints, and J is the contact Jacobian. The domain of the generalized coordinate $q$ is $S E (3) \times R^{n_{j}}$ , where $n_{j}$ is the number of joints. Here, the orientation of the base link is parameterized either by a unit quaternion or Euler angles. The contact force vector $λ$ is a concatenation of all contact forces; $λ = [λ_{1}^{T}, λ_{2}^{T}, \dots, λ_{n_{c}}^{T}]^{T}$ , where $λ_{i}$ is the contact force acting on the ith contact point. In rigid contact model, the contact force $λ$ is determined to satisfy the complementarity between contact force and contact acceleration at each contact point. Moreover, to avoid slipping, each contact force must be inside the friction cone associated with that contact point. See, for example, [Citation56] for more details on the whole-body dynamics.

Figure 1. Dynamical models with different dimensionality.

2.3. Centroidal dynamics

The centroidal dynamics equation given below describes the dynamics of the center of mass and the total angular momentum (see Figure (b)). (2) $\begin{aligned} m {\ddot{p}}_{c o m} & = f_{c} - m g \\ \dot{L} & = τ_{c} + (r - p_{c o m}) \times f_{c} \end{aligned}$ (2) Here, $p_{c o m}$ and $L$ are the position of the center of mass (CoM, also called the center of gravity, CoG) and the total angular momentum with respect to the CoM. Moreover, $f_{c}$ is the total contact force and $τ_{c}$ is the total contact moment around the point of application $r$ , which can be chosen arbitrarily. The 6D vector consisting of $f_{c}$ and $τ_{c}$ is called the total contact wrench.

The centroidal dynamics can be rigorously derived from the whole-body dynamics (Equation1(1) $\begin{aligned} M (q) \dot{v} + h (q, v) = S^{T} τ + J (q)^{T} λ \end{aligned}$ (1) ) without approximation. The relationship between the whole-body and centroidal dynamics is well described in [Citation57].

One difficulty of the centroidal dynamics is that the relationship between the angular momentum $L$ and the robot's posture is not explicit. A frequently used approach is the single rigid-body dynamics (SRBD) approximation which expresses the angular momentum as $L = I ω$ , where I and $ω$ are the base link's inertia tensor and angular velocity, respectively. This approximation is only valid if the inertia tensor does not change drastically depending on the robot's posture, and therefore, is often used for MPC of quadrupeds with light-weight legs [Citation26,Citation30]. One limitation of single rigid-body approximation is that it cannot express movements involving dynamical changes of the inertia tensor, such as changing the posture in the air [Citation58].

2.4. Contact models

With the whole-body and centroidal dynamics models, the contacts are often modeled as constraints on the kinematics. When the contact sequence (e.g. gait patterns and contact points) is given, the contacts are typically expressed as equality constraints. Let $r (q)$ and $\bar{r}$ be the stacks of positions of bodies in contacts and given contact points, respectively. The contact constraint is then given by (3) $\begin{aligned} r (q) - \bar{r} = 0. \end{aligned}$ (3) Alternatively, one can consider contact velocity or contact acceleration constraints, i.e. time derivative and second time derivatives of (Equation3(3) $\begin{aligned} r (q) - \bar{r} = 0. \end{aligned}$ (3) ), respectively. Note that instead of the position $r (q)$ and given contact point $\bar{r}$ , the distance between the body and point (e.g. the height of a foot from the ground) or its time derivative is also utilized to determine the horizontal contact positions through the MPC optimization.

In contrast, when the planning problem involves the decision-making on the contact sequence, the underlying dynamics must be modeled so that various possibilities can be considered in the problems. A typical way is to model the contact dynamics using the complementarity constraints [Citation59]. An intuitive example of the complementarity constraints expresses on/off of a point contact: (4) $\begin{aligned} φ (q) \geq 0, f_{N} \geq 0, φ (q) f_{N} = 0, \end{aligned}$ (4) where $φ (q)$ denotes the distance between the robot's body and the environment, and $f_{N}$ denotes the corresponding normal contact force. While the complementarity constraints can rigorously model the unilateral aspect of contacts, the optimization problems are generally hard to solve. More computationally tractable but inaccurate contact models have been proposed to mitigate such difficulties [Citation31,Citation60,Citation61]. These models can be regarded as soft contact models and approximations of the complementarity constraints. Neunert et al. [Citation31,Citation60] modeled the contacts via a mass-spring-damper system, which is simple but leads to a stiff dynamical system and inaccuracy. Todorov [Citation61] proposed an analytically invertible and differentiable contact model used in MuJoCo simulator [Citation62]. However, it still lacks physical accuracy compared with the rigorous rigid contact models, e.g. allowing penetration between rigid objects, as seen in a contact model comparison of [Citation63].

2.5. Linear inverted pendulum mode

The linear inverted pendulum mode, initially proposed in [Citation64], describes the movement of the CoM as a linear second-order ordinary differential equation (ODE). Initially proposed as a 2D dynamical model, it has been generalized extensively in later studies [Citation65]. The LIPM can be directly derived from the centroidal dynamics with an additional assumption that no tilting moment act around the CoM. Some earlier studies claim that the LIPM is valid only under the lumped mass assumption; that is, the whole mass is concentrated around the CoM, but this assumption is, though sufficient, not necessary. Its derivation can be found in [Citation50].

The 3D LIPM equation is expressed as follows. (5) $\begin{aligned} {\ddot{p}}_{c o m} = \frac{1}{T^{2}} (p_{c o m} - p_{z m p}) - g \end{aligned}$ (5) Here, $p_{c o m}$ is the CoM, $p_{z m p}$ is the ZMP, $g = [0, 0, g]^{T}$ is the acceleration of gravity, and T is an arbitrary constant (more generally, it can be time-varying). In a simple setup where the CoM maintains a constant height h from the ground, the condition that the z-component of ${\ddot{p}}_{c o m}$ is 0 gives $T = \sqrt{\frac{h}{g}}$ . Because the LIPM is a second-order linear ODE, it yields a closed-form solution. This characteristic is extremely useful for planning long trajectories with a small number of decision variables.

The LIPM has been used extensively for walking pattern generation and stabilization control of bipedal and humanoid robots. Theoretically, the LIPM is applicable not only to bipedal locomotion but to general-legged locomotion. However, its application to non-biped robots such as quadrupeds is relatively rare (see Section 4.1).

2.6. Stability criteria based on contacts

It is essential to consider contact stability in the planning and control of legged locomotion. That is, generated trajectories must be supported by the distribution of valid contact forces. The CoM-based static stability criterion was used in early studies. Namely, the robot is statically stable if the ground projection of its CoM is inside the support region. However, this conservative stability criterion inevitably leads to static walking, which is impractical in terms of walking speed. For dynamic walking, the ZMP-based stability criterion is used instead. The ZMP (zero (tilting) moment point) or the CoP (center of pressure) is a point at which the tangential components of the net contact moment are zero. In coplanar contact cases (i.e. when all contact points lie on a single common plane), the condition that the ZMP is inside the convex hull of contact points is a necessary condition for surface contact stability. It is not generally sufficient because it neglects friction constraint (in other words, it assumes that the friction coefficient is large enough). Consideration of friction was discussed in [Citation65]. For non-coplanar contact cases, the feasible set of the ZMP or total contact wrench can be still defined but are difficult to compute explicitly. Instead, some recent studies treated contact forces of multiple contact points as decision variables and impose contact stability constraints in optimization.

The CoM-based static stability is too conservative, while the ZMP/contact wrench-based criteria merely ensure the existence of valid contact force distribution at every instant. A theoretical framework that treats true dynamical stability (a property of a dynamical system that its state can be kept inside a bounded region indefinitely) in the context of legged locomotion has been developed as the capturability theory [Citation66]. The robot is said to be in an N-step capturable state if it can be brought to a halt within N walking steps. In particular, the robot is 0-step capturable if the instantaneous capture point (ICP) is inside the support region. It is a natural extension of the classical CoM-based static stability criterion. The capturability criterion is used to ensure the closed-loop stability of MPC [Citation37].

3. MPC algorithms

Before reviewing the robotic applications of MPC, we provide an overview of MPC algorithms. Figure shows the classification of solution approaches to MPC problems, i.e. optimal control problems (OCPs). As shown in Figure , the solution approaches to MPC problems are roughly divided into three methods: the Hamilton-Jacobi-Bellman equation (dynamic programming), indirect methods (e.g. Pontryagin's maximum principle), and direct methods. However, the two former approaches are difficult to use for practical robotic problems that involve nonlinear and high-dimensional constrained MPC problems. Therefore, this survey focuses on the direct methods in which the MPC problems are transformed into nonlinear programs (NLPs) and solved by numerical optimization techniques.

Figure 2. Overview of solution approaches of MPC.

3.1. Linear MPC

If the dynamical system and constraints are linear and the cost function is quadratic, the MPC is classified as linear MPC. The NLPs of the direct methods are then reduced to quadratic programs (QPs). Dedicated QP solvers have been developed, such as active-set [Citation67,Citation68], interior point [Citation69–71], and alternating direction method of multipliers (ADMM)-based methods [Citation72]. Their efficiency typically depends on problem settings, e.g. numbers of optimization variables and inequality constraints. A comprehensive review of QP solvers for MPC can be found in [Citation73].

3.2. Nonlinear MPC

If the dynamical systems or constraints are nonlinear or the cost function is nonconvex, the MPC optimization problem is classified as a nonlinear (or nonconvex) MPC problem. A challenge in nonlinear MPC is that we must solve an NLP problem within a predefined sampling time of MPC, which is typically small as shown in Table . Refs. [Citation74,Citation75] are pioneer works on nonlinear MPC. These two works utilize tangential predictors to track the solution manifold of the NLP and perform only one inexact Newton iteration to update the solution at each time. Such ideas were further studied in [Citation76,Citation77].

On the other hand, improving the computational speed of solving the NLPs arising in nonlinear MPC has gained attention. In particular, the Newton-type methods such as sequential quadratic programming (SQP) methods and (nonlinear) interior point (IP) methods (specifically, primal-dual IP method) [Citation78] have been developed along with efficient general-purpose NLP solvers such as Snopt [Citation79] and Ipopt [Citation80]. Developments in active-set QP solvers for MPC, such as [Citation67] and [Citation68], have also pushed the efficiency of SQP methods for MPC. SQP methods, particularly with active-set QP solvers, can leverage warm-start but lack efficiency for large-scale problems. However, IP methods can treat large-scale problems efficiently while they cannot leverage warm-start. These general-purpose NLP solvers or SQP methods with active-set QP solvers still lack computational efficiency. Therefore, it is still challenging to implement real-time MPC for fast, large-scale, and complicated robotic systems.

3.3. Structure-exploiting Newton-type methods for nonlinear MPC problems

To further improve the computational efficiency of nonlinear MPC, researchers have attempted to exploit the structure of the NLP problems of the direct methods of MPC. The NLP problems for MPC, particularly of single shooting and multiple-shooting methods, have special structures [Citation81,Citation82]. As the multiple-shooting method is more suitable for nonlinear MPC than the single shooting methods in terms of computational speed and convergence property [Citation82], we show an example of the structured NLP of the multiple-shooting method. Let N be the length of the horizon, $x_{0}, \dots, x_{N}$ and $u_{0}, \dots, u_{N - 1}$ be the state and control input of the system over the horizon, and $\bar{x}$ be the given initial state. Then, the NLP is given as (6a) $\begin{aligned} min_{x_{0}, \dots, x_{N}, u_{0}, \dots, u_{N - 1}} V_{N} (x_{N}) + \sum_{i = 0}^{N - 1} l_{i} (x_{i}, u_{i}) \end{aligned}$ (6a) (6b) $\begin{aligned} s . t . x_{0} = \bar{x}, \end{aligned}$ (6b) (6c) $\begin{aligned} x_{i + 1} = f_{i} (x_{i}, u_{i}), i = 0, \dots, N - 1, \end{aligned}$ (6c) (6d) $\begin{aligned} g_{i} (x_{i}, u_{i}) \leq 0, i = 0, \dots, N - 1, \end{aligned}$ (6d) (6e) $\begin{aligned} g_{N} (x_{N}) \leq 0, \end{aligned}$ (6e) where $V_{N} (x_{N})$ denotes the given terminal cost function $l_{i} (x_{i}, u_{i})$ denotes the given cost function, (Equation6c(6c) $\begin{aligned} x_{i + 1} = f_{i} (x_{i}, u_{i}), i = 0, \dots, N - 1, \end{aligned}$ (6c) ) denotes the discrete-time state equation, and (Equation6d(6d) $\begin{aligned} g_{i} (x_{i}, u_{i}) \leq 0, i = 0, \dots, N - 1, \end{aligned}$ (6d) ) and (Equation6e(6e) $\begin{aligned} g_{N} (x_{N}) \leq 0, \end{aligned}$ (6e) ) denote the given inequality constraints. Note that we can obtain the NLP for the single shooting method by eliminating $x_{i}$ from the decision variables via (Equation6b(6b) $\begin{aligned} s . t . x_{0} = \bar{x}, \end{aligned}$ (6b) ) and (Equation6c(6c) $\begin{aligned} x_{i + 1} = f_{i} (x_{i}, u_{i}), i = 0, \dots, N - 1, \end{aligned}$ (6c) ). Here, we treat the inequality constraints (Equation6d(6d) $\begin{aligned} g_{i} (x_{i}, u_{i}) \leq 0, i = 0, \dots, N - 1, \end{aligned}$ (6d) ) and (Equation6e(6e) $\begin{aligned} g_{N} (x_{N}) \leq 0, \end{aligned}$ (6e) ) using the IP methods. Let $Δ x_{0}, \dots, Δ x_{N}$ , $Δ u_{0}, \dots, Δ u_{N - 1}$ be the Newton steps of $x_{0}, \dots, x_{N}$ , $u_{0}, \dots, u_{N - 1}$ . The computation of the Newton steps, which is one of the most time-consuming aspects of the Newton-type iteration, can then be reduced to the following QP subproblem [Citation82]: (7a) $\begin{aligned} min_{\begin{matrix} Δ x_{0}, \dots, Δ x_{N}, \\ Δ u_{0}, \dots, Δ u_{N - 1} \end{matrix}} \frac{1}{2} {[\begin{matrix} Δ x_{N} \\ 1 \end{matrix}]}^{T} [\begin{matrix} P_{N} & p_{N} \\ p_{N}^{T} & 1 \end{matrix}] [\begin{matrix} Δ x_{N} \\ 1 \end{matrix}] \\ + \frac{1}{2} {[\begin{matrix} Δ x_{i} \\ Δ u_{i} \\ 1 \end{matrix}]}^{T} [\begin{matrix} Q_{i} & S_{i} & q_{i} \\ S_{i}^{T} & R_{i} & r_{i} \\ q_{i}^{T} & r_{i}^{T} & 1 \end{matrix}] [\begin{matrix} Δ x_{i} \\ Δ u_{i} \\ 1 \end{matrix}] \end{aligned}$ (7a) (7b) $\begin{aligned} s . t . Δ x_{0} = \bar{x}, \end{aligned}$ (7b) (7c) $\begin{aligned} Δ x_{i + 1} = A_{i} Δ x_{i} + B_{i} Δ u_{i} + b_{i}, i = 0, \dots, N - 1, \end{aligned}$ (7c) where $A_{i} := \nabla_{x} f (x_{i}, u_{i})$ , $B_{i} := \nabla_{u} f (x_{i}, u_{i})$ . The coefficient matrices in the quadratic cost function of (Equation7a(7a) $\begin{aligned} min_{\begin{matrix} Δ x_{0}, \dots, Δ x_{N}, \\ Δ u_{0}, \dots, Δ u_{N - 1} \end{matrix}} \frac{1}{2} {[\begin{matrix} Δ x_{N} \\ 1 \end{matrix}]}^{T} [\begin{matrix} P_{N} & p_{N} \\ p_{N}^{T} & 1 \end{matrix}] [\begin{matrix} Δ x_{N} \\ 1 \end{matrix}] \\ + \frac{1}{2} {[\begin{matrix} Δ x_{i} \\ Δ u_{i} \\ 1 \end{matrix}]}^{T} [\begin{matrix} Q_{i} & S_{i} & q_{i} \\ S_{i}^{T} & R_{i} & r_{i} \\ q_{i}^{T} & r_{i}^{T} & 1 \end{matrix}] [\begin{matrix} Δ x_{i} \\ Δ u_{i} \\ 1 \end{matrix}] \end{aligned}$ (7a) ) are derived from the derivatives of the Lagrangian of the NLP (Equation6a(6a) $\begin{aligned} min_{x_{0}, \dots, x_{N}, u_{0}, \dots, u_{N - 1}} V_{N} (x_{N}) + \sum_{i = 0}^{N - 1} l_{i} (x_{i}, u_{i}) \end{aligned}$ (6a) ) and some matrix manipulations related to IP methods. Here, we omitted the detailed derivations of the coefficient matrices, but they can be found, e.g. in [Citation78,Citation82,Citation83]. According to (Equation7a(7a) $\begin{aligned} min_{\begin{matrix} Δ x_{0}, \dots, Δ x_{N}, \\ Δ u_{0}, \dots, Δ u_{N - 1} \end{matrix}} \frac{1}{2} {[\begin{matrix} Δ x_{N} \\ 1 \end{matrix}]}^{T} [\begin{matrix} P_{N} & p_{N} \\ p_{N}^{T} & 1 \end{matrix}] [\begin{matrix} Δ x_{N} \\ 1 \end{matrix}] \\ + \frac{1}{2} {[\begin{matrix} Δ x_{i} \\ Δ u_{i} \\ 1 \end{matrix}]}^{T} [\begin{matrix} Q_{i} & S_{i} & q_{i} \\ S_{i}^{T} & R_{i} & r_{i} \\ q_{i}^{T} & r_{i}^{T} & 1 \end{matrix}] [\begin{matrix} Δ x_{i} \\ Δ u_{i} \\ 1 \end{matrix}] \end{aligned}$ (7a) ), the arising QP subproblems of IP methods take the same structure as unconstrained time-varying linear-quadratic OCPs. Note that we can have the same QP subproblems in the inner iterations of Augmented Lagrangian (AL) methods. Efficient MPC algorithms that leverage the unconstrained LQR structure to compute the Newton steps were proposed in [Citation83–86].

3.3.1. Riccati recursion

Riccati recursion [Citation83], which corresponds to dynamic programming for the linear-quadratic OCP (Equation7a(7a) $\begin{aligned} min_{\begin{matrix} Δ x_{0}, \dots, Δ x_{N}, \\ Δ u_{0}, \dots, Δ u_{N - 1} \end{matrix}} \frac{1}{2} {[\begin{matrix} Δ x_{N} \\ 1 \end{matrix}]}^{T} [\begin{matrix} P_{N} & p_{N} \\ p_{N}^{T} & 1 \end{matrix}] [\begin{matrix} Δ x_{N} \\ 1 \end{matrix}] \\ + \frac{1}{2} {[\begin{matrix} Δ x_{i} \\ Δ u_{i} \\ 1 \end{matrix}]}^{T} [\begin{matrix} Q_{i} & S_{i} & q_{i} \\ S_{i}^{T} & R_{i} & r_{i} \\ q_{i}^{T} & r_{i}^{T} & 1 \end{matrix}] [\begin{matrix} Δ x_{i} \\ Δ u_{i} \\ 1 \end{matrix}] \end{aligned}$ (7a) ), has gained popularity both in MPC and robotics communities as it can compute the Newton step in $O (N)$ computational burden. This contrasts the general NLP solvers and SQP methods with active-set QP solvers in which the Newton step computation typically requires $O (N^{3})$ time-complexity. In the MPC community, IP methods are typically used with Riccati recursion for inequality constraints [Citation83]. An efficient open-source IP solver with Riccati recursion is developed in [Citation70], and a nonlinear MPC toolbox based on it is developed in [Citation87].

Treating the equality constraints, which often represent the contact constraints as stated in Section 2.4, with the Riccati recursion-type methods is another issue because IP methods cannot be used. A simple way is to use AL methods [Citation88], which generally lack convergence speed, however. Variants of the Riccati recursion-type methods to treat the equality constraints have been proposed [Citation28,Citation89–91].

3.3.2. Differential dynamic programming

Riccati recursion-based Newton's methods are often interpreted as differential dynamic programming (DDP) [Citation92] in the robotics community. Also, its Gauss-Newton counterpart is called an iterative linear quadratic regulator (iLQR) [Citation93]. Variants of DPP and iLQR have been proposed, such as sequential linear quadratic (SLQ) [Citation28] and feasible-driven DDP [Citation94]. In robotic applications, a variant of IP methods [Citation95] is often used to treat inequality constraints with DDP-type methods. AL methods are also used, e.g. [Citation88,Citation96], because they have the same unconstrained LQR structure as (Equation7a(7a) $\begin{aligned} min_{\begin{matrix} Δ x_{0}, \dots, Δ x_{N}, \\ Δ u_{0}, \dots, Δ u_{N - 1} \end{matrix}} \frac{1}{2} {[\begin{matrix} Δ x_{N} \\ 1 \end{matrix}]}^{T} [\begin{matrix} P_{N} & p_{N} \\ p_{N}^{T} & 1 \end{matrix}] [\begin{matrix} Δ x_{N} \\ 1 \end{matrix}] \\ + \frac{1}{2} {[\begin{matrix} Δ x_{i} \\ Δ u_{i} \\ 1 \end{matrix}]}^{T} [\begin{matrix} Q_{i} & S_{i} & q_{i} \\ S_{i}^{T} & R_{i} & r_{i} \\ q_{i}^{T} & r_{i}^{T} & 1 \end{matrix}] [\begin{matrix} Δ x_{i} \\ Δ u_{i} \\ 1 \end{matrix}] \end{aligned}$ (7a) ) in the inner Newton-type iteration.

If the MPC problem is deterministic, DDP-type methods are almost identical to the Riccati recursion-based Newton (or Gauss-Newton) method for the single shooting method (see Chapter 8 of [Citation82]). From this point of view, it is easy to implement ‘DDP/iLQR-like methods with inequality constraints’ in deterministic MPC by using the Riccati recursion to solve the QP subproblems of nonlinear IP methods. This is considered challenging in the robotics community [Citation88,Citation95]. Further, it is trivial to implement ‘multiple shooting DDP/iLQR-like methods’ just by using the Riccati recursion in solving the QP subproblems of multiple shooting NLPs [Citation81]. This is also recognized as nontrivial in the robotics community [Citation94].

3.4. MPC with hybrid systems/complementarity constraints

When the contact sequence and contact timings are fixed in MPC, e.g. when a higher-level contact planner provides them, MPC can be implemented by the above numerical methods, in principle. In contrast, when MPC involves the decision-making on the contact sequence, the robotic systems with contacts have to be modeled such as a hybrid system [Citation97] or using complementarity constraints as stated in Section 2.4. The aforementioned numerical methods are then no longer applicable. Alternatively, the approximated soft contact models introduced in Section 2.4 are used to incorporate the decision-making on the contact sequence into MPC even with the above numerical methods. However, it has limitations as reviewed in Section 2.4. Its applications are introduced later in Section 4.3.2.

When the robotic systems are modeled as hybrid systems, MPC is typically implemented as a mixed-integer program (MIP) [Citation98], in which the discrete variables are introduced explicitly. However, MIPs are hard to implement in real time owing to exponentially growing computational time [Citation98]. Recently, suboptimal solution methods [Citation99,Citation100] and MPC-specific solution methods [Citation101,Citation102] have been studied. When the robotic systems are modeled using the complementarity constraints, the resultant OCP is often termed as contact-implicit trajectory optimization (CITO), whose pioneer work is [Citation103]. A solution method of the CITO is using mathematical programs with complementarity constraints (MPCC) [Citation104], which solves the optimization problems under complementarity constraints. The MPCC approach was first utilized for the CITO in [Citation105]. However, MPCC inherently lacks constraint qualifications such as linear independence constraint qualification (LICQ), which causes numerical ill-conditionings [Citation104]. Moreover, numerical alleviations for such problems [Citation106] often lead to undesirable stationary points [Citation107]. An alternative approach for the CITO is bilevel optimization (BO) methods [Citation103,Citation108–110], whose solution can be a solution to MPCC under certain conditions [Citation111]. However, BO problems also involve combinatorial structure and are therefore hard to solve in general [Citation112].

3.5. MPC of switched systems

Even when the contact sequence is given, we can further optimize the contact timings or switching times, which is classified as OCP of switched systems [Citation113,Citation114]. A classical solution approach for OCP of switched systems is to decompose the optimization problem into two stages: one is for switching times and the other is for state and control trajectory [Citation113,Citation115]. This two-stage approach has been successfully applied for offline motion planning [Citation116,Citation117]. However, it lacks computational speed and a local convergence guarantee, making it difficult to realize MPC. Another approach is to optimize the switching times, state, and control simultaneously in Newton-type methods [Citation114]. In [Citation114], the simultaneous optimization problems were solved by the off-the-shelf NLP solvers such as [Citation80]. However, their applications are also limited to offline trajectory optimization due to inefficient computation of the general NLP solvers, and it is still challenging to optimize the contact timings online.

4. MPC applications for legged robots

4.1. MPC with LIPM

For bipedal robots, MPC based on the LIPM has been widely used. This model is linear to the horizontal position of the CoM and is time-invariant if the CoM is constrained in a plane. The ZMP is equivalent to the center of pressure on the sole and must lie within the convex hull of the foot support region. MPC is used to plan a CoM trajectory that satisfies this ZMP constraint.

As an early approach, instead of explicitly considering the ZMP constraint, a method was proposed to minimize the error with the reference ZMP trajectory [Citation16,Citation118]. This method, also called preview control [Citation119], is computationally efficient because the solution is written in closed form using the solution of the Riccati equation. However, inequality constraints cannot be considered. The reference ZMP trajectory is determined based on the footstep sequence obtained in the preprocessing footstep planning [Citation53]. Graph search [Citation120] is widely used for footstep planning [Citation121,Citation122], but mathematical optimizations have also been used in recent years [Citation123,Citation124]. Many extensions have been made to CoM trajectory generation based on preview control; for example, in [Citation125], the angular momentum trajectory was generated simultaneously with the CoM trajectory.

Owing to improved computational performance and efficient QP solvers, linear MPC that explicitly considers inequality constraints of ZMP has come into use [Citation126]. See [Citation54] for a survey of linear MPC for bipedal robots. Linear MPC is formulated as QP and solved in real time by coarsening the horizon sample. Some MPCs impose an equality constraint on the capture point or DCM at the horizon terminal, which is specific to MPCs based on the LIPM [Citation29,Citation37]. As the CoM height changes along a predefined trajectory, the system becomes linear time-varying, which can be solved with linear MPC without significant change in computational cost [Citation127]. Control inputs at sampling points within the horizon are often used as decision variables, but several attempts have been proposed to reduce the dimension of the variables. For example, in [Citation29], the ZMP trajectory was expressed as a polynomial function, and the coefficients of the polynomial function are used as decision variables. In addition, in [Citation21,Citation42,Citation128,Citation129], by analytically solving the linear differential equation of the LIPM, the CoM at the timing of footstep switching was derived in closed form, eliminating the fine sampling within the horizon.

A major extension of linear MPC in bipedal walking is the simultaneous planning of CoM trajectory and foot placement sequence [Citation29,Citation37,Citation128,Citation130]. In [Citation131], $l_{1}$ - and $l_{\infty}$ -norm were used in the regularization of foot placement instead of the commonly used $l_{2}$ -norm, so that the foot placement modification is sparse. Linear MPCs that can simultaneously plan CoM trajectory and foot placement sequence have also been proposed, using the first-order dynamics of the DCM instead of the LIPM [Citation132,Citation133]. In [Citation133], the angular momentum trajectory was also planned with the CoM trajectory and foot placement sequence, using the dynamics of the DCM. Furthermore, MPCs beyond the framework of linear MPCs have also been proposed for planning foot placement [Citation27,Citation132], although they can no longer be formulated as QP. In [Citation132], the MPC was represented as an MIP by approximating the trigonometric functions required to describe the foot orientation as piecewise linear functions with 0–1 variables [Citation123]. In [Citation27], the nonlinear MPC was solved by SQP, taking into account the nonconvex constraints representing the obstacle avoidance of the feet. There are also some methods [Citation18,Citation24,Citation134,Citation135] in which the foot placement modification was treated in a different computational process other than the MPC for CoM trajectory generation.

Another major extension of linear MPC in bipedal walking is to account for external forces in the robot's hands. In linear MPC, external forces are treated in different ways; in [Citation136] they were included in the state of the LIPM, in [Citation33] they were included in the control input, and in [Citation137] they were considered as offsets of the model. In [Citation138], whether hand contact is necessary to avoid a fall due to disturbances was determined in linear MPC by introducing slack variables in ZMP's inequality constraints. Morisawa et al. [Citation139] extended the LIPM by introducing a point corresponding to ZMP that accounts for hand contacts and represented the dynamics of multi-contact motion as a linear system.

The dynamics of bipedal walking become nonlinear if the horizontal and vertical positions of the CoM are considered as state variables. Some MPCs treat nonlinear systems and plan the three-dimensional trajectory of the CoM [Citation140–142]. In [Citation140], DDP was applied to plan the CoM trajectory, omitting ZMP's inequality constraints. In [Citation141], using the three-dimensional position of the CoM as decision variables, the ZMP constraints were expressed as quadratic constraints, and the nonconvex quadratically constrained quadratic program (QCQP) was solved by the SQP method. In [Citation142], by setting bounds on the LIPM time constant, the nonlinearity caused by the CoM vertical motion was avoided, and the three-dimensional CoM trajectory and foot placement sequence were simultaneously planned using linear MPC.

As a supplement, we present several methods for planning bipedal CoM trajectory without using linear MPC, formulated as QP. In [Citation134,Citation143,Citation144], the CoM trajectory was efficiently planned by integrating the first-order dynamics of the capture point or DCM in the reverse direction. In [Citation135,Citation145], an analytical solution to the linear MPC problem with boundary conditions was derived. MPCs based on the LIPM have been applied mainly to bipedal and humanoid robots, but have also been applied to quadruped robots [Citation41,Citation129,Citation146,Citation147].

4.2. MPC with centroidal dynamics and SRBD models

The LIPM does not explicitly consider friction constraints on contact forces and is, therefore, inappropriate for handling non-coplanar contacts. The centroidal dynamics [Citation148] is a model based on the Newton-Euler equation for solving this problem and is widely used in humanoid and quadrupedal robots.

4.2.1. Application to humanoid robots

The centroidal dynamics model is used to plan and control humanoid multi-contact motion in which the feet and hands make non-coplanar contacts. See survey [Citation55] for some MPCs in humanoid multi-contact motion. The difficulty of the centroidal dynamics model compared to the LIPM is that it has nonlinearity in the rotational dynamics [Citation55].

Several MPC-based methods [Citation22,Citation149–154] that mitigate this nonlinearity in some way and avoid MPC with computationally expensive nonconvex optimization are presented. In [Citation22], the nonlinear terms in the rotational dynamics were neglected to construct a linear MPC, formulated as a QP, under the contact force constraints. Audren et al. [Citation149] focused on the fact that the rotational dynamics can be represented as a linear system by predetermining the CoM vertical motion. In [Citation150], the rotational dynamics were approximated linearly by assuming a convex region for the CoM and by treating the upper bound of the angular momentum. Fernbach et al. [Citation151] eliminate the nonlinear term in the rotational dynamics by representing the CoM trajectory as a Bezier curve parameterized by a one-degree-of-freedom decision variable. In [Citation152], nonlinear trajectory optimization was attributed to QCQP by expressing the nonlinear term in rotational dynamics as the difference between two convex functions. In [Citation153], a linear MPC is constructed with a system that linearizes the rotational dynamics while considering the effect of the angular momentum generated by the motion of each link. In [Citation150,Citation154], the friction constraints at each contact point were integrated and converted to CoM acceleration constraints, and CoM acceleration was used instead of contact forces as the decision variables for linear MPC to reduce the dimension of the variables.

In contrast, MPCs that use NLP solvers to solve nonconvex optimization have also emerged. Because nonconvex optimization is computationally expensive, the early methods were positioned as offline trajectory optimization rather than online MPC. In [Citation155–157], trajectory generation of CoM and angular momentum under a centroidal dynamics model was formulated as a nonlinear optimization and solved offline by an off-the-shelf SQP solver. Nonlinear trajectory optimization can recently be solved online with the MPC scheme owing to improvements in computational performance and optimization algorithms. In [Citation57], nonlinear MPC with learned kinematic reachability was solved by the multiple-shooting method using the OCP solver. In [Citation46], the centroidal trajectory and contact position sequence were simultaneously planned online considering a nonlinear centroidal dynamics model.

Furthermore, by using a full-body model for kinematic constraints and a centroidal dynamics model for dynamic constraints, a method of simultaneously planning centroidal and joint position trajectories while avoiding high-dimensional full-body dynamics has been proposed [Citation155]. Chignoli et al. [Citation158] further imposed constraints on the speed-torque relationship of the actuators.

4.2.2. Application to quadrupedal robots

Centroidal dynamics-based MPCs were also applied for quadrupedal robots which typically involve point contacts in their feet. Existing quadrupedal robots, such as MIT Mini Cheetah, were designed such that the weights of limbs are small relative to the whole-body weight. For such hardware, the single rigid-body dynamics (SRBD) model is a simple and powerful model that can simplify the MPC optimization problem while accurately capturing the dynamics. Di Carlo et al. [Citation30] achieved fast and highly dynamic motion of a quadrupedal robot via a fast linear MPC in which the robot dynamics were approximated by the (linearized) SRBD model, and the contact forces of feet (i.e. ground reaction forces (GRFs)) were treated as the control input to the SRBD model. When the weights of limbs are negligibly small compared with the base and a lower-level joint controller can track the GRFs from MPC quickly, the SRBD-based simplified model can be accurate. Ding et al. [Citation159] further improved the SRBD-based linear MPC in terms of linearization of the rotation and achieved acrobatic motion on a quadrupedal robot. In [Citation160], because the nonlinearity of rotational dynamics is due to the cross-product, the nonlinear trajectory optimization was solved by alternately solving two types of QPs with one variable fixed in the cross-product term.

Nonlinear MPC is also applied with the SRBD and centroidal dynamics models. Refs. [Citation26,Citation32,Citation34] incorporated the contact positions as the optimization variables in the linearized SRBD-based MPC of [Citation30]. The resultant optimization problem was nonlinear; therefore, a nonlinear MPC was implemented for the problem. Rathod et al. [Citation45] formulated a nonlinear MPC with a nonlinear SRBD model to capture the nonlinear rotational dynamics and achieved quadrupedal walking that can traverse over a large obstacle only with on-board embedded computation. Further, Refs. [Citation28,Citation95] applied nonlinear MPC for the nonlinear SRBD and full-body kinematics model, in which the GRFs and joint velocities were treated as the control input to the dynamical systems. In [Citation40], instead of the nonlinear SRBD model, the centroidal momentum matrix (CMM) [Citation148] was used with the full-kinematics in nonlinear MPC, which is the same formulation of the offline trajectory optimization of [Citation155]. The CMM enabled us to consider the exact CoM dynamics, including the weights of limbs in MPC, while the SRBD-based methods ignored it. The full-kinematic model (and the CMM) are versatile so that they can realize dynamic loco-manipulation tasks [Citation40] of ALMA [Citation161], locomotion under safety constraints [Citation39], and wheeled-legged control [Citation38]. In contrast to the aforementioned research assuming the predefined contact sequence, recent research reported the successful online CITO [Citation48] with the centroidal dynamics model with some simplifications on the problem, e.g. frictional complementarity constraints were neglected.

4.3. MPC with full-body models (whole-body MPC)

4.3.1. Developments in rigid-body dynamics algorithms

MPC with whole-body (full-body) dynamics, called whole-body MPC, had been challenging owing to the large problem size of the NLP and the high computational cost of the whole-body dynamics and their derivatives. Therefore, early studies on whole-body MPC were limited, e.g. linear MPC based on linearly approximated dynamics, as studied in the computer graphics community [Citation162]. This has come closer to reality owing to the developments in computational algorithms of the whole-body dynamics and their derivatives, namely, rigid-body dynamics algorithms [Citation56]. Well-known rigid-body dynamics algorithms are the recursive Newton-Euler algorithm (RNEA) [Citation163] for inverse dynamics computation and the articulated body algorithm (ABA) [Citation164] for forward dynamics computation. Carpentier et al. [Citation165] proposed efficient forward dynamics computation under a closed-kinematic chain or rigid contacts.

For MPC, the computational efficiency of derivatives of rigid-body dynamics is also critical. Neunert et al. [Citation166] proposed automatics differentiation of the RNEA and ABA. Carpentier et al. [Citation167] proposed analytical derivatives of the RNEA and ABA and achieved faster computation than the automatic differentiation. Singh et al. [Citation168] further improved the computational efficiency of the analytical derivatives using spatial vector algebra. Also, Ayusawa and Yoshida [Citation169] proposed analytical derivatives of forward and inverse dynamics computations. Nganga and Wensin [Citation170] proposed an automatic differentiation of the second-order derivatives rigid-body dynamics in direct OCPs to improve the convergence speed of the Newton-type method with the exact Hessian, which is an extension of [Citation171].

Some software have been developed to efficiently compute rigid-body dynamics and their derivatives. RobCoGen [Citation172] is an open-source C++ code-generation tool supporting the RNEA, ABA, and automatic differentiations of these algorithms [Citation166]. Pinocchio [Citation173] is an open-source C++ library for RNEA, ABA, and the analytical derivatives of the RNEA and ABA [Citation167].

4.3.2. State-of-the-art and challenges

Researchers can try the whole-body MPC because of the above developments in dynamics and their derivatives computation. Because the whole-body MPC is based on accurate full-body robot dynamics, it can achieve energy-efficient, reasonable, and highly dynamic motions for various problems. The whole-body MPC does not involve limitations on hardware platforms, working environment, and desired tasks, which can arise in MPC based on reduced-order models, i.e. the SRBD and centroidal dynamics models, due to the problem approximations. In the following, we introduce the researches that successfully implemented the whole-body MPC. The following MPC methods utilized Riccati recursion (specifically iLQR) to perform Newton-type iteration efficiently.

Refs. [Citation23,Citation31,Citation36] formulated whole-body MPC problems with approximated soft contact models that are introduced in Section 2.4. Ref. [Citation23] was the first study that realized the whole-body MPC in both simulation and real robot hardware. It utilized the analytically invertible and smooth contact model of [Citation61]. Ishihara et al. [Citation36] utilized the same contact model and proposed a hierarchical MPC strategy based on a singular perturbation technique to improve the control performance for fast dynamics. Neunert et al. [Citation31] implemented whole-body MPC with a contact model approximation based on the mass-spring-damper model. Neunert et al. [Citation31] also leveraged the automatic differentiation of the rigid-body dynamics proposed in [Citation166] while [Citation23] relied on slower finite differentiation. As stated in Section 2.4, these studies based on approximated contact models do not require the prespecified contact sequence and enable fast dynamics computation while lacking the solution's accuracy. Note that the offline trajectory optimization based on these contact models has also been studied, e.g. early in the computer graphics community [Citation174].

In contrast, Refs. [Citation94,Citation175] formulated whole-body MPC problems taking the rigid contacts into account under the given contact sequence. Specifically, the rigid collisions and contact acceleration constraints were explicitly considered as utilized for the offline trajectory optimization in [Citation176]. Further, Mastalli et al. [Citation94] leveraged the efficient derivative computations of rigid-body dynamics of [Citation167]. Dantec et al. [Citation44,Citation47] realized whole-body MPC of a humanoid robot using the algorithm of [Citation94]. These rigid contact-based MPC can determine the control action by taking the future impact dynamics (i.e. instantaneous changes in the state due to impacts) into account, which cannot be achieved in the aforementioned MPC based on reduced-order models.

While whole-body MPC has been implemented in various robotic problems, computational challenges remain. First, the full-body dynamics involve high nonlinearity, which requires many Newton-type iterations and reduces the reliability of the numerical optimization solvers. For example, Dantec et al. [Citation44] proposed to give an initial guess using the histories of the MPC solutions to improve the too-slow convergence speed of the whole-body MPC. Further, the computational burdens of dynamics and their derivatives and that of the Newton-type iterations still requires a long computational time, particularly compared with the simple SRBD model in which the computational cost of dynamics and Newton-type iterations is small.

4.4. Software for MPC of robotic systems

The robotic MPC problems, particularly in complicated models such as the centroidal dynamics model with full-kinematics or whole-body dynamics, involve complicated and variable MPC problem structures due to contacts. Moreover, rigid-body kinematics and dynamics calculations are required in solving optimization problems, which are difficult to implement only with the general NLP modeling tool [Citation177] or nonlinear MPC tool [Citation87]. Dedicated software for robotic MPC problems has been developed to interface with these difficulties. Control-toolbox [Citation178] is a toolbox that implements MPC algorithms such as iLQR and SQP and has interfaces for rigid-body library RobCoGen [Citation172]. ocs2 [Citation179] is a collection of optimal control solvers such as the iLQR-type method for switched systems [Citation180] and SQP methods. In ocs2, an approximated IP method [Citation95] and AL methods are implemented to treat constraints. It also supports interfacing the switched systems, i.e. robotic systems with contacts, and has robotic MPC examples such as MPC with the SRBD/centroidal dynamics model and the full-kinematics model using Pinocchio [Citation173]. Crocoddyl [Citation94] implemented iLQR-type methods for whole-body dynamics with rigid contacts in which Pinocchio was used to efficiently compute rigid-body dynamics and their derivatives. It can treat box constraints on the control input efficiently. robotic [Citation181] implemented the same whole-body dynamics with rigid contacts model based on Pinocchio as Crocoddyl. It utilized the nonlinear IP method for general inequality constraints and implemented the multiple-shooting method with Riccati recursion.

5. Conclusion

This survey paper reviewed the recent development of model predictive control methods in legged and humanoid robotics. Section 2 gave an overview of dynamical models of robotic systems. Section 3 reviewed MPC algorithms that are particularly suitable for robotic problems. Section 4 reviewed MPC methods for legged and humanoid robots. In the following, we point out open problems in the MPC of robotic systems.

First, ensuring the closed-loop stability is an unsolved problem. It is particularly difficult in whole-body MPC compared to trajectory generation MPC based on the reduced-order models because we must handle high-dimensional models with a short prediction horizon (due to the heavy computation cost). Some recent studies on combining control Lyapunov functions with MPC [Citation182] and imposing passivity in whole-body control [Citation183–185] are expected to provide insights to tackling this issue.

Second, reducing the computation cost of MPC is still an everlasting problem, particularly with on-board computation. In the MPC based on reduced-order models, there is a low-level controller that determines joint commands from the solution to MPC. It will be then a practical way to compensate for the computational inefficiency of MPC via the low-level controller such as the control barrier function [Citation186] as presented in [Citation39]. Whole-body MPC is challenging even with off-board computation. Recent advances in rigid-body algorithms such as [Citation165,Citation168,Citation170] will potentially reduce the computational burden of whole-body MPC.

The third problem is robustness against disturbances and model mismatches. This is particularly problematic in whole-body MPC since the control commands are directly sent to the actuators. In contrast, MPC based on the reduced-order models are inherently inaccurate and the low-level controllers such as QP-based whole-body controllers typically try to compensate for the modeling error. Possible approaches are robust MPC [Citation187,Citation188] and stochastic MPC [Citation189], in which the uncertainties are explicitly considered in the MPC formulation. Other possibilities are data-driven approaches such as the combination of MPC and adaptive-control [Citation190,Citation191] or reinforcement learning methods [Citation192].

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by JSPS KAKENHI [grant number JP22J11441].

Notes on contributors

Sotaro Katayama

Sotaro Katayama received Bachelor of Engineering Science, Master of Informatics, and Ph.D. in Informatics from Kyoto University, Kyoto, Japan, in 2017, 2019, and 2022, respectively. His research interest includes model predictive control of robotic systems.

Masaki Murooka

Masaki Murooka received the BE, MS, and PhD degrees in information science and technology from The University of Tokyo, Japan, in 2013, 2015, and 2018, respectively. He was a project assistant professor in the Department of Mechano-Informatics at The University of Tokyo from 2018 to 2020. He joined the CNRS-AIST Joint Robotics Laboratory in the National Institute of Advanced Industrial Science and Technology (AIST) in 2020 as a researcher. His research interest includes the motion planning and control of humanoid robots.

Yuichi Tazaki

Yuichi Tazaki received Dr. Eng degree in control engineering from the Tokyo Institute of Technology, Tokyo, Japan, in 2008. Since 2016, he has been an associate professor at Kobe University, Japan. His research interests include autonomous vehicles, mobile robots, and humanoid robots.

References

Murphy MP, Saunders A, Moreira C, et al. The LittleDog robot. Int J Rob Res. 2011;30(2):145–149.
Web of Science ®Google Scholar
Semini C, Tsagarakis NG, Guglielmino E, et al. Design of HyQ – a hydraulically and electrically actuated quadruped robot. Proc Inst Mech Eng I J Syst Control Eng. 2011;225(6):831–849.
PubMed Web of Science ®Google Scholar
Hutter M, Gehring C, Jud D, et al. ANYmal – a highly mobile and dynamic quadrupedal robot. In: 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). 2016. p. 38–44.
Google Scholar
Bledt G, Powell MJ, Katz B, et al. MIT cheetah 3: design and control of a robust, dynamic quadruped robot. In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). 2018. p. 2245–2252.
Google Scholar
Katz B, Carlo JD, Kim S. Mini Cheetah: a platform for pushing the limits of dynamic quadruped control. In: International Conference on Robotics and Automation. 2019. p. 6295–6301.
Google Scholar
Hirukawa H, Kanehiro F, Kaneko K, et al. Humanoid robotics platforms developed in HRP. Rob Auton Syst. 2004;48(4):165–175. Humanoids 2003.
Web of Science ®Google Scholar
Kaneko K, Harada K, Kanehiro F, et al. Humanoid robot HRP-3. In: 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems. 2008. p. 2471–2478.
Google Scholar
Kaneko K, Kanehiro F, Morisawa M, et al. Humanoid robot HRP-4 – humanoid robotics platform with lightweight and slim body. In: 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems. 2011. p. 4400–4407.
Google Scholar
Englsberger J, Werner A, Ott C, et al. Overview of the torque-controlled humanoid robot TORO. In: 2014 IEEE-RAS International Conference on Humanoid Robots. 2014. p. 916–923.
Google Scholar
Feng S, Whitman E, Xinjilefu X, et al. Optimization based full body control for the atlas robot. In: 2014 IEEE-RAS International Conference on Humanoid Robots. 2014. p. 120–127.
Google Scholar
Stasse O, Flayols T, Budhiraja R, et al. TALOS: a new humanoid research platform targeted for industrial applications. In: 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids). 2017. p. 689–695.
Google Scholar
Kanemoto Y, Yoshiike T, Muromachi M, et al. Compact and high performance torque-controlled actuators and its implementation to disaster response robot. In: 2018 IEEE International Conference on Robotics and Automation (ICRA). 2018. p. 4057–4063.
Google Scholar
Gong Y, Hartley R, Da X, et al. Feedback control of a Cassie bipedal robot: walking, standing, and riding a segway. In: 2019 American Control Conference (ACC). 2019. p. 4559–4566.
Google Scholar
6th workshop on legged robots; 2022. Available from: https://leggedrobots.org/lrws_2022/index.html
Google Scholar
Nishiwaki K, Kagami S, Kuniyoshi Y, et al. Online generation of humanoid walking motion based on a fast generation method of motion pattern that follows desired ZMP. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2002. p. 2684–2689.
Google Scholar
Kajita S, Kanehiro F, Kaneko K, et al. Biped walking pattern generation by using preview control of zero-moment point. In: IEEE International Conference on Robotics and Automation. Vol. 2, 2003. p. 1620–1626.
Google Scholar
Nishiwaki K, Kagami S. High frequency walking pattern generation based on preview control of ZMP. In: IEEE International Conference on Robotics and Automation. 2006. p. 2667–2672.
Google Scholar
Urata J, Nshiwaki K, Nakanishi Y, et al. Online decision of foot placement using singular LQ preview regulation. In: IEEE-RAS International Conference on Humanoid Robots. 2011. p. 13–18.
Google Scholar
Erez T, Lowrey K, Tassa Y, et al. An integrated system for real-time model predictive control of humanoid robots. In: IEEE-RAS International Conference on Humanoid Robots. 2013. p. 292–299.
Google Scholar
Feng S, Xinjilefu X, Huang W, et al. 3D walking based on online optimization. In: IEEE-RAS International Conference on Humanoid Robots. 2013. p. 21–27.
Google Scholar
Faraji S, Pouya S, Atkeson CG, et al. Versatile and robust 3D walking with a simulated humanoid robot (Atlas): a model predictive control approach. In: IEEE International Conference on Robotics and Automation. 2014. p. 1943–1950.
Google Scholar
Henze B, Ott C, Roa MA. Posture and balance control for humanoid robots in multi-contact scenarios based on model predictive control. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2014. p. 3253–3258.
Google Scholar
Koenemann J, Del Prete A, Tassa Y, et al. Whole-body model-predictive control applied to the HRP-2 humanoid. In: 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). 2015. p. 3346–3351.
Google Scholar
Feng S, Xinjilefu X, Atkeson CG, et al. Robust dynamic walking using online foot step optimization. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2016. p. 5373–5378.
Google Scholar
Kuindersma S, Deits R, Fallon M, et al. Optimization based locomotion planning, estimation and control design for the atlas humanoid robot. Auton Robots. 2016;40(3):429–455.
Web of Science ®Google Scholar
Bledt G, Wensing PM, Kim S. Policy-regularized model predictive control to stabilize diverse quadrupedal gaits for the MIT cheetah. In: 2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE; 2017. p. 4102–4109.
Google Scholar
Naveau M, Kudruss M, Stasse O, et al. A reactive walking pattern generator based on nonlinear model predictive control. IEEE Robot Autom Lett. 2017;2(1):10–17.
Web of Science ®Google Scholar
Farshidian F, Jelavic E, Satapathy A, et al. Real-time motion planning of legged robots: a model predictive control approach. In: 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids). IEEE; 2017. p. 577–584.
Google Scholar
Kamioka T, Kaneko H, Takenaka T, et al. Simultaneous optimization of ZMP and footsteps based on the analytical solution of divergent component of motion. In: IEEE International Conference on Robotics and Automation. 2018. p. 1763–1770.
Google Scholar
Di Carlo J, Wensing PM, Katz B, et al. Dynamic locomotion in the MIT cheetah 3 through convex model-predictive control. In: 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). 2018. p. 1–9.
Google Scholar
Neunert M, Stäuble M, Giftthaler M, et al. Whole-body nonlinear model predictive control through contacts for quadrupeds. IEEE Robot Autom Lett. 2018;3(3):1458–1465.
Google Scholar
Bledt G, Kim S. Implementing regularized predictive control for simultaneous real-time footstep and ground reaction force optimization. In: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE; 2019. p. 6316–6323.
Google Scholar
Agravante DJ, Cherubini A, Sherikov A, et al. Human-humanoid collaborative carrying. IEEE Trans Robot. 2019;35(4):833–846.
Web of Science ®Google Scholar
Bledt G, Kim S. Extracting legged locomotion heuristics with regularized predictive control. In: IEEE International Conference on Robotics and Automation. 2020. p. 406–412.
Google Scholar
Chignoli M, Wensing PM. Variational-based optimal control of underactuated balancing for dynamic quadrupeds. IEEE Access. 2020;8:49785–49797.
Web of Science ®Google Scholar
Ishihara K, Itoh TD, Morimoto J. Full-body optimal control toward versatile and agile behaviors in a humanoid robot. IEEE Robot Autom Lett. 2020;5(1):119–126.
Web of Science ®Google Scholar
Scianca N, De Simone D, Lanari L, et al. MPC for humanoid gait generation: stability and feasibility. IEEE Trans Robot. 2020;36(4):1171–1188.
Web of Science ®Google Scholar
Bjelonic M, Grandia R, Harley O, et al. Whole-body MPC and online gait sequence generation for wheeled-legged robots. In: 2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE; 2021. p. 8388–8395.
Google Scholar
Grandia R, Taylor AJ, Ames AD, et al. Multi-layered safety for legged robots via control barrier functions and model predictive control. In: 2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE; 2021. p. 8352–8358.
Google Scholar
Sleiman JP, Farshidian F, Minniti MV, et al. A unified MPC framework for whole-body dynamic locomotion and manipulation. IEEE Robot Autom Lett. 2021;6(3):4688–4695.
Web of Science ®Google Scholar
Xin G, Xin S, Cebe O, et al. Robust footstep planning and LQR control for dynamic quadrupedal locomotion. IEEE Robot Autom Lett. 2021;6(3):4488–4495.
Web of Science ®Google Scholar
Xiong X, Reher J, Ames AD. Global position control on underactuated bipedal robots: step-to-step dynamics approximation for step planning. In: IEEE International Conference on Robotics and Automation. 2021. p. 2825–2831.
Google Scholar
Ding J, Zhou C, Xin S, et al. Nonlinear model predictive control for robust bipedal locomotion: exploring angular momentum and com height changes. Adv Robot. 2021;35(18):1079–1097.
Web of Science ®Google Scholar
Dantec E, Budhiraja R, Roig A, et al. Whole body model predictive control with a memory of motion: experiments on a torque-controlled TALOS. In: 2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE; 2021. p. 8202–8208.
Google Scholar
Rathod N, Bratta A, Focchi M, et al. Model predictive control with environment adaptation for legged locomotion. IEEE Access. 2021;9:145710–145727.
Web of Science ®Google Scholar
Romualdi G, Dafarra S, L'Erario G, et al. Online non-linear centroidal MPC for humanoid robot locomotion with step adjustment. In: IEEE International Conference on Robotics and Automation. 2022. p. 10412–10419.
Google Scholar
Dantec E, Taix M, Mansard N. First order approximation of model predictive control solutions for high frequency feedback. IEEE Robot Autom Lett. 2022;7(2):4448–4455.
Web of Science ®Google Scholar
Jeon SH, Kim S, Kim D. Online optimal landing control of the MIT Mini Cheetah. In: IEEE International Conference on Robotics and Automation. 2022. p. 178–184.
Google Scholar
Kajita S, Espiau B. Legged robots. Berlin, Heidelberg: Springer Handbook of Robotics; 2008.
Google Scholar
Sugihara T, Morisawa M. A survey: dynamics of humanoid robots. Adv Robot. 2020;34(21-22):1338–1352.
Web of Science ®Google Scholar
Tazaki Y, Murooka M. A survey of motion planning techniques for humanoid robots. Adv Robot. 2020;34(21-22):1370–1379.
Web of Science ®Google Scholar
Carpentier J, Wieber PB. Recent progress in legged robots locomotion control. Curr Robot Rep. 2021;2:231–238.
Google Scholar
Perrin N. Biped footstep planning. In: Humanoid robotics: a reference. Springer Netherlands; 2019. p. 1697–1717.
Google Scholar
Wieber PB. Model predictive control. In: Humanoid robotics: a reference. Springer Netherlands; 2019. p. 1077–1097.
Google Scholar
Bouyarmane K, Caron S, Escande A, et al. Multi-contact motion planning and control. In: Humanoid robotics: a reference. Springer Netherlands; 2019. p. 1763–1804.
Google Scholar
Featherstone R. Rigid body dynamics algorithms. Berlin, Heidelberg: Springer; 2008.
Google Scholar
Carpentier J, Mansard N. Multicontact locomotion of legged robots. IEEE Trans Robot. 2018;34(6):1441–1460.
Web of Science ®Google Scholar
Kurtz V, Li H, Wensing PM, et al. Mini Cheetah, the falling cat: a case study in machine learning and trajectory optimization for robot acrobatics. In: IEEE International Conference on Robotics and Automation. 2022. p. 4635–4641.
Google Scholar
Stewart DE, Trinkle JC. An implicit time-stepping scheme for rigid body dynamics with inelastic collisions and coulomb friction. Int J Numer Methods Eng. 1996;39(15):2673–2691.
Web of Science ®Google Scholar
Neunert M, Farshidian F, Winkler AW, et al. Trajectory optimization through contacts and automatic gait discovery for quadrupeds. IEEE Robot Autom Lett. 2017;2(3):1502–1509.
Web of Science ®Google Scholar
Todorov E. Convex and analytically-invertible dynamics with contacts and constraints: theory and implementation in mujoco. In: 2014 IEEE International Conference on Robotics and Automation (ICRA). IEEE; 2014. p. 6054–6061.
Google Scholar
Todorov E, Erez T, Tassa Y. Mujoco: a physics engine for model-based control. In: 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE; 2012. p. 5026–5033.
Google Scholar
Acosta B, Yang W, Posa M. Validating robotics simulators on real world impacts. IEEE Robot Autom Lett. 2022;7(3):6471–6478.
Web of Science ®Google Scholar
Kajita S, Tani K. Study of dynamic biped locomotion on rugged terrain – derivation and application of the linear inverted pendulum mode. In: IEEE International Conference on Robotics and Automation. 1991. p. 1405–1411.
Google Scholar
Caron S, Pham QC, Nakamura Y. ZMP support areas for multicontact mobility under frictional constraints. IEEE Trans Robot. 2017;33(1):67–80.
Web of Science ®Google Scholar
Koolen T, de Boer T, Rebula J, et al. Capturability-based analysis and control of legged locomotion, part 1: theory and application to three simple gait models. Int J Robot Res. 2012;31(9):1094–1113.
Web of Science ®Google Scholar
Ferreau HJ, Kirches C, Potschka A, et al. qpOASES: a parametric active-set algorithm for quadratic programming. Math Program Comput. 2014;6(4):327–363.
Google Scholar
Frasch JV, Sager S, Diehl M. A parallel quadratic programming method for dynamic optimization problems. Math Program Comput. 2015;7(3):289–329.
Web of Science ®Google Scholar
Mattingley J, Boyd S. CVXGEN: a code generator for embedded convex optimization. Optim Eng. 2012;13(1):1–27.
Web of Science ®Google Scholar
Frison G, Diehl M. Hpipm: a high-performance quadratic programming framework for model predictive control. IFAC-PapersOnLine. 2020;53(2):6563–6569.
Google Scholar
Pandala AG, Ding Y, Park HW. qpSWIFT: a real-time sparse quadratic program solver for robotic applications. IEEE Robot Autom Lett. 2019;4(4):3355–3362.
Web of Science ®Google Scholar
Stellato B, Banjac G, Goulart P, et al. OSQP: an operator splitting solver for quadratic programs. Math Program Comput. 2020;12(4):637–672.
Web of Science ®Google Scholar
Kouzoupis D, Frison G, Zanelli A, et al. Recent advances in quadratic programming algorithms for nonlinear model predictive control. Vietnam J Math. 2018;46(4):863–882.
Web of Science ®Google Scholar
Ohtsuka T. A continuation/GMRES method for fast computation of nonlinear receding horizon control. Automatica. 2004;40(4):563–574.
Web of Science ®Google Scholar
Diehl M, Bock H, Schlöder JP. A real-time iteration scheme for nonlinear optimization in optimal feedback control. SIAM J Control Optim. 2005;43(5):1714–1736.
Web of Science ®Google Scholar
Zavala VM, Biegler LT. The advanced-step NMPC controller: optimality, stability and robustness. Automatica. 2009;45(1):86–93.
Web of Science ®Google Scholar
Zanelli A, Quirynen R, Jerez J, et al. A homotopy-based nonlinear interior-point method for NMPC. IFAC-PapersOnLine. 2017;50(1):13188–13193.
Google Scholar
Nocedal J, Wright SJ. Numerical optimization. 2nd ed. New York: Springer; 2006.
Google Scholar
Gill PE, Murray W, Saunders MA. SNOPT: an SQP algorithm for large-scale constrained optimization. SIAM Rev. 2005;47(1):99–131.
Web of Science ®Google Scholar
Wächter A, Biegler L. On the implementation of an interior-point filter line-search algorithm for large-scale nonlinear programming. Math Program. 2006;106(1):25–57.
Web of Science ®Google Scholar
Bock H, Plitt K. A multiple shooting algorithm for direct solution of optimal control problems. In: 9th IFAC World Congress. 1984. p. 1603–1608.
Google Scholar
Rawlings K, Mayne DQ, Diehl M. Model predictive control: theory, computation, and design. Madison, Wisconsin: Nob Hill; 2017.
Google Scholar
Rao C, Wright SJ, Rawlings JB. Application of interior-point methods to model predictive control. J Optim Theory Appl. 1998;99(3):723–757.
Web of Science ®Google Scholar
Wang Y, Boyd S. Fast model predictive control using online optimization. IEEE Trans Control Syst Technol. 2010;18(2):267–278.
Web of Science ®Google Scholar
Zanelli A, Domahidi A, Jerez J, et al. FORCES NLP: an efficient implementation of interior-point methods for multistage nonlinear nonconvex programs. Int J Control. 2020;93(1):13–29.
Web of Science ®Google Scholar
Deng H, Ohtsuka T. A parallel Newton-type method for nonlinear model predictive control. Automatica. 2019;109:108560.
Web of Science ®Google Scholar
Verschueren R, Frison G, Kouzoupis D, et al. Acados – a modular open-source framework for fast embedded optimal control. Math Program Comput. 2021;14:147–183.
Web of Science ®Google Scholar
Howell TA, Jackson BE, Manchester Z. ALTRO: A fast solver for constrained trajectory optimization. In: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). 2019. p. 7674–7679.
Google Scholar
Sideris A, Rodriguez LA. A Riccati approach for constrained linear quadratic optimal control. Int J Control. 2011;84(2):370–380.
Web of Science ®Google Scholar
Giftthaler M, Buchli J. A projection approach to equality constrained iterative linear quadratic optimal control. In: 2017 IEEE-RAS 17th International Conference on Humanoid Robotics (Humanoids). 2017. p. 61–66.
Google Scholar
Katayama S, Ohtsuka T. Efficient Riccati recursion for optimal control problems with pure-state equality constraints. In: American Control Conference (ACC) 2022 (to appear). 2022. p. 3579–3586.
Google Scholar
Jacobson DH, Mayne DQ. Differential dynamic programming. New York: Elsevier; 1970.
Google Scholar
Todorov E, Li W. A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems. In: Proceedings of the 2005, American Control Conference. 2005. p. 300–306.
Google Scholar
Mastalli C, Budhiraja R, Merkt W, et al. Crocoddyl: an efficient and versatile framework for multi-contact optimal control. In: 2020 IEEE International Conference on Robotics and Automation (ICRA). 2020. p. 2536–2542.
Google Scholar
Grandia R, Farshidian F, Ranftl R, et al. Feedback MPC for torque-controlled legged robots. In: 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE; 2019. p. 4730–4737.
Google Scholar
Sleiman JP, Farshidian F, Hutter M. Constraint handling in continuous-time DDP-based model predictive control. In: 2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE; 2021. p. 8209–8215.
Google Scholar
Goebel R, Sanfelice RG, Teel AR. Hybrid dynamical systems. Princeton, New Jersey: Princeton University Press; 2012.
Google Scholar
Belotti P, Kirches C, Leyffer S, et al. Mixed-integer nonlinear optimization. Acta Numer. 2013;22:1–131.
Web of Science ®Google Scholar
Sager S, Jung M, Kirches C. Combinatorial integral approximation. Math Methods Oper Res. 2011;73(3):363–380.
Web of Science ®Google Scholar
Bürger A, Zeile C, Hahn M, et al. pycombina: an open-source tool for solving combinatorial approximation problems arising in mixed-integer optimal control. IFAC-PapersOnLine. 2020;53(2):6502–6508.
Google Scholar
Hespanhol P, Quirynen R, Di Cairano S. A structure exploiting branch-and-bound algorithm for mixed-integer model predictive control. In: 2019 18th European Control Conference (ECC). IEEE; 2019. p. 2763–2768.
Google Scholar
Marcucci T, Tedrake R. Warm start of mixed-integer programs for model predictive control of hybrid systems. IEEE Trans Automat Contr. 2020;66(6):2433–2448.
Web of Science ®Google Scholar
Yunt K, Glocker C. Trajectory optimization of mechanical hybrid systems using SUMT. In: 9th IEEE International Workshop on Advanced Motion Control. IEEE; 2006. p. 665–671.
Google Scholar
Scheel H, Scholtes S. Mathematical programs with complementarity constraints: stationarity, optimality, and sensitivity. Math Oper Res. 2000;25(1):1–22.
Web of Science ®Google Scholar
Posa M, Cantu C, Tedrake R. A direct method for trajectory optimization of rigid bodies through contact. Int J Rob Res. 2014;33(1):69–81.
Web of Science ®Google Scholar
Hoheisel T, Kanzow C, Schwartz A. Theoretical and numerical comparison of relaxation methods for mathematical programs with complementarity constraints. Math Program. 2013;137(1-2):257–288.
Web of Science ®Google Scholar
Nurkanovic A, Albrecht S, Diehl M. Limits of MPCC formulations in direct optimal control with nonsmooth differential equations. In: 2020 European Control Conference (ECC). 2020. p. 2015–2020.
Google Scholar
Carius J, Ranftl R, Koltun V, et al. Trajectory optimization with implicit hard contacts. IEEE Robot Autom Lett. 2018;3(4):3316–3323.
Web of Science ®Google Scholar
Carius J, Ranftl R, Koltun V, et al. Trajectory optimization for legged robots with slipping motions. IEEE Robot Autom Lett. 2019;4(3):3013–3020.
Web of Science ®Google Scholar
Alp Aydinoglu MP. Real-time multi-contact model predictive control via ADMM. In: 2021 IEEE International Conference on Robotics and Automation (ICRA). 2022. p. 3414–3421.
Google Scholar
Dempe S, Dutta J. Is bilevel programming a special case of a mathematical program with complementarity constraints?. Math Program. 2012;131(1-2):37–48.
Web of Science ®Google Scholar
Colson B, Marcotte P, Savard G. An overview of bilevel optimization. Ann Oper Res. 2007;153(1):235–256.
Web of Science ®Google Scholar
Xu X, Antsaklis PJ. Optimal control of switched systems based on parameterization of the switching instants. IEEE Trans Automat Contr. 2004;49(1):2–16.
Web of Science ®Google Scholar
Patterson MA, Rao AV. GPOPS-II: a MATLAB software for solving multiple-phase optimal control problems using hp-adaptive Gaussian quadrature collocation methods and sparse nonlinear programming. ACM Trans Math Softw. 2014;41(1).1–37.
Web of Science ®Google Scholar
Xu X, Antsaklis PJ. Optimal control of switched systems via non-linear optimization based on direct differentiations of value functions. Int J Control. 2002;75(16-17):1406–1426.
Web of Science ®Google Scholar
Farshidian F, Neunert M, Winkler AW, et al. An efficient optimal planning and control framework for quadrupedal locomotion. In: 2017 IEEE International Conference on Robotics and Automation (ICRA). 2017. p. 93–100.
Google Scholar
Li H, Wensing PM. Hybrid systems differential dynamic programming for whole-body motion planning of legged robots. IEEE Robot Autom Lett. 2020;5(4):5448–5455.
Web of Science ®Google Scholar
Tedrake R, Kuindersma S, Deits R, et al. A closed-form solution for real-time ZMP gait generation and feedback stabilization. In: IEEE-RAS International Conference on Humanoid Robots. 2015. p. 936–940.
Google Scholar
Katayama T, Ohki T, Inoue T, et al. Design of an optimal controller for a discrete-time system subject to previewable demand. Int J Control. 1985;41(3):677–699.
Web of Science ®Google Scholar
Likhachev M, Ferguson D, Gordon G, et al. Anytime dynamic A*: an anytime, replanning algorithm. In: International Conference on Automated Planning and Scheduling. 2005.
Google Scholar
Chestnutt J, Kuffner J, Nishiwaki K, et al. Planning biped navigation strategies in complex environments. In: IEEE-RAS International Conference on Humanoid Robots. 2003.
Google Scholar
Hornung A, Dornbush A, Likhachev M, et al. Anytime search-based footstep planning with suboptimality bounds. In: IEEE-RAS International Conference on Humanoid Robots. 2012. p. 674–679.
Google Scholar
Deits R, Tedrake R. Footstep planning on uneven terrain with mixed-integer convex optimization. In: IEEE-RAS International Conference on Humanoid Robots. 2014. p. 279–286.
Google Scholar
Tonneau S, Song D, Fernbach P, et al. SL1M: Sparse L1-norm minimization for contact planning on uneven terrain. In: IEEE International Conference on Robotics and Automation. 2020. p. 6604–6610.
Google Scholar
Park J, Youm Y. General ZMP preview control for bipedal walking. In: IEEE International Conference on Robotics and Automation. 2007. p. 2682–2687.
Google Scholar
Wieber P. Trajectory free linear model predictive control for stable walking in the presence of strong perturbations. In: IEEE-RAS International Conference on Humanoid Robots. 2006. p. 137–142.
Google Scholar
Herdt A, Perrin N, Wieber PB. LMPC based online generation of more efficient walking motions. In: IEEE-RAS International Conference on Humanoid Robots. 2012. p. 390–395.
Google Scholar
Goto K, Tazaki Y, Suzuki T. Bipedal locomotion control based on simultaneous trajectory and foot step planning. J Robot Mechatron. 2016;28(4):533–542.
Web of Science ®Google Scholar
Xin S, Orsolino R, Tsagarakis N. Online relative footstep optimization for legged robots dynamic walking using discrete-time model predictive control. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2019. p. 513–520.
Google Scholar
Herdt A, Diedam H, Wieber PB, et al. Online walking motion generation with automatic footstep placement. Adv Robot. 2010;24(5-6):719–737.
Web of Science ®Google Scholar
Dimitrov D, Paolillo A, Wieber PB. Walking motion generation with online foot position adaptation based on l1- and l∞-norm penalty formulations. In: IEEE International Conference on Robotics and Automation. 2011. p. 3523–3529.
Google Scholar
Griffin RJ, Leonessa A. Model predictive control for dynamic footstep adjustment using the divergent component of motion. In: IEEE International Conference on Robotics and Automation. 2016. p. 1763–1768.
Google Scholar
Shafiee-Ashtiani M, Yousefi-Koma A, Shariat-Panahi M. Robust bipedal locomotion control based on model predictive control and divergent component of motion. In: IEEE International Conference on Robotics and Automation. 2017. p. 3505–3510.
Google Scholar
Jeong H, Sim O, Bae H, et al. Biped walking stabilization based on foot placement control using capture point feedback. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2017. p. 5263–5269.
Google Scholar
Kojio Y, Ishiguro Y, Nguyen KNK, et al. Unified balance control for biped robots including modification of footsteps with angular momentum and falling detection based on capturability. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2019. p. 497–504.
Google Scholar
Stephens BJ, Atkeson CG. Push recovery by stepping for humanoid robots with force controlled joints. In: IEEE-RAS International Conference on Humanoid Robots. 2010. p. 52–59.
Google Scholar
Murooka M, Chappellet K, Tanguy A, et al. Humanoid loco-manipulations pattern generation and stabilization control. IEEE Robot Autom Lett. 2021;6(3):5597–5604.
Web of Science ®Google Scholar
Mason S, Rotella N, Schaal S, et al. An MPC walking framework with external contact forces. In: IEEE International Conference on Robotics and Automation. 2018. p. 1785–1790.
Google Scholar
Morisawa M, Cisneros R, Benallegue M, et al. Sequential trajectory generation for dynamic multi-contact locomotion synchronizing contact. Int J HR. 2020;17(1):2050003.
Web of Science ®Google Scholar
Feng S, Whitman E, Xinjilefu X, et al. Optimization-based full body control for the darpa robotics challenge. J Field Robot. 2015;32(2):293–312.
Web of Science ®Google Scholar
Van Heerden K. Real-time variable center of mass height trajectory planning for humanoids robots. IEEE Robot Autom Lett. 2017;2(1):135–142.
Web of Science ®Google Scholar
Brasseur C, Sherikov A, Collette C, et al. A robust linear MPC approach to online generation of 3D biped walking motion. In: IEEE-RAS International Conference on Humanoid Robots. 2015. p. 595–601.
Google Scholar
Takenaka T, Matsumoto T, Yoshiike T. Real time motion generation and control for biped robot - 1st report: walking gait pattern generation. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2009. p. 1084–1091.
Google Scholar
Englsberger J, Ott C, Albu-Schäffer A. Three-dimensional bipedal walking control based on divergent component of motion. IEEE Trans Robot. 2015;31(2):355–368.
Web of Science ®Google Scholar
Sugihara T, Yamamoto T. Foot-guided agile control of a biped robot through ZMP manipulation. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2017. p. 4546–4551.
Google Scholar
Byl K, Shkolnik A, Prentice S, et al. Reliable dynamic motions for a stiff quadruped. In: Experimental Robotics. 2009. p. 319–328.
Google Scholar
Horvat T, Melo K, Ijspeert AJ. Model predictive control based framework for CoM control of a quadruped robot. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2017. p. 3372–3378.
Google Scholar
Orin DE, Goswami A, Lee SH. Centroidal dynamics of a humanoid robot. Auton Robots. 2013;35(2-3):161–176.
Web of Science ®Google Scholar
Audren H, Vaillant J, Kheddar A, et al. Model preview control in multi-contact motion-application to a humanoid robot. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2014. p. 4030–4035.
Google Scholar
Dai H, Tedrake R. Planning robust walking motion on uneven terrain via convex optimization. In: IEEE-RAS International Conference on Humanoid Robots. 2016. p. 579–586.
Google Scholar
Fernbach P, Tonneau S, Taïx M. CROC: Convex resolution of centroidal dynamics trajectories to provide a feasibility criterion for the multi contact planning problem. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2018. p. 1–9.
Google Scholar
Ponton B, Herzog A, Schaal S, et al. A convex model of humanoid momentum dynamics for multi-contact motion generation. In: IEEE-RAS International Conference on Humanoid Robots. 2016. p. 842–849.
Google Scholar
García G, Griffin R, Pratt J. MPC-based locomotion control of bipedal robots with line-feet contact using centroidal dynamics. In: IEEE-RAS International Conference on Humanoid Robots. 2021. p. 276–282.
Google Scholar
Caron S, Kheddar A. Multi-contact walking pattern generation based on model preview control of 3D COM accelerations. In: IEEE-RAS International Conference on Humanoid Robots. 2016. p. 550–557.
Google Scholar
Dai H, Valenzuela A, Tedrake R. Whole-body motion planning with centroidal dynamics and full kinematics. In: IEEE-RAS International Conference on Humanoid Robots. 2014. p. 295–302.
Google Scholar
Herzog A, Rotella N, Schaal S, et al. Trajectory generation for multi-contact momentum control. In: IEEE-RAS International Conference on Humanoid Robots. 2015. p. 874–880.
Google Scholar
Kudruss M, Naveau M, Stasse O, et al. Optimal control for whole-body motion generation using center-of-mass dynamics for predefined multi-contact configurations. In: IEEE-RAS International Conference on Humanoid Robots. 2015. p. 684–689.
Google Scholar
Chignoli M, Kim D, Stanger-Jones E, et al. The MIT humanoid robot: design, motion planning, and control for acrobatic behaviors. In: IEEE-RAS International Conference on Humanoid Robots. 2021. p. 1–8.
Google Scholar
Ding Y, Pandala A, Park HW. Real-time model predictive control for versatile dynamic motions in quadrupedal robots. In: 2019 International Conference on Robotics and Automation (ICRA). IEEE; 2019. p. 8484–8490.
Google Scholar
Shah P, Meduri A, Merkt W, et al. Rapid convex optimization of centroidal dynamics using block coordinate descent. In: IEEE/RSJ International Conference on Intelligent Robots and Systems. 2021. p. 1658–1665.
Google Scholar
Bellicoso CD, Krämer K, Stäuble M, et al. ALMA-articulated locomotion and manipulation for a torque-controllable robot. In: 2019 International Conference on Robotics and Automation (ICRA). IEEE; 2019. p. 8477–8483.
Google Scholar
Muico U, Lee Y, Popović J, et al. Contact-aware nonlinear control of dynamic characters. In: Acm siggraph 2009 papers. 2009. p. 1–9.
Google Scholar
Luh JYS, Walker MW, Paul RPC. On-Line computational scheme for mechanical manipulators. J Dyn Syst Meas Control. 1980;102(2):69–76.06.
Web of Science ®Google Scholar
Featherstone R. The calculation of robot dynamics using articulated-body inertias. Int J Rob Res. 1983;2(1):13–30.
Web of Science ®Google Scholar
Carpentier J, Budhiraja R, Mansard N. Proximal and sparse resolution of constrained dynamic equations. In: Robotics: Science and Systems (RSS 2021). 2021.
Google Scholar
Neunert M, Giftthaler M, Frigerio M, et al. Fast derivatives of rigid body dynamics for control, optimization and estimation. In: 2016 IEEE International Conference on Simulation, Modeling, and Programming for Autonomous Robots (SIMPAR). 2016. p. 91–97.
Google Scholar
Carpentier J, Mansard N. Analytical derivatives of rigid body dynamics algorithms. In: Robotics: Science and Systems (RSS 2018). 2018. p. hal–01790971v2f.
Google Scholar
Singh S, Russell R, Wensing PM. Efficient analytical derivatives of rigid-body dynamics using spatial vector algebra. IEEE Robot Autom Lett. 2022;7(2):1776–1783.
Web of Science ®Google Scholar
Ayusawa K, Yoshida E. Comprehensive theory of differential kinematics and dynamics towards extensive motion optimization framework. Int J Rob Res. 2018;37(13-14):1554–1572.
Web of Science ®Google Scholar
Nganga JN, Wensing PM. Accelerating second-order differential dynamic programming for rigid-body systems. IEEE Robot Autom Lett. 2021;6(4):7659–7666.
Web of Science ®Google Scholar
Quirynen R, Houska B, Diehl M. Efficient symmetric Hessian propagation for direct optimal control. J Process Control. 2017;50:19–28.
Web of Science ®Google Scholar
Frigerio M, Buchli J, Caldwell DG, et al. RobCoGen: a code generator for efficient kinematics and dynamics of articulated robots, based on domain specific languages. J Softw Eng Robot. 2016;7(1):36–54.
Google Scholar
Carpentier J, Saurel G, Buondonno G, et al. The Pinocchio C++ library – a fast and flexible implementation of rigid body dynamics algorithms and their analytical derivatives. In: International Symposium on System Integration (SII). 2019. p. 614–619.
Google Scholar
Mordatch I, Todorov E, Popović Z. Discovery of complex behaviors through contact-invariant optimization. ACM Trans Graph. 2012;31(4):1–8.
Web of Science ®Google Scholar
Budhiraja R, Carpentier J, Mastalli C, et al. Differential dynamic programming for multi-phase rigid contact dynamics. In: 2018 IEEE-RAS 18th International Conference on Humanoid Robots (Humanoids). IEEE; 2018. p. 1–9.
Google Scholar
Schultz G, Mombaur K. Modeling and optimal control of human-like running. IEEE ASME Trans Mechatron. 2010;15(5):783–792.
Web of Science ®Google Scholar
Andersson JAE, Gillis J, Horn G, et al. CasADi – a software framework for nonlinear optimization and optimal control. Math Program Comput. 2019;11(1):1–36.
Web of Science ®Google Scholar
Giftthaler M, Neunert M, Stäuble M, et al. The control toolbox – an open-source C++ library for robotics, optimal and model predictive control. In: 2018 IEEE International Conference on Simulation, Modeling, and Programming for Autonomous Robots (SIMPAR). IEEE; 2018. p. 123–129.
Google Scholar
Galliker MY, Grandia R, Farshidian F, et al. OCS2: an open source library for optimal control of switched systems. 2017–2022. Available from: https://github.com/leggedrobotics/ocs2
Google Scholar
Farshidian F, Kamgarpour M, Pardo D, et al. Sequential linear quadratic optimal control for nonlinear switched systems. IFAC-PapersOnLine. 2017;50(1):1463–1469. 20th IFAC World Congress.
Google Scholar
Katayama S. robotoc. 2020–2022. Available from: https://github.com/mayataka/robotoc
Google Scholar
Grandia R, Taylor AJ, Singletary A, et al. Nonlinear model predictive control of robotic systems with control Lyapunov functions. In: Robotics: Science and Systems (RSS 2020). 2020.
Google Scholar
Henze B, Roa MA, Ott C. Passivity-based whole-body balancing for torque-controlled humanoid robots in multi-contact scenarios. Int J Rob Res. 2016;35(12):1522–1543.
Web of Science ®Google Scholar
Englsberger J, Dietrich A, Mesesan GA, et al. MPTC-modular passive tracking controller for stack of tasks based control frameworks. In: Robotics: Science and Systems (RSS 2020). 2020.
Google Scholar
Ramuzat N, Boria S, Stasse O. Passive inverse dynamics control using a global energy tank for torque-controlled humanoid robots in multi-contact. IEEE Robotics and Automation Letters; 2022.
Google Scholar
Ames AD, Xu X, Grizzle JW, et al. Control barrier function based quadratic programs for safety critical systems. IEEE Trans Automat Contr. 2016;62(8):3861–3876.
Web of Science ®Google Scholar
Scokaert PO, Mayne DQ. Min-max feedback model predictive control for constrained linear systems. IEEE Trans Automat Contr. 1998;43(8):1136–1142.
Web of Science ®Google Scholar
Köhler J, Soloperto R, Müller MA, et al. A computationally efficient robust model predictive control framework for uncertain nonlinear systems. IEEE Trans Automat Contr. 2020;66(2):794–801.
Web of Science ®Google Scholar
Lorenzen M, Dabbene F, Tempo R, et al. Constraint-tightening and stability in stochastic model predictive control. IEEE Trans Automat Contr. 2016;62(7):3165–3177.
Web of Science ®Google Scholar
Minniti MV, Grandia R, Fäh K, et al. Model predictive robot-environment interaction control for mobile manipulation tasks. In: 2021 IEEE International Conference on Robotics and Automation (ICRA). IEEE; 2021. p. 1651–1657.
Google Scholar
Minniti MV, Grandia R, Farshidian F, et al. Adaptive CLF-MPC with application to quadrupedal robots. IEEE Robot Autom Lett. 2022;7(1):565–572.
Web of Science ®Google Scholar
Zanon M, Gros S. Safe reinforcement learning using robust MPC. IEEE Trans Automat Contr. 2020;66(8):3638–3652.
Web of Science ®Google Scholar

Model predictive control of legged and humanoid robots: models and algorithms