multibody.html

<!DOCTYPE html>

<html>

  <head>
    <title>Ch. 22 - Multi-Body
  Dynamics</title>
    <meta name="Ch. 22 - Multi-Body
  Dynamics" content="text/html; charset=utf-8;" />
    <link rel="canonical" href="http://underactuated.mit.edu/multibody.html" />

    <script src="https://hypothes.is/embed.js" async></script>
    <script type="text/javascript" src="chapters.js"></script>
    <script type="text/javascript" src="htmlbook/book.js"></script>

    <script src="htmlbook/mathjax-config.js" defer></script>
    <script type="text/javascript" id="MathJax-script" defer
      src="htmlbook/MathJax/es5/tex-chtml.js">
    </script>
    <script>window.MathJax || document.write('<script type="text/javascript" src="https://cdn.jsdelivr.net/npm/mathjax@3/es5/tex-chtml.js" defer><\/script>')</script>

    <link rel="stylesheet" href="htmlbook/highlight/styles/default.css">
    <script src="htmlbook/highlight/highlight.pack.js"></script> <!-- http://highlightjs.readthedocs.io/en/latest/css-classes-reference.html#language-names-and-aliases -->
    <script>hljs.initHighlightingOnLoad();</script>

    <link rel="stylesheet" type="text/css" href="htmlbook/book.css" />
  </head>

<body onload="loadChapter('underactuated');">

<div data-type="titlepage">
  <header>
    <h1><a href="index.html" style="text-decoration:none;">Underactuated Robotics</a></h1>
    <p data-type="subtitle">Algorithms for Walking, Running, Swimming, Flying, and Manipulation</p>
    <p style="font-size: 18px;"><a href="http://people.csail.mit.edu/russt/">Russ Tedrake</a></p>
    <p style="font-size: 14px; text-align: right;">
      &copy; Russ Tedrake, 2022<br/>
      Last modified <span id="last_modified"></span>.</br>
      <script>
      var d = new Date(document.lastModified);
      document.getElementById("last_modified").innerHTML = d.getFullYear() + "-" + (d.getMonth()+1) + "-" + d.getDate();</script>
      <a href="misc.html">How to cite these notes, use annotations, and give feedback.</a><br/>
    </p>
  </header>
</div>

<p><b>Note:</b> These are working notes used for <a
href="http://underactuated.csail.mit.edu/Spring2022/">a course being taught
at MIT</a>. They will be updated throughout the Spring 2022 semester.  <a
href="https://www.youtube.com/channel/UChfUOAhz7ynELF-s_1LPpWg">Lecture videos are available on YouTube</a>.</p>

<table style="width:100%;"><tr style="width:100%">
  <td style="width:33%;text-align:left;"><a class="previous_chapter" href=drake.html>Previous Chapter</a></td>
  <td style="width:33%;text-align:center;"><a href=index.html>Table of contents</a></td>
  <td style="width:33%;text-align:right;"><a class="next_chapter" href=optimization.html>Next Chapter</a></td>
</tr></table>

<script type="text/javascript">document.write(notebook_header('multibody'))
</script>
<!-- EVERYTHING ABOVE THIS LINE IS OVERWRITTEN BY THE INSTALL SCRIPT -->
<chapter class="appendix" style="counter-reset: chapter 1"><h1>Multi-Body
  Dynamics</h1>

  <section><h1>Deriving the equations of motion</h1>

    <p>The equations of motion for a standard robot can be derived using the
    method of Lagrange.  Using $T$ as the total kinetic energy of the system,
    and $U$ as the total potential energy of the system, $L = T-U$, and $Q_i$
    as the generalized force corresponding to $q_i$, the Lagrangian dynamic
    equations are: \begin{equation} \frac{d}{dt}\pd{L}{\dot{q}_i} - \pd{L}{q_i}
    = Q_i.\end{equation}  You can think of them as a generalization of Newton's
    equations.  For a particle, we have $T=\frac{1}{2}m \dot{x}^2,$ so
    $\frac{d}{dt}\pd{L}{\dot{x}} = m\ddot{x},$ and $\pd{L}{x} = -\pd{U}{x} = f$
    amounting to $f=ma$.  But the Lagrangian derivation works in generalized
    coordinate systems and for constrained motion.</p>

    <p>Without going into the full details, the key idea for handling
    constraints is the "principle of virtual work" (and the associated
    D'Almbert's principle). To describe even a pendulum at equilibrium in the
    Newtonian approach, we would have to compute both external forces (like
    gravity) and internal forces (like the forces keeping the pendulum arm
    attached to the table), and have their sum equal zero. Computing internal
    forces is slightly annoying for a pendulum; it becomes untenable for more
    complex mechanisms. If the sum of the forces equals zero, then the work done
    by those forces in some (infinitesmal) virtual displacement is certainly
    also equal to zero. Now here is the trick: if we consider only virtual
    displacements that are consistent with the kinematic constraints (e.g.
    rotations around the pin joint of the pendulum), then we can compute that
    virtual work and set it equal to zero without ever computing the internal
    forces! Extending this idea to the dynamics case (via D'Almbert's and
    Hamilton's principles) results eventually in the Lagrangian equations
    above.</p>

    <p>If you are not comfortable with these equations, then any good book
    chapter on rigid body mechanics can bring you up to speed.
    <elib>Craig89</elib> is an excellent practical guide to robot
    kinematics/dynamics. But <elib>Lanczos70</elib> is my favorite mechanics
    book, by far; I highly recommend it if you want something more. It's also ok
    to continue on for now thinking of Lagrangian mechanics simply as a recipe
    than you can apply in a great many situations to generate the equations of
    motion.
    </p>

    <p>For completeness, I've included a derivation of the Lagrangian from the
    principle of stationary action <a href="#action">below</a>.</p>

    <example id=double_pendulum><h1>Simple Double Pendulum</h1>

      <figure>
        <img style="width:250px;" src="figures/simple_double_pend.svg"/>
        <figcaption>Simple double pendulum</figcaption>
      </figure>

      <p> Consider the simple double pendulum with torque actuation at both
      joints and all of the mass concentrated in two points (for simplicity).
      Using $\bq = [\theta_1,\theta_2]^T$, and ${\bf p}_1,{\bf p}_2$ to denote
      the locations of $m_1,m_2$, respectively, the kinematics of this system
      are:

      \begin{eqnarray*}
      {\bf p}_1 =& l_1\begin{bmatrix} s_1 \\ - c_1 \end{bmatrix}, &{\bf p}_2  =
      {\bf p}_1 + l_2\begin{bmatrix} s_{1+2} \\ - c_{1+2} \end{bmatrix} \\
      \dot{{\bf p}}_1 =& l_1 \dot{q}_1\begin{bmatrix} c_1 \\ s_1 \end{bmatrix},
      &\dot{{\bf p}}_2 = \dot{{\bf p}}_1 + l_2 (\dot{q}_1+\dot{q}_2) \begin{bmatrix} c_{1+2} \\ s_{1+2} \end{bmatrix}
      \end{eqnarray*}

      Note that $s_1$ is shorthand for $\sin(q_1)$, $c_{1+2}$ is shorthand for
      $\cos(q_1+q_2)$, etc. From this we can write the kinetic and potential
      energy:

      \begin{align*}
      T =& \frac{1}{2} m_1 \dot{\bf p}_1^T \dot{\bf p}_1 + \frac{1}{2} m_2
      \dot{\bf p}_2^T \dot{\bf p}_2 \\
      =& \frac{1}{2}(m_1 + m_2) l_1^2 \dot{q}_1^2 + \frac{1}{2} m_2 l_2^2 (\dot{q}_1 + \dot{q}_2)^2 + m_2 l_1 l_2 \dot{q}_1 (\dot{q}_1 + \dot{q}_2) c_2 \\
      U =& m_1 g y_1 + m_2 g y_2 = -(m_1+m_2) g l_1 c_1 - m_2 g l_2 c_{1+2}
      \end{align*}

      Taking the partial derivatives $\pd{T}{q_i}$, $\pd{T}{\dot{q}_i}$, and
      $\pd{U}{q_i}$ ($\pd{U}{\dot{q}_i}$ terms are always zero), then
      $\frac{d}{dt}\pd{T}{\dot{q}_i}$, and plugging them into the Lagrangian,
      reveals the equations of motion:

      \begin{align*}
      (m_1 + m_2) l_1^2 \ddot{q}_1& + m_2 l_2^2 (\ddot{q}_1 + \ddot{q}_2) + m_2 l_1 l_2 (2 \ddot{q}_1 + \ddot{q}_2) c_2 \\
      &- m_2 l_1 l_2 (2 \dot{q}_1 + \dot{q}_2) \dot{q}_2 s_2 + (m_1 + m_2) l_1 g s_1 + m_2 g l_2 s_{1+2} = \tau_1 \\
      m_2 l_2^2 (\ddot{q}_1 + \ddot{q}_2)& + m_2 l_1 l_2 \ddot{q}_1 c_2 + m_2 l_1 l_2
      \dot{q}_1^2 s_2 + m_2 g l_2 s_{1+2} = \tau_2
      \end{align*}

      As we saw in chapter 1, numerically integrating (and animating) these
      equations in <drake></drake> produces the expected result. </p>

    </example>

  </section>

  <section id=manipulator><h1>The Manipulator Equations</h1>

    <p> If you crank through the Lagrangian dynamics for a few simple robotic
    manipulators, you will begin to see a pattern emerge - the resulting
    equations of motion all have a characteristic form.  For example, the
    kinetic energy of your robot can always be written in the form:
    \begin{equation} T = \frac{1}{2} \dot{\bq}^T \bM(\bq)
    \dot{\bq},\end{equation} where $\bM$ is the state-dependent inertia matrix
    (aka mass matrix). This observation affords some insight into general
    manipulator dynamics - for example we know that ${\bf M}$ is always
    positive definite, and symmetric<elib part="p.107">Asada86</elib> and has a
    beautiful sparsity pattern<elib>Featherstone05</elib> that we should take
    advantage of in our algorithms.</p>

    <p>Continuing our abstractions, we find that the equations of motion of a
    general robotic manipulator (without kinematic loops) take the form
    \begin{equation}{\bf M}(\bq)\ddot{\bq} + \bC(\bq,\dot{\bq})\dot{\bq} =
    \btau_g(\bq) + {\bf B}\bu, \label{eq:manip} \end{equation} where $\bq$ is
    the joint position vector, ${\bf M}$ is the inertia matrix, $\bC$ captures
    Coriolis forces, and $\btau_g$ is the gravity vector.  The matrix $\bB$
    maps control inputs $\bu$ into generalized forces.  Note that we pair
    $\bM\ddot\bq + \bC\dot\bq$ on the left side because "... the equations of
    motion depend on the choice of coordinates $\bq$.  For this reason neither
    $\bM\ddot\bq$ nor $\bC\dot\bq$ individually should be thought of as a
    generalized force; only their sum is a force"<elib>Choset05</elib>(s.10.2).
    These terms represent the contribution from kinetic energy to the
    Lagrangian: $$\frac{d}{dt}\pd{T}{\dot{\bq}}^T - \pd{T}{\bq}^T = \bM(\bq) \ddot{\bq} + \bC(\bq,\dot\bq)\dot\bq.$$ Whenever I write Eq. (\ref{eq:manip}), I see $ma = f$.
    </p>

    <example id="manipulator_equation_double_pendulum"><h1>Manipulator Equation
    form of the Simple Double Pendulum</h1> The equations of motion from
    Example 1 can be written compactly as: \begin{align*} \bM(\bq) =&
    \begin{bmatrix} (m_1 + m_2)l_1^2 + m_2 l_2^2 + 2 m_2 l_1l_2 c_2 & m_2 l_2^2
    + m_2 l_1 l_2 c_2 \\ m_2 l_2^2 + m_2 l_1 l_2 c_2 & m_2 l_2^2 \end{bmatrix}
    \\ \bC(\bq,\dot\bq) =& \begin{bmatrix} 0 & -m_2 l_1 l_2 (2\dot{q}_1 +
    \dot{q}_2)s_2 \\ m_2 l_1 l_2 \dot{q}_1 s_2 & 0 \end{bmatrix} \\
    \btau_g(\bq) =& -g \begin{bmatrix} (m_1 + m_2) l_1 s_1 + m_2 l_2 s_{1+2} \\
    m_2 l_2 s_{1+2} \end{bmatrix} , \quad \bB = \begin{bmatrix} 1 & 0 \\ 0 & 1
    \end{bmatrix} \end{align*} Note that this choice of the $\bC$ matrix was
    not unique. </example>

    <p> The manipulator equations are very general, but they do define some
    important characteristics.  For example, $\ddot{\bq}$ is (state-dependent)
    linearly related to the control input, $\bu$.   This observation justifies
    the control-affine form of the dynamics assumed throughout the notes. There
    are many other important properties that might prove useful.  For instance,
    we know that the $i$ element of $\bC\dot{q}$ is given by <elib
    part="&sect;5.2.3">Asada86</elib>: $$\left[\bC(\bq,\dot\bq)\dot\bq
    \right]_i = \sum_{j,k} c_{ijk} \dot{q}_j \dot{q}_k, \qquad c_{ijk}(\bq) =
    \pd{M_{ij}}{q_k} - \frac{1}{2}\pd{M_{jk}}{q_k},$$ (e.g. it also quadratic
    in $\dot{\bq}$) and, for the appropriate choice of $\bC$, we have that
    $\bM(\bq) - 2\bC(\bq,\dot{\bq})$ is skew-symmetric<elib>Slotine90</elib>.</p>

    <!-- Khalil and Dombre have c_{ijk} = \frac{1}{2}[\pd{H_{ij}}{q_k} + \pd{H_{ik}}{q_j} - \pd{H_{jk}}{q_i}], which is slightly different... -->

    <p>Note that we have chosen to use the notation of second-order systems
    (with $\dot{\bq}$ and $\ddot{\bq}$ appearing in the equations) throughout
    this book.  Although I believe it provides more clarity, there is an
    important limitation to this notation: it is impossible to describe 3D
    rotations in "minimal coordinates" using this notation without introducing
    kinematic singularities (like the famous "gimbal lock"). For instance, a
    common singularity-free choice for representing a 3D rotation is the unit
    quaternion, described by 4 real values (plus a norm constraint).  However we
    can still represent the rotational velocity without singularities using just
    3 real values.  This means that the length of our velocity vector is no
    longer the same as the length of our position vector.  For this reason, you
    will see that most of the software in <drake></drake> uses the more general
    notation with $\bv$ to represent velocity, $\bq$ to represent positions, and
    the manipulator equations are written as \begin{equation} \bM(\bq) \dot{\bv}
    + \bC(\bq,\bv)\bv = \btau_g(\bq) + \bB \bu, \end{equation} which is
    not necessarily a second-order system.  See <elib>Duindam06</elib> for a
    nice discussion of this topic.</p>

    <subsection><h1>Recursive Dynamics Algorithms</h1>

      <p>The equations of motions for our machines get complicated quickly.
      Fortunately, for robots with a tree-link kinematic structure, there are
      very efficient and natural recursive algorithms for generating these
      equations of motion.  For a detailed reference on these methods, see
      <elib>Featherstone07</elib>; some people prefer reading about the
      Articulated Body Method in <elib>Mirtich96</elib>.  The implementation in
      <drake></drake> uses a related formulation from <elib>Jain11</elib>.</p>

    </subsection> <!-- end recursive algorithms -->

    <subsection><h1>Hamiltonian Mechanics</h1>

      <p>In some cases, it might be preferable to use the <a
        href="https://en.wikipedia.org/wiki/Hamiltonian_mechanics">Hamiltonian
        formulation of the dynamics</a>, with state variables $\bq$ and $\bp$
      (instead of $\dot{\bq}$), where $\bp = \pd{L}{\dot{\bq}}^T = \bM
      \dot\bq$ results in the dynamics: \begin{align*}\bM(\bq) \dot{\bq} =& \bp
      \\ \dot{\bp} =&
      {\bf c}_H(\bq,\bp) + \tau_g(\bq) + \bB\bu, \qquad \text{where}\quad
      {\bf c}_H(\bq,\bp) = \frac{1}{2} \left(\pd{ \left[ \dot{\bq}^T \bM(\bq)
      \dot{\bq}\right]}{\bq}\right)^T.\end{align*} Recall that the Hamiltonian
      is $H = {\bf p}^T\dot\bq - L$; these equations of motion are the
      familiar $\dot\bq = \pd{H}{\bf p}^T, \dot{\bf p} = -\pd{H}{\bq}^T.$</p>

    </subsection>

    <subsection id="bilateral"><h1>Bilateral Position Constraints</h1>

      <p>If our robot has closed-kinematic chains, for instance those that arise
      from a <a href="https://en.wikipedia.org/wiki/Four-bar_linkage">four-bar
      linkage</a>, then we need a little more.  The Lagrangian machinery above
      assumes "minimal coordinates"; if our state vector $\bq$ contains all of
      the links in the kinematic chain, then we do not have a minimal
      parameterization -- each kinematic loop adds (at least) one constraint so
      should remove (at least) one degree of freedom.  Although some constraints
      can be solved away, the more general solution is to use the Lagrangian to
      derive the dynamics of the unconstrained system (a kinematic tree without
      the closed-loop constraints), then add additional generalized forces that
      ensure that the constraint is always satisfied. </p>

      <p>Consider the constraint equation \begin{equation}\bh(\bq) =
      0.\end{equation}  For the case of the kinematic closed-chain, this can be
      the kinematic constraint that the location of one end of the chain is
      equal to the location of the other end of the chain. The equations of
      motion can be written \begin{equation}\bM({\bq})\ddot{\bq} +
      \bC(\bq,\dot{\bq})\dot\bq = \btau_g(\bq) + \bB\bu + \bH^T(\bq)
      \blambda,\label{eq:constrained_manip}\end{equation} where $\bH(\bq) =
      \pd\bh{\bq}$ and ${\blambda}$ is the constraint force.  Let's use the
      shorthand \begin{equation} \bM({\bq})\ddot{\bq} = \btau(q,\dot{q},u) +
      \bH^T(\bq) \blambda. \label{eq:manip_short} \end{equation}</p>

      <p>Using \begin{gather}\dot\bh = \bH\dot\bq,\\ \ddot\bh = \bH \ddot\bq +
      \dot\bH \dot\bq, \label{eq:ddoth} \end{gather} we can solve for
      $\blambda$, by observing that when the constraint is imposed, $\bh=0$ and
      therefore $\dot\bh=0$ and $\ddot\bh=0$.  Inserting the dynamics
      (\ref{eq:manip_short}) into (\ref{eq:ddoth}) yields
      \begin{equation}\blambda =- (\bH \bM^{-1} \bH^T)^+ (\bH \bM^{-1} \btau +
      \dot\bH\dot\bq).\label{eq:lambda_from_h}\end{equation} The $^+$ notation refers to a Moore-Penrose
      pseudo-inverse.  In many cases, this matrix will be full rank (for
      instance, in the case of multiple independent four-bar linkages) and the
      traditional inverse could have been used.  When the matrix drops rank
      (multiple solutions), then the pseudo-inverse will select the solution
      with the smallest constraint forces in the least-squares sense.</p>

      <p>To combat numerical "constraint drift", one might like to add a
      restoring force in the event that the constraint is not satisfied to
      numerical precision.  To accomplish this, rather than solving for
      $\ddot\bh = 0$ as above, we can instead solve for \begin{equation}\ddot\bh
      = \bH\ddot\bq + \dot\bH\dot\bq = -2\alpha \dot\bh - \alpha^2
      \bh,\end{equation} where $\alpha>0$ is a stiffness parameter. This is
      known as Baumgarte's stabilization technique, implemented here with a
      single parameter to provide a critically-damped response. Carrying this
      through yields \begin{equation} \blambda =- (\bH \bM^{-1} \bH^T)^+ (\bH
      \bM^{-1} \btau + (\dot{\bH} + 2\alpha \bH)\dot{\bq} + \alpha^2 \bh).
      \end{equation}</p>

      <p>There is an important optimization-based derivation/interpretation of
      these equations, which we will build on below, using <a
      href="https://en.wikipedia.org/wiki/Gauss%27s_principle_of_least_constraint">Gauss's
      Principle of Least Constraint</a>.  This principle says that we can
      derive the constrained dynamics as : \begin{align} \min_\ddot{\bq} \quad
      & \frac{1}{2} (\ddot{\bq} - \ddot{\bq}_{uc})^T \bM (\ddot{\bq} -
      \ddot{\bq}_{uc}), \label{eq:least_constraint} \\ \subjto \quad &
      \ddot{\bh}(\bq,\dot\bq,\ddot\bq) = 0 = \bH \ddot{\bq} + \dot\bH \dot{\bq},
      \nonumber \end{align} where $\ddot{\bq}_{uc} = \bM^{-1}\tau$ is the
      "unconstrained acceleration"
      <elib>Udwadia92</elib>. Equation \eqref{eq:constrained_manip} comes right
      out of the optimality conditions with $\lambda$ acting precisely as the
      Lagrange multiplier.  This is a convex quadratic program with equality
      constraints, which has a closed-form solution that is precisely Equation
      \eqref{eq:lambda_from_h} above.  It is also illustrative to look at the
      dual formulation of this optimization, which can be written as
      $$\min_\lambda \frac{1}{2} \lambda^T \bH \bM^{-1} \bH^T \lambda -
      \lambda^T (\bH \bM^{-1}\btau + \dot\bH \dot{\bq}).$$  Observe that the
      linear term is contains the acceleration of $\bh$ if the dynamics evolved
      without the constraint forces applied; let's call that $\ddot\bh_{uc}$:
      $$\min_\lambda \frac{1}{2} \lambda^T \bH \bM^{-1} \bH^T \lambda -
      \lambda^T \ddot{\bh}_{uc}.$$ The primal formulation has accelerations as
      the decision variables, and the dual formulation has constraint forces as
      the decision variables.  For now this is merely a cute observation; we'll
      build on it more when we get to discussing contact forces.</p>

    </subsection>

    <subsection><h1>Bilateral Velocity Constraints</h1>

      <p>Consider the constraint equation \begin{equation}\bh_v(\bq,\dot{\bq}) =
      0,\end{equation} where $\pd{\bh_v}{\dot{\bq}} \ne 0.$  These are less
      common, but arise when, for instance, a joint is driven through a
      prescribed motion. Here, the manipulator equations are given by
      \begin{equation}\bM\ddot{\bq}  = \btau + \pd{\bh_v}{\dot{\bq}}^T
      \blambda.\end{equation} Using \begin{equation} \dot\bh_v = \pd{\bh_v}{\bq}
      \dot{\bq} + \pd{\bh_v}{\dot{\bq}}\ddot{\bq},\end{equation} setting
      $\dot\bh_v = 0$ yields \begin{equation}\blambda = - \left(
      \pd{\bh_v}{\dot{\bq}} \bM^{-1} \pd{\bh_v}{\dot{\bq}} \right)^+ \left[
      \pd{\bh_v}{\dot{\bq}} \bM^{-1} \btau + \pd{\bh_v}{\bq} \dot{\bq}
      \right].\end{equation}</p>

      <p>Again, to combat constraint drift, we can ask instead for $\dot\bh_v =
      -\alpha \bh_v$, which yields \begin{equation}\blambda = - \left(
      \pd{\bh_v}{\dot{\bq}} \bM^{-1} \pd{\bh_v}{\dot{\bq}} \right)^+ \left[
      \pd{\bh_v}{\dot{\bq}} \bM^{-1} \btau + \pd{\bh_v}{\bq} \dot{\bq} + \alpha
      \bh_v \right].\end{equation}</p>

    </subsection>

  </section>
  <section><h1>The Dynamics of Contact</h1>

    <p>The dynamics of multibody systems that make and break contact are closely
    related to the dynamics of constrained systems, but tend to be much more
    complex.  In the simplest form, you can think of non-penetration as an
    <i>inequality</i> constraint: the signed distance between collision bodies
    must be non-negative.  But, as we have seen in the chapters on walking, the
    transitions when these constraints become active correspond to collisions,
    and for systems with momentum they require some care.  We'll also see that
    frictional contact adds its own challenges.</p>

    <p>There are three main approaches used to modeling contact with "rigid"
    body systems:  1) rigid contact approximated with stiff compliant contact,
    2) hybrid models with collision event detection, impulsive reset maps, and
    continuous (constrained) dynamics between collision events, and 3) rigid
    contact approximated with time-averaged forces (impulses) in a time-stepping
    scheme. Each modeling approach has advantages/disadvantages for different
    applications.</p>

    <p>Before we begin, there is a bit of notation that we will use throughout.
    Let $\phi(\bq)$ indicate the relative (signed) distance between two rigid
    bodies.  For rigid contact, we would like to enforce the unilateral
    constraint: \begin{equation} \phi(\bq) \geq 0. \end{equation}  We'll use
    ${\bf n} = \pd{\phi}{\bq}$ to denote the "contact normal", as well as ${\bf
    t}_1$ and ${\bf t}_2$ as a basis for the tangents at the contact
    (orthonormal vectors in the Cartesian space, projected into joint space),
    all represented as row vectors in joint coordinates.
    </p>

    <figure><img width="100%" src="figures/contact_coordinates_2d.jpg"
    /><figcaption>Contact coordinates in 2D.  (a) The signed distance between
    contact bodies, $\phi(\bq)$. (b) The normal (${\bf n}$) and tangent (${\bf
    t}$) contact vectors -- note that these can be drawn in 2D when the $\bq$ is
    the $x,y$ positions of one of the bodies, but more generally the vectors
    live in the configuration space. (c) Sometimes it will be helpful for us to
    express the tangential coordinates using $d_1$ and $d_2$; this will make
    more sense in the 3D case.</figcaption></figure>

    <figure><img width="100%" src="figures/contact_coordinates_3d.jpg"
      /><figcaption>Contact coordinates in 3D.</figcaption></figure>

    <p>We will also find it useful to assemble the contact normal and tangent
    vectors into a single matrix, $\bJ$, that we'll call the <i>contact
    Jacobian</i>: $$\bJ(\bq) = \begin{bmatrix} {\bf n}\\ {\bf t}_1\\ {\bf t}_2
    \end{bmatrix}.$$  As written, $\bJ\bv$ gives the relative velocity of the
    closest points in the contact coordinates; it can be also be extended with
    three more rows to output a full spatial velocity (e.g. when modeling
    torsional friction).  The generalized force produced by the contact is given
    by $\bJ^T \lambda$, where $\lambda = [f_n, f_{t1}, f_{t2}]^T:$
    \begin{equation} \bM(\bq) \dot{\bv} + \bC(\bq,\bv)\bv = \btau_g(\bq) + \bB
    \bu + \bJ^T(\bq)\lambda. \end{equation}</p>

    <subsection><h1>Compliant Contact Models</h1>

      <p>Most compliant contact models are conceptually straight-forward: we
      will implement contact forces using a stiff spring (and
      damper<elib>Hunt75</elib>) that produces forces to resist penetration (and
      grossly model the dissipation of collision and/or friction).  For
      instance, for the normal force, $f_n$, we can use a simple (piecewise)
      linear spring law:</p>

      <figure>
        <img width="50%" src="figures/contact_spring.jpg" />
      </figure>

      <p>Coulomb friction is described by two parameters -- $\mu_{static}$ and
      $\mu_{dynamic}$ -- which are the coefficients of static and dynamic
      friction.  When the contact tangential velocity (given by ${\bf t}\bv$) is
      zero, then friction will produce whatever force is necessary to resist
      motion, up to the threshold $\mu_{static} f_n.$ But once this threshold is
      exceeding, we have slip (non-zero contact tangential velocity); during
      slip friction will produce a constant drag force $|f_t| = \mu_{dynamic}
      f_n$ in the direction opposing the motion.  This behaviour is not a simple
      function of the current state, but we can approximate it with a continuous
      function as pictured below.</p>

      <figure>
        <img width="90%" src="figures/contact_stribeck.jpg" />
        <figcaption>(left) The Coloumb friction model.  (right) A continuous piecewise-linear approximation of friction (green) and the <a href="https://drake.mit.edu/doxygen_cxx/group__stribeck__approximation.html">Stribeck approximation of Coloumb friction</a> (blue); the $x$-axis is the contact tangential velocity, and the $y$-axis is the friction coefficient.</figcaption>
      </figure>

      <p>With these two laws, we can recover the contact forces as relatively
      simple functions of the current state.  However, the devil is in the
      details.  There are a few features of this approach that can make it
      numerically unstable. If you've ever been working in a robotics simulator
      and watched your robot take a step only to explode out of the ground and
      go flying through the air, you know what I mean.  </p>

      <p>In order to tightly approximate the (nearly) rigid contact that most
      robots make with the world, the stiffness of the contact "springs" must be
      quite high.  For instance, I might want my 180kg humanoid robot model to
      penetrate into the ground no more than 1mm during steady-state standing.
      The challenge with adding stiff springs to our model is that this results
      in <a href="https://en.wikipedia.org/wiki/Stiff_equation">stiff
      differential equations</a> (it is not a coincidence that the word
      <em>stiff</em> is the common term for this in both springs and ODEs). As a
      result, the best implementations of compliant contact for simulation make
      use of special-purpose numerical integration recipes (e.g.
      <elib>Castro20</elib>) and compliant contact models are often difficult to
      use in e.g. trajectory/feedback optimization.</p>

      <p>But there is another serious numerical challenge with the basic
      implementation of this model.  Computing the signed-distance function,
      $\phi(\bq)$, when it is non-negative is straightforward, but robustly
      computing the "penetration depth" once two bodies are in collision is not.
      Consider the case of use $\bphi(\bq)$ to represent the maximum penetration
      depth of, say, two boxes that are contacting each other.  With compliant
      contact, we must have some penetration to produce any contact force.  But
      the direction of the normal vector, ${\bf n} = \pd{\bphi}{\bq},$ can
      easily change discontinuously as the point of maximal penetration moves,
      as I've tried to illustrate here:</p>

      <figure><img width=90% src="figures/contact_penetration_normals.jpg" />
      <figcaption>(Left) Two boxes in penetration, with the signed distance
      defined by the maximum penetration depth.  (Right) The contact normal for
      various points of maximal penetration.
      </figcaption></figure>

      <p>If you really think about this example, it actually reveals even more of the
      foibles of trying to even define the contact points and normals
      consistently.  It seems reasonable to define the point on body B as the
      point of maximal penetration into body A, but notice that as I've drawn
      it, that isn't actually unique!  The corresponding point on body A should
      probably be the point on the surface with the minimal distance from the
      max penetration point (other tempting choices would likely cause the
      distance to be discontinuous at the onset of penetration).  But this whole
      strategy is asymmetric;  why shouldn't I have picked the vector going with
      maximal penetration into body B?  My point is simply that these cases
      exist, and that even our best software implementations get pretty
      unsatisfying when you look into the details.  And in practice, it's a lot to expect the collision engine to give consistent and smooth contact points out in every situation.</p>

      <p>Some of the advantages of this approach include (1) the fact that it is
      easy to implement, at least for simple geometries, (2) by virtue of being
      a continuous-time model it can be simulated with error-controlled
      integrators, and (3) the tightness of the approximation of "rigid" contact
      can be controlled through relatively intuitive parameters.  However, the
      numerical challenges of dealing with penetration are very real, and they
      motivate our other two approaches that attempt to more strictly enforce
      the non-penetration constraints.</p>

    </subsection>

    <subsection><h1>Rigid Contact with Event Detection</h1>

      <subsubsection id="impulsive_collision"><h1>Impulsive Collisions</h1>

        <p>The collision event is described by the zero-crossings (from positive
        to negative) of $\phi$. Let us start by assuming frictionless
        collisions, allowing us to write \begin{equation}\bM\ddot{\bq} = \btau +
        \lambda {\bf n}^T,\end{equation} where ${\bf n}$ is the "contact normal"
        we defined above and $\lambda \ge 0$ is now a (scalar) impulsive force
        that is well-defined when integrated over the time of the collision
        (denoted $t_c^-$ to $t_c^+$). Integrate both sides of the equation over
        that (instantaneous) interval: \begin{align*}\int_{t_c^-}^{t_c^+}
        dt\left[\bM \ddot{\bq} \right] = \int_{t_c^-}^{t_c^+} dt \left[ \btau +
        \lambda {\bf n}^T \right] \end{align*} Since $\bM$, $\btau$, and ${\bf
        n}$ are constants over this interval, we are left with $$\bM\dot{\bq}^+
        - \bM\dot{\bq}^- = {\bf n}^T \int_{t_c^-}^{t_c^+} \lambda dt,$$ where
        $\dot{\bq}^+$ is short-hand for $\dot{\bq}(t_c^+)$. Multiplying both
        sides by ${\bf n} \bM^{-1}$, we have $$ {\bf n} \dot{\bq}^+ - {\bf
        n}\dot{\bq}^- = {\bf n} \bM^{-1} {\bf n}^T \int_{t_c^-}^{t_c^+}
        \lambda dt.$$ After the collision, we have $\dot\phi^+ = -e \dot\phi^-$,
        with $0 \le e \le 1$ denoting the <a
        href="https://en.wikipedia.org/wiki/Coefficient_of_restitution">
        coefficient of restitution</a>, yielding: $$ {\bf n} \dot{\bq}^+ -
        {\bf n}\dot{\bq}^- = -(1+e){\bf n}\dot{\bq}^-,$$ and therefore
        $$\int_{t_c^-}^{t_c^+} \lambda dt = - (1+e)\left[ {\bf n} \bM^{-1}
        {\bf n}^T \right]^\# {\bf n} \dot{\bq}^-.$$ I've used the notation
        $A^\#$ here to denote the pseudoinverse of $A$ (I would normally write
        $A^+,$ but have changed it for this section to avoid confusion).
        Substituting this back in above results in \begin{equation}\dot{\bq}^+ =
        \left[ \bI - (1+e)\bM^{-1} {\bf n}^T \left[{\bf n} \bM^{-1} {\bf n}^T
        \right]^\# {\bf n} \right] \dot{\bq}^-.\end{equation}</p>

        <p>We can add friction at the contact.  Rigid bodies will almost always
        experience contact at only a point, so we typically ignore torsional
        friction <elib>Featherstone07</elib>, and model only tangential friction
        by extending ${\bf n}$ to a matrix $$\bJ = \begin{bmatrix} {\bf n}\\
        {\bf t}_1\\ {\bf t}_2 \end{bmatrix},$$ to capture the gradient of the
        contact location tangent to the contact surface. Then $\blambda$ becomes
        a Cartesian vector representing the contact impulse, and (for infinite
        friction) the post-impact velocity condition becomes ${\bf J}\dot{\bq}^+
        = \text{diag}(-e, 0, 0) {\bf J}\dot{\bq}^-,$ resulting in the equations:
        \begin{equation}\dot{\bq}^+ = \left[ \bI - \bM^{-1} \bJ^T
        \text{diag}(1+e, 1, 1) \left[\bJ \bM^{-1} \bJ^T \right]^\# \bJ
        \right]\dot{\bq}^-.\end{equation} If $\blambda$ is restricted to a
        friction cone, as in Coulomb friction, in general we have to solve an
        optimization to solve for $\dot\bq^+$ subject to the inequality
        constraints.</p>

        <example id="spinning_ball_bouncing"><h1>A spinning ball bouncing on the
        ground.</h1>

          <p>Imagine a ball (a hollow-sphere) in the plane with mass $m$, radius
          $r$.  The configuration is given by $q = \begin{bmatrix} x, z, \theta
          \end{bmatrix}^T.$  The equations of motion are $$\bM(\bq)\ddot\bq =
          \begin{bmatrix} m & 0 & 0 \\ 0 & m & 0 \\ 0 & 0 & \frac{2}{3}mr^2
          \end{bmatrix} \ddot\bq = \begin{bmatrix} 0 \\ -g \\ 0 \end{bmatrix} +
          \begin{bmatrix} 0 & 1 \\ 1 & 0 \\ 0 & r \end{bmatrix} \begin{bmatrix}
          \lambda_z \\ \lambda_x \end{bmatrix} = \tau_g + \bJ^T {\bf \lambda},$$
          where ${\bf \lambda}$ are the contact forces (which are zero except
          during the impulsive collision).  Given a coefficient of restitution
          $e$ and a collision with a horizontal ground, the post-impact
          velocities are: $$ \dot{\bq}^+ = \begin{bmatrix} \frac{3}{5} & 0 &
          -\frac{2}{5} r \\ 0 & - e & 0 \\ -\frac{3}{5r} & 0 &
          \frac{2}{5}\end{bmatrix}\dot{\bq}^-.$$
        </p>

        <!-- Derivation is here: https://www.wolframcloud.com/obj/russt/Published/BouncingBallWithSpin.nb-->

        </example>

      </subsubsection>

      <subsubsection><h1>Putting it all together</h1>

        <p> We can put the bilateral constraint equations and the impulsive
        collision equations together to implement a hybrid model for unilateral
        constraints of the form. Let us generalize \begin{equation}\bphi(\bq)
        \ge 0,\end{equation} to be the vector of all pairwise (signed) distance
        functions between rigid bodies; this vector describes all possible
        contacts. At every moment, some subset of the contacts are active, and
        can be treated as a bilateral constraint ($\bphi_i=0$). The guards of
        the hybrid model are when an inactive constraint becomes active
        ($\bphi_i>0 \rightarrow \bphi_i=0$), or when an active constraint
        becomes inactive ($\blambda_i>0 \rightarrow \blambda_i=0$ and
        $\dot\phi_i > 0$). Note that collision events will (almost always) only
        result in new active constraints when $e=0$, e.g. we have pure
        inelastic collisions, because elastic collisions will rarely permit
        sustained contact.</p>

        <p>If the contact involves Coulomb friction, then the transitions
        between sticking and sliding can be modeled as additional hybrid
        guards.</p>

      </subsubsection>

    </subsection>

    <subsection><h1>Time-stepping Approximations for Rigid Contact</h1>

      <p> So far we have seen two different approaches to enforcing the
      inequality constraints of contact, $\phi(\bq) \ge 0$ and the friction
      cone.  First we introduced compliant contact which effectively enforces
      non-penetration with a stiff spring.  Then we discussed event detection as
      a means to switch  between different models which treat the active
      constraints as equality constraints.  But, perhaps surprisingly, one of
      the most popular and scalable approaches is something different: it
      involves formulating a mathematical program that can solve the inequality
      constraints directly on each time step of the simulation.  These
      algorithms may be more expensive to compute on each time step, but they
      allow for stable simulations with potentially much larger and more
      consistent time steps.</p>

      <subsubsection><h1>Complementarity formulations</h1>

        <p>What class of mathematical program due we need to simulate contact?
        The discrete nature of contact suggests that we might need some form of
        combinatorial optimization.  Indeed, the most common transcription is to
        a Linear Complementarity Problem (LCP) <elib>Murty88</elib>, as
        introduced by <elib>Stewart96</elib> and <elib>Anitescu97</elib>.  An
        LCP is typically written as \begin{align*} \find_{\bw,\bz} \quad \subjto
        \quad & \bw = \bq + \bM \bz, \\ & \bw \ge 0, \bz \ge 0, \bw^T\bz =
        0.\end{align*} They are directly related to the optimality conditions of
        a (potentially non-convex) quadratic program  and
        <elib>Anitescu97</elib>> showed that the LCP's generated by our
        rigid-body contact problems can be solved by Lemke's algorithm.  As
        short-hand we will write these complementarity constraints as
        $$\find_{\bz}\quad \subjto \quad 0 \le (\bq + \bM\bz) \perp \bz \ge
        0.$$</p>

        <p>Rather than dive into the full transcription, which has many terms
        and can be relatively difficult to parse, let me start with two very
        simple examples.</p>

        <example><h1>Time-stepping LCP: Normal force</h1>

          <p>Consider our favorite simple mass being actuated by a horizontal
          force (with the double integrator dynamics), but this time we will add
          a wall that will prevent the cart position from taking negative
          values: our non-penetration constraint is $q \ge 0$.  Physically, this
          constraint is implemented by a normal (horizontal) force, $f$,
          yielding the equations of motion: $$m\ddot{q} = u + f.$$</p>

          <figure><img width=40% src="figures/lcp_brick_normal_force.jpg">
          </figure>

          <p>$f$ is defined as the force required to enforce the non-penetration
          constraint; certainly the following are true: $f \ge 0$ and $q \cdot
          f = 0$.  $q \cdot f = 0$ is the "complementarity constraint", and you
          can read it here as "either q is zero or force is zero" (or both); it
          is our "no force at a distance" constraint, and it is clearly
          non-convex.  It turns out that satisfying these constraints, plus $q
          \ge 0$, is sufficient to fully define $f$.</p>

          <p>To define the LCP, we first discretize time, by approximating
          \begin{gather*}q[n+1] = q[n] + h v[n+1], \\ v[n+1] = v[n] +
          \frac{h}{m}(u[n] + f[n]).\end{gather*}  This is almost the standard
          Euler approximation, but note the use of $v[n+1]$ in the right-hand
          side of the first equation -- this is actually a <a
          href="https://en.wikipedia.org/wiki/Semi-implicit_Euler_method">semi-implicit
          Euler approximation</a>, and this choice is essential in the
          derivation.</p>

          <p>Given $h, q[n], v[n],$ and $u[n]$, we can solve for $f[n]$ and
          $q[n+1]$ simultaneously, by solving the following LCP: \begin{gather*}
          q[n+1] = \left[ q[n] + h v[n] + \frac{h^2}{m} u[n] \right] +
          \frac{h^2}{m} f[n] \\ q[n+1] \ge 0, \quad f[n] \ge 0, \quad
          q[n+1]\cdot f[n] = 0. \end{gather*}  Take a moment and convince
          yourself that this fits into the LCP prescription given above.</p>

          <figure><img width=40%
          src="figures/lcp_brick_sol_vectors.jpg"></figure>

          <p>Note: Please don't confuse this visualization with the more common
          visualization of the solution space for an LCP (in two or more
          dimensions) in terms of "complementary cones"<elib>Murty88</elib>.</p>

          <p>Perhaps it's also helpful to plot the solution, $q[n+1], f[n]$ as a function of $q[n], v[n]$.  I've done it here for $m=1, h=0.1, u[n]=0$:</p>
          <figure>
<!--            <iframe id="igraph" scrolling="no" style="border:none;"
            seamless="seamless" src="data/lcp_cart.html" height="250"
            width="100%"></iframe> -->
          </figure>

        </example>

        <p>In the (time-stepping, "impulse-velocity") LCP formulation, we write
        the contact problem in it's combinatorial form.  In the simple example
        above, the complementarity constraints force any solution to lie on
        <i>either</i> the positive x-axis ($f[n] \ge 0$) <i>or</i> the positive
        y-axis ($q[n+1] \ge 0$).  The equality constraint is simply a line that
        will intersect with this constraint manifold at the solution.  But in
        this frictionless case, it is important to realize that these are simply
        the optimality conditions of a convex optimization problem: the
        discrete-time version of Gauss's principle that we used above.  Using
        $\bv'$ as shorthand for $\bv[n+1]$, and replacing $\ddot{\bq} =
        \frac{\bv' - \bv}{h}$ in Eq \eqref{eq:least_constraint} and scaling the
        objective by $h^2$ we have: \begin{align*} \min_{\bv'} \quad &
        \frac{1}{2} \left(\bv' - \bv - h\bM^{-1}\btau\right)^T \bM \left(\bv' -
        \bv - h\bM^{-1}\btau\right) \\ \subjto \quad & \frac{1}{h}\phi(\bq') =
        \frac{1}{h} \phi(\bq + h\bv') \approx \frac{1}{h}\phi(\bq) + {\bf n}\bv'
        \ge 0. \end{align*}  The objective is even cleaner/more intuitive if we
        denote the next step velocity that would have occurred before the
        contact impulse is applied as $\bv^- = \bv + h\bM^{-1}\btau$:
        \begin{align*} \min_{\bv'} \quad & \frac{1}{2} \left(\bv' -
        \bv^-\right)^T \bM \left(\bv' - \bv^-\right) \\ \subjto \quad &
        \frac{1}{h}\phi(\bq') \ge 0. \end{align*} The LCP for the frictionless
        contact dynamics is precisely the optimality conditions of this convex
        (because $\bM \succeq 0$) quadratic program, and once again the contact
        impulse, ${\bf f} \ge 0$, plays the role of the Lagrange multiplier
        (with units $N\cdot s$).</p>

        <p>So why do we talk so much about LCPs instead of QPs?  Well LCPs can
        also represent a wider class of problems, which is what we arrive at
        with the standard transcription of Coulomb friction.  In the limit of
        infinite friction, then we could add an additional constraint that the
        tangential velocity at each contact point was equal to zero (but these
        equation may not always have a feasible solution).  Once we admit limits on the magnitude of the friction forces, the non-convexity of the disjunctive form rear's it's ugly head.</p>

        <example><h1>Time-stepping LCP: Coulomb Friction</h1>

          <p>We can use LCP to find a feasible solution with Coulomb
          friction, but it requires some gymnastics with slack variables to make
          it work.  For this case in particular, I believe a very simple
          example is best.  Let's take our brick and remove the wall and the
          wheels (so we now have friction with the ground).</p>

          <figure><img width=40% src="figures/lcp_brick_friction.jpg">
          </figure>

          <p>The dynamics are the same as our previous example, $$m\ddot{q} = u
          + f,$$ but this time I'm using $f$ for the friction force which is
          inside the friction cone if $\dot{q} = 0$ and on the boundary of the
          friction cone if $\dot{q} \ne 0;$ this is known as the principle of
          maximum dissipation<elib>Stewart96</elib>.  Here the magnitude of the
          normal force is always $mg$, so we have $|f| \le \mu mg,$ where $\mu$
          is the coefficient of friction.  And we will use the same
          semi-implicit Euler approximation to cast this into discrete time.</p>

          <p>Now, to write the friction constraints as an LCP, we must introduce
          some slack variables.  First, we'll break the force into a positive
          component and a negative component: $f[n] = f^+ - f^-.$  And we will
          introduce one more variable $v_{abs}$ which will be non-zero if the
          velocity is non-zero (in either direction).  Now we can write the LCP:
          \begin{align*} \find_{v_{abs}, f^+, f^-} \quad \subjto && \\ 0 \le
          v_{abs} + v[n+1] \quad &\perp& f^+ \ge 0, \\ 0 \le v_{abs} - v[n+1]
          \quad &\perp& f^- \ge 0, \\ 0 \le \mu mg - f^+ - f^- \quad &\perp&
          v_{abs} \ge 0,\end{align*} where each instance of $v[n+1]$ we write
          out with $$v[n+1] = v[n] + \frac{h}{m}(u + f^+ - f^-).$$ It's enough
          to make your head spin!  But if you work it out, all of the
          constraints we want are there.  For example, for $v[n+1] > 0$, then we
          must have $f^+=0$, $v_{abs} = v[n+1]$, and $f^- = \mu mg$.  When
          $v[n+1] = 0$, we can have $v_{abs} = 0$, $f^+ - f^- \le \mu mg$, and
          those forces must add up to make $v[n+1] = 0$.</p>

        </example>

        <p>We can put it all together and write an LCP with both normal forces
        and friction forces, related by Coulomb friction (using a polyhedral
        approximation of the friction cone in 3d)<elib>Stewart96</elib>.
        Although the solution to any LCP <a
        href="https://en.wikipedia.org/wiki/Linear_complementarity_problem">can
        also be represented as the solution to a QP</a>, the QP for this problem
        is non-convex.</p>

      </subsubsection>

      <subsubsection><h1>Anitescu's convex formulation</h1>

        <p>In <elib>Anitescu06</elib>, Anitescu described the natural convex
        formulation of this problem by dropping the maximum dissipation
        inequalities and allowing any force inside the friction cone.  Consider
        the following optimization: \begin{align*} \min_{\bv'} \quad &
        \frac{1}{2} \left( \bv' - \bv^-\right)^T \bM \left(\bv' - \bv^-\right)
        \\ \subjto \quad & \frac{1}{h}\phi(\bq') + \mu {\bf d}_i \bv' \ge 0,
        \qquad \forall i \in 1,...,m \end{align*} where ${\bf d}_i, \forall i\in
        1,...,m$ are a set of tangent row vectors in joint coordinates
        parameterizing the polyhedral approximation of the friction cone (as in
        <elib>Stewart96</elib>).  Importantly, for each ${\bf d}_i$ we must also
        have $-{\bf d}_i$ in the set.  By writing the Lagrangian, $$L(\bv', {\bf
        \beta}) = \frac{1}{2}(\bv' - \bv^-)^T\bM(\bv' - \bv^-) - \sum_i \beta_i
        \left(\frac{1}{h}\phi(\bq) + ({\bf n} + \mu {\bf d})\bv'\right),$$ and checking the
        stationarity condition: $$\pd{L}{\bv'}^T = \bM(\bv' - \bv) - h\btau -
        \sum_i \beta_i ({\bf n} + \mu{\bf d}_i)^T = 0,$$ we can see that the
        Lagrange multiplier, $\beta_i \ge 0$, associated with the $i$th
        constraint is the magnitude of the impulse (with units $N \cdot s$) in
        the direction ${\bf n} - \mu {\bf d}_i,$ where ${\bf n} =
        \pd{\phi}{\bq}$ is the contact normal; the sum of the forces is an
        affine combination of these extreme rays of the polyhedral friction
        cone.  As written, the optimization is a QP.</p>

        <p>But be careful!  Although the primal solution was convex, and the
        dual problems are always convex, the objective here can be positive
        semi-definite.  This isn't a mistake -- <elib>Anitescu06</elib>
        describes a few simple examples where the solution to $\bv'$ is unique,
        but the impulses that produce it are not (think of a table with four
        legs).</p>

        <p>When the tangential velocity is zero, this constraint is tight; for
        sliding contact the relaxation effectively causes the contact to
        "hydroplane" up out of contact, because $\phi(\bq') \ge h\mu {\bf d}_i\bv'.$  It seems like a quite reasonable
        approximation to me, especially for small $h$!</p>

        <p>Let's continue on and write the dual program.  To make the notation a
        little better, the us stack the Jacobian vectors into $\bJ_\beta$, such
        that the $i$th row is ${\bf n} + \mu{\bf d_i}$, so that we have
        $$\pd{L}{\bv'}^T = \bM(\bv' - \bv) - h\btau - \bJ_\beta^T \beta = 0.$$
        Substituting this back into the Lagrangian give us the dual program:
        $$\min_{\beta \ge 0} \frac{1}{2} \beta^T \bJ_\beta \bM^{-1} \bJ_\beta^T
        \beta +  \frac{1}{h}\phi(\bq) \sum_i \beta_i.$$</p>

        <p>One final note: I've written just one contact point here to simplify
        the notation, but of course we repeat the constraint(s) for each contact
        point; <elib>Anitescu06</elib> studies the typical case of only
        including potential contact pairs with $\phi(\bq) \le \epsilon$, for
        small $\epsilon \ge 0$.  </p>

      </subsubsection>

      <subsubsection><h1>Todorov's convex formulation</h1>

        <p>According to <elib>Todorov14</elib>, the popular <a
        href="http://www.mujoco.org/">MuJoCo simulator</a> uses a slight
        variation of this convex relaxation, with the optimization:
        \begin{align*} \min_\lambda \quad & \frac{1}{2} \lambda^T \bJ \bM^{-1}
        \bJ^T \lambda + \frac{1}{h}\lambda^T \left( \bJ \bv^- - \dot\bx^d
        \right), \\ \subjto \quad & \lambda \in \mathcal{FC}(\bq),\end{align*}
        where $\dot\bx^d$ is a "desired contact velocity", and
        $\mathcal{FC}(\bq)$ describes the friction cone.  The friction cone
        constraint looks new but is not; observe that $\bJ^T \lambda =
        \bJ_\beta^T \beta$, where $\beta \ge 0$ is a clever parameterization of
        the friction cone in the polyhedral case.  The bigger difference is in
        the linear term: <elib>Todorov14</elib> proposed $\dot\bx^d = \bJ \bv -
        h\mathcal{B} \bJ \bv - h \mathcal{K} [\phi(\bq), 0, 0]^T,$ with
        $\mathcal{B}$ and $\mathcal{K}$ stabilization gains that are chosen to
        yield a critically damped response. </p>

        <p>How should we interpret this?  If you expand the linear terms, you
        will see that this is almost exactly the dual form we get from the
        position equality constraints formulation, including the Baumgarte
        stabilization.  <!--One term is missing: the $\dot\bJ\bv$ term, but
        <elib>Todorov14</elib> avoids this by saying that they consider
        constraints of the form $\bJ\bv = 0 (+ \text{stabilization});$ I think
        that's a clever way of saying you don't want to bother computing the
        discrete-time equivalent of $\dot\bJ\bv$. --> It's interesting to think
        about the implications of this formulation -- like the Anitescu
        relaxation, it is possible to get some "force at a distance" because we
        have not in any way expressed that $\lambda = 0$ when $\phi(\bq) > 0.$
        In Anitescu, it happened in a velocity-dependent boundary layer; here it
        could happen if you are "coming in hot" -- moving towards contact with
        enough relative velocity that the stabilization terms want to slow you
        down.</p>

        <p>In dual form, it is natural to consider the full conic description of
        the friction cone: $$\mathcal{FC}(\bq) = \left\{{\bf \lambda} = [f_n,
        f_{t1}, f_{t2}]^T \middle| f_n \ge 0, \sqrt{f^2_{t1}+f^2_{t2}} \le \mu
        f_n \right\}.$$  The resulting dual optimization is has a quadratic
        objective and second-order cone constraints (SOCP).</p>

        <p>There are a number of other important relaxations that happen in
        MuJoCo.  To address the positive indefiniteness in the objective,
        <elib>Todorov14</elib> relaxes the dual objective by adding a small term
        to the diagonal.  This guarantees convexity, but the convex optimization
        can still be poorly conditioned.  The stabilization terms in the
        objective are another form of relaxation, and <elib>Todorov14</elib>
        also implements adding additional stabilization to the inertial matrix,
        called the "implicit damping inertia".  These relaxations can
        dramatically improve simulation speed, both by making each convex
        optimization faster to solve, and by allowing the simulator to take
        larger integration time steps.  MuJoCo boasts a special-purpose solver
        that can simulate complex scenes at seriously impressive speeds -- often
        orders of magnitude faster than real time.  But these relaxations can
        also be abused -- it's unfortunately quite common to see physically
        absurd MuJoCo simulations, especially when researchers who don't care
        much about the physics start changing simulation parameters to optimize
        their learning curves!</p>

      </subsubsection>

      <todo>

        <example><h1>Relaxed complementarity-free formulation: Normal force</h1>

          <p>What equations do we get if we simulate the simple cart colliding
          with a wall example that we used above?</p>

          <figure><img width=40% src="figures/lcp_brick_normal_force.jpg">
          </figure>

          <p>Following <elib>Todorov14</elib>, the first relaxation we find is
          the addition of "implicit damping inertia", which effectively adds
          some inertia and some damping, parameterized by the regularization
          term $b$, to the explicit form of the equations.  As a result our
          discrete-time cart dynamics are given by $$(m+b)\frac{v[n+1] -
          v[n]}{h} = u[n] + f[n] - bv[n].$$</p>

          <p>What remains is to compute $f[n]$, and here is where things get
          interesting.  Motivated by the idea of "softening the Gauss principle"
          (e.g. the principle of least constraint <elib>Udwadia92</elib>) by
          replacing the hard constraints with a soft penalty term, we end up
          with a convex optimization which, in this scalar frictionless case,
          reduces to: \begin{gather*}f[n] = \argmin_{f \ge 0}  \frac{1}{2} f^2
          \frac{1 + \epsilon}{m+b} + f(v^- - v^*),\end{gather*} where $v^- =
          v[n] + \frac{h}{m + b}(u[n] - bv[n])$ is the next-step velocity before
          the contact impulse.  $v^*$ is another regularization term: the
          "desired contact velocity" which is implemented like a by Baumgarte
          stabilization: $$v^* = v[n] - 2h \frac{1 + \epsilon}{\kappa} v[n] -
          h\frac{1+\epsilon}{\kappa^2} q[n].$$  This introduced two additional
          relaxation parameters; MuJoCo calls $\epsilon$ the "impulse
          regularization scaling" and $\kappa$ the "error-reduction time
          constant".</p>
        </example>
      </todo>

      <subsubsection><h1>Beyond Point Contact</h1>

        <p>Coming soon... Box-on-plane example.  Multi-contact.</p>

        <p>Also a single point force cannot capture effects like torsional
        friction, and performs badly in some very simple cases (imaging a box
        sitting on a table).  As a result, many of the most effective algorithms
        for contact restrict themselves to very limited/simplified geometry.
        For instance, one can place "point contacts" (zero-radius spheres) in
        the four corners of a robot's foot; in this case adding forces at
        exactly those four points as they penetrate some other body/mesh can
        give more consistent contact force locations/directions. A surprising
        number of simulators, especially for legged robots, do this.</p>

        <p>In practice, most collision detection algorithms will return a list
        of "collision point pairs" given the list of bodies (with one point pair
        per pair of bodies in collision, or more if they are using the
        aforementioned "multi-contact" heuristics), and our contact force model
        simply attaches springs between these pairs. Given a point-pair, $p_A$
        and $p_B$, both in world coordinates, ...</p>

        <p>Hydroelastic model in drake<elib>Elandt19</elib>. </p>

      </subsubsection>

    </subsection>


  </section>


  <todo> numerical integration.  discrete-time approximations of continuous
  systems (lots of edx people had trouble with this.  some found:
  http://web.mit.edu/10.001/Web/Course_Notes/Differential_Equations_Notes/node3.html)
  </todo>

  <section id="action"><h1>Principle of Stationary Action</h1>

    <p><elib>Susskind14</elib> says <blockquote>The principle of least action -- really the principle of stationary action -- is the most compact form of the classical laws of physics.  This simple rule (it can be written in a single line) summarizes everything!  Not only the principles of classical mechanics, but electromagnetism, general relativity, quantum mechanics, everything known about chemistry -- right down to the ultimate known constituents of matter, elementary particles.</blockquote>  Sounds important!</p>

    <p>I find a lot of presentations of the principle of stationary action
    derivation of the Lagrangian unnecessarily confusing.  Here's my version,
    in case it helps.</p>

    <p>Consider a trajectory $\bq(t)$ defined over $t \in [t_0, t_1]$.  The
    principle of stationary action states that this trajectory is a valid
    solution to the differential equation governing our system if it represents
    a stationary point of the "action", defined as $$A = \int_{t_0}^{t_1}
    L(\bq,\dot{\bq})dt.$$  What does it mean for a trajectory to be a
    stationary point?  Using the calculus of variations, we think of an
    infinitesimal variation of this trajectory: $\bq(t) + {\bf \epsilon}(t),$
    where ${\bf \epsilon}(t)$ is an arbitrary differentiable function that is zero at
    $t_0$ and $t_1$.  That this variation should not change the action means
    $\delta A = 0$ for any small ${\bf \epsilon}(t)$: \begin{align*} \delta A =&
    A(\bq(t) + {\bf \epsilon}(t)) - A(\bq(t)) \\ =& \int_{t_0}^{t_1}
    L\left(\bq(t)+{\bf \epsilon}(t),\, \dot\bq(t) + \dot{\bf \epsilon}(t)\right) dt -
    \int_{t_0}^{t_1} L\left(\bq(t),\,\dot\bq(t)\right)dt.\end{align*}  We can
    expand the term in the first integral: $$L\left(\bq(t)+{\bf \epsilon}(t),\,
    \dot\bq(t) + \dot{\bf \epsilon}(t)\right) = L\left(\bq(t),\,\dot\bq(t)\right) +
    \pd{L}{\bq(t)}{\bf \epsilon}(t) + \pd{L}{\dot\bq(t)}\dot{\bf \epsilon}(t).$$
    Substituting this back in and simplifying leaves: $$\delta A =
    \int_{t_0}^{t_1} \left[\pd{L}{\bq(t)}{\bf \epsilon}(t) +
    \pd{L}{\dot\bq(t)}\dot{\bf \epsilon}(t)\right]dt.$$  Now use integration by parts
    on the second term: $$\int_{t_0}^{t_1} \pd{L}{\dot\bq(t)}\dot{\bf \epsilon}(t)dt
    = \left[\pd{L}{\dot\bq(t)} {\bf \epsilon}(t) \right]_{t_0}^{t_1} -
    \int_{t_0}^{t_1} \frac{d}{dt} \pd{L}{\dot\bq(t)} {\bf \epsilon}(t) dt,$$ and
    observe that the first term in the right-hand side is zero since
    ${\bf \epsilon}(t_0) = {\bf \epsilon}(t_1) = 0$.  This leaves $$\delta A =
    \int_{t_0}^{t_1} \left[\pd{L}{\bq} -
    \frac{d}{dt}\pd{L}{\dot\bq}\right]{\bf \epsilon}(t) dt = 0.$$  Since this must
    integrate to zero for any ${\bf \epsilon}(t)$, we must have $$\pd{L}{\bq} -
    \frac{d}{dt}\pd{L}{\dot\bq} = 0,$$ which can be multiplied by negative one
    to obtain the familiar form of the (unforced) Lagrangian equations of
    motion.  The forced version follows from the variation $$\delta A =
    \int_{t_0}^{t_1} L(\bq(t),\dot\bq(t))dt + \int_{t_0}^{t_1}
    \bQ^T(t){\bf \epsilon}(t) dt = 0.$$ </p>

    <p>For a much richer treatment, see <elib>Lanczos70</elib>.</p>

  </section>

</chapter>
<!-- EVERYTHING BELOW THIS LINE IS OVERWRITTEN BY THE INSTALL SCRIPT -->

<div id="references"><section><h1>References</h1>
<ol>

<li id=Craig89>
<span class="author">John Craig</span>, 
<span class="title">"Introduction to Robotics: Mechanics and Control"</span>, Addison Wesley
, January, <span class="year">1989</span>.

</li><br>
<li id=Lanczos70>
<span class="author">Cornelius Lanczos</span>, 
<span class="title">"The variational principles of mechanics"</span>, University of Toronto Press
, no. no. 4, <span class="year">1970</span>.

</li><br>
<li id=Asada86>
<span class="author">H. Asada and J.E. Slotine</span>, 
<span class="title">"Robot Analysis and Control"</span>, 
, pp. 93-131, <span class="year">1986</span>.

</li><br>
<li id=Featherstone05>
<span class="author">Roy Featherstone</span>, 
<span class="title">"Efficient Factorization of the Joint Space Inertia Matrix for Branched Kinematic Trees"</span>, 
<span class="publisher">International Journal of Robotics Research</span>, vol. 24, no. 6, pp. 487–500, <span class="year">2005</span>.

</li><br>
<li id=Choset05>
<span class="author">H. Choset and K. M. Lynch and S. Hutchinson and G. Kantor and W. Burgard and L. E. Kavraki and S. Thrun</span>, 
<span class="title">"Principles of Robot Motion-Theory, Algorithms, and Implementations"</span>, The MIT Press
, <span class="year">2005</span>.

</li><br>
<li id=Slotine90>
<span class="author">Jean-Jacques E. Slotine and Weiping Li</span>, 
<span class="title">"Applied Nonlinear Control"</span>, Prentice Hall
, October, <span class="year">1990</span>.

</li><br>
<li id=Duindam06>
<span class="author">Vincent Duindam</span>, 
<span class="title">"Port-Based Modeling and Control for Efficient Bipedal Walking Robots"</span>, 
PhD thesis, University of Twente, March, <span class="year">2006</span>.

</li><br>
<li id=Featherstone07>
<span class="author">Roy Featherstone</span>, 
<span class="title">"Rigid Body Dynamics Algorithms"</span>, Springer
, <span class="year">2007</span>.

</li><br>
<li id=Mirtich96>
<span class="author">Brian Mirtich</span>, 
<span class="title">"Impulse-based Dynamic Simulation of Rigid Body Systems"</span>, 
PhD thesis, University of California at Berkeley, <span class="year">1996</span>.

</li><br>
<li id=Jain11>
<span class="author">Abhinandan Jain</span>, 
<span class="title">"Robot and Multibody Dynamics: Analysis and Algorithms"</span>, Springer US
, <span class="year">2011</span>.

</li><br>
<li id=Udwadia92>
<span class="author">Firdaus E Udwadia and Robert E Kalaba</span>, 
<span class="title">"A new perspective on constrained motion"</span>, 
<span class="publisher">Proceedings of the Royal Society of London. Series A: Mathematical and Physical Sciences</span>, vol. 439, no. 1906, pp. 407--410, <span class="year">1992</span>.

</li><br>
<li id=Hunt75>
<span class="author">K. H. Hunt and F. R. E. Crossley</span>, 
<span class="title">"Coefficient of restitution interpreted as damping in vibroimpact"</span>, 
<span class="publisher">Journal of Applied Mechanics</span>, vol. 42 Series E, no. 2, pp. 440-445, <span class="year">1975</span>.

</li><br>
<li id=Castro20>
<span class="author">Alejandro M Castro and Ante Qu and Naveen Kuppuswamy and Alex Alspach and Michael Sherman</span>, 
<span class="title">"A Transition-Aware Method for the Simulation of Compliant Contact with Regularized Friction"</span>, 
<span class="publisher">IEEE Robotics and Automation Letters</span>, vol. 5, no. 2, pp. 1859--1866, <span class="year">2020</span>.

</li><br>
<li id=Murty88>
<span class="author">Katta G Murty and Feng-Tien Yu</span>, 
<span class="title">"Linear complementarity, linear and nonlinear programming"</span>, Citeseer
, vol. 3, <span class="year">1988</span>.

</li><br>
<li id=Stewart96>
<span class="author">D.E. Stewart and J.C. Trinkle</span>, 
<span class="title">"AN IMPLICIT TIME-STEPPING SCHEME FOR RIGID BODY DYNAMICS WITH INELASTIC COLLISIONS AND COULOMB FRICTION"</span>, 
<span class="publisher">International Journal for Numerical Methods in Engineering</span>, vol. 39, no. 15, pp. 2673--2691, <span class="year">1996</span>.

</li><br>
<li id=Anitescu97>
<span class="author">M. Anitescu and F.A. Potra</span>, 
<span class="title">"Formulating dynamic multi-rigid-body contact problems with friction as solvable linear complementarity problems"</span>, 
<span class="publisher">Nonlinear Dynamics</span>, vol. 14, no. 3, pp. 231--247, <span class="year">1997</span>.

</li><br>
<li id=Anitescu06>
<span class="author">Mihai Anitescu</span>, 
<span class="title">"Optimization-based simulation of nonsmooth rigid multibody dynamics"</span>, 
<span class="publisher">Mathematical Programming</span>, vol. 105, no. 1, pp. 113--143, jan, <span class="year">2006</span>.

</li><br>
<li id=Todorov14>
<span class="author">Emanuel Todorov</span>, 
<span class="title">"Convex and analytically-invertible dynamics with contacts and constraints: Theory and implementation in {MuJoCo}"</span>, 
<span class="publisher">2014 IEEE International Conference on Robotics and Automation (ICRA)</span> , pp. 6054--6061, <span class="year">2014</span>.

</li><br>
<li id=Elandt19>
<span class="author">Ryan Elandt and Evan Drumwright and Michael Sherman and Andy Ruina</span>, 
<span class="title">"A pressure field model for fast, robust approximation of net contact force and moment between nominally rigid objects"</span>, 
, pp. 8238-8245, 11, <span class="year">2019</span>.

</li><br>
<li id=Susskind14>
<span class="author">Leonard Susskind and George Hrabovsky</span>, 
<span class="title">"The theoretical minimum: what you need to know to start doing physics"</span>, Basic Books
, <span class="year">2014</span>.

</li><br>
</ol>
</section><p/>
</div>

<table style="width:100%;"><tr style="width:100%">
  <td style="width:33%;text-align:left;"><a class="previous_chapter" href=drake.html>Previous Chapter</a></td>
  <td style="width:33%;text-align:center;"><a href=index.html>Table of contents</a></td>
  <td style="width:33%;text-align:right;"><a class="next_chapter" href=optimization.html>Next Chapter</a></td>
</tr></table>

<div id="footer">
  <hr>
  <table style="width:100%;">
    <tr><td><a href="https://accessibility.mit.edu/">Accessibility</a></td><td style="text-align:right">&copy; Russ
      Tedrake, 2022</td></tr>
  </table>
</div>


</body>
</html>