Full article: A second-order dynamical approach with variable damping to nonconvex smooth minimization

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

We investigate a second-order dynamical system with variable damping in connection with the minimization of a nonconvex differentiable function. The dynamical system is formulated in the spirit of the differential equation which models Nesterov's accelerated convex gradient method. We show that the generated trajectory converges to a critical point, if a regularization of the objective function satisfies the Kurdyka- Lojasiewicz property. We also provide convergence rates for the trajectory formulated in terms of the Lojasiewicz exponent.

COMMUNICATED BY:

Boris Mordukhovich

KEYWORDS:

AMS SUBJECT CLASSIFICATIONS:

View correction statement:

Erratum

1. Introduction

Consider the (not necessarily convex) optimization problem (1) $inf_{x \in R^{n}} g (x),$ (1) where $g : R^{n} ⟶ R$ is a Fréchet differentiable function with $L_{g}$ -Lipschitz continuous gradient, i.e. there exists $L_{g} \geq 0$ such that $∥ \nabla g (x) - \nabla g (y) ∥ \leq L_{g} ∥ x - y ∥$ for all $x, y \in R^{n}$ . We associate to (Equation1(1) $inf_{x \in R^{n}} g (x),$ (1) ) the second-order dynamical system (for $t \geq t_{0}$ ) (2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) where $t_{0} > 0, u_{0}, v_{0} \in R^{n}, α \in R$ and $γ \in (0, + \infty)$ .

The study of the system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) is motivated by the recent developments related to the approaching of the solving of convex optimization problems from a continuous perspective.

In [Citation1], Su, Boyd and Candes proposed the following dynamical system (3) $\begin{aligned} \begin{aligned} \ddot{x} (t) + \frac{α}{t} \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (3) as the continuous counterpart of the Nesterov's accelerated gradient method [see [Citation2]] for minimizing g in the convex case. This research has been deepened by Attouch and his co-authors [see [Citation3,Citation4]], who proved that, if $α > 3$ , then the generated trajectory $x (t)$ converges to a minimizer of g as $t \to + \infty$ , while the convergence rate of the objective function along the trajectory is $o (1 / t^{2})$ . The convergence of the trajectory is actually the continuous counterpart of a result due to Chambolle and Dossal [see [Citation5]], which proves the convergence of the iterates of the modified FISTA algorithm [see [Citation6]].

Recently, in [Citation7], investigations have been performed concerning the convergence rate of the objective function along the trajectory in the subcritical case $α \leq 3$ , while some open questions related to the asymptotic properties of the trajectory have been formulated.

In this manuscript, we carry out, in the nonconvex setting, an asymptotic analysis of the dynamical system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ), which can be seen as a perturbation of the dynamical system (Equation3(3) $\begin{aligned} \begin{aligned} \ddot{x} (t) + \frac{α}{t} \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (3) ) that models Nesterov's accelerated gradient method in the convex case. To the best of our knowledge, this is the first contribution addressing second-order dynamical systems with variable damping associated to nonconvex optimization problems. We show that the generated trajectory converges to a critical point of g as $t \to + \infty$ , provided the following regularization of g, $H : R^{n} \times R^{n} ⟶ R, H (u, v) = g (u) + \frac{1}{2} ∥ u - v ∥^{2},$ satisfies the Kurdyka–Łojasiewicz inequality. Moreover, we derive convergence rates in the terms of Łojasiewicz exponent, for the trajectory, its velocity and its acceleration. One of the major future goals is to study the asymptotic properties of the system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) in case $γ = 0$ . For other investigations of the asymptotic analysis of second-order dynamical systems with time-dependent damping, we refer to the papers of Haraux and Jendoubi [Citation8] and Balti [Citation9].

For $α = 0$ , the convergence of the trajectory generated by (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) to a critical point of g has been shown by Bégout, Bolte and Jendoubi in [Citation10] in the hypothesis that g is of class $C^{2}$ and it satisfies the Kurdyka–Łojasiewicz property with a desingularizing function satisfying a restrictive condition [see also the papers of Haraux and Jendoubi [Citation11] and Chill and Jendoubi [Citation12]]. On the other hand, the dynamical system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) is, for $α = 0$ , a particular instance of the second-order dynamical system of proximal-gradient type studied in [Citation13].

The following numerical scheme, with starting points $x_{0}, x_{1} \in R^{n}$ , (4) $(\forall k \geq 1) \{\begin{cases} y_{k} = x_{k} + \frac{(1 - γ \sqrt{s}) k - α γ \sqrt{s}}{k + α} (x_{k} - x_{k - 1}), \\ x_{k + 1} = y_{k} - s \nabla g (y_{k}), \end{cases}$ (4) where $s \leq 1 / L_{g}$ is the step size, can be seen as a discrete counterpart of (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ). One can notice that for $γ = 0$ , this iterative scheme algorithm is similar to Nesterov's accelerated convex gradient method.

In the following, we prove that (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) can be seen in an informal way as the exact limit of (Equation4(4) $(\forall k \geq 1) \{\begin{cases} y_{k} = x_{k} + \frac{(1 - γ \sqrt{s}) k - α γ \sqrt{s}}{k + α} (x_{k} - x_{k - 1}), \\ x_{k + 1} = y_{k} - s \nabla g (y_{k}), \end{cases}$ (4) )). We take to this end in (Equation4(4) $(\forall k \geq 1) \{\begin{cases} y_{k} = x_{k} + \frac{(1 - γ \sqrt{s}) k - α γ \sqrt{s}}{k + α} (x_{k} - x_{k - 1}), \\ x_{k + 1} = y_{k} - s \nabla g (y_{k}), \end{cases}$ (4) ) small step sizes and follow the same approach as Su, Boyd and Candes in [Citation1, Section 2]. For this purpose, we rewrite (Equation4(4) $(\forall k \geq 1) \{\begin{cases} y_{k} = x_{k} + \frac{(1 - γ \sqrt{s}) k - α γ \sqrt{s}}{k + α} (x_{k} - x_{k - 1}), \\ x_{k + 1} = y_{k} - s \nabla g (y_{k}), \end{cases}$ (4) ) in the form (5) $\frac{x_{k + 1} - x_{k}}{\sqrt{s}} = \frac{(1 - γ \sqrt{s}) k - α γ \sqrt{s}}{k + α} \cdot \frac{x_{k} - x_{k - 1}}{\sqrt{s}} - \sqrt{s} \nabla g (y_{k}) \forall k \geq 1$ (5) and introduce the Ansatz $x_{k} \approx x (k \sqrt{s})$ for some twice continuously differentiable function $x : [0, + \infty) \to R^{n}$ . We let $k = t / \sqrt{s}$ and get $x (t) \approx x_{k}, x (t + \sqrt{s}) \approx x_{k + 1}, x (t - \sqrt{s}) \approx x_{k - 1} .$ Then, as the step size s goes to zero, from the Taylor expansion of x, we obtain $\frac{x_{k + 1} - x_{k}}{\sqrt{s}} = \dot{x} (t) + \frac{1}{2} \ddot{x} (t) \sqrt{s} + o (\sqrt{s})$ and $\frac{x_{k} - x_{k - 1}}{\sqrt{s}} = \dot{x} (t) - \frac{1}{2} \ddot{x} (t) \sqrt{s} + o (\sqrt{s}) .$

Furthermore, since $\sqrt{s} ∥ \nabla g (y_{k}) - \nabla g (x_{k}) ∥ \leq \sqrt{s} L_{g} ∥ y_{k} - x_{k} ∥ = \sqrt{s} L_{g} |\frac{(1 - γ \sqrt{s}) k - α γ \sqrt{s}}{k + α}| ∥ x_{k} - x_{k - 1} ∥ = o (\sqrt{s}),$ it follows $\sqrt{s} \nabla g (y_{k}) = \sqrt{s} \nabla g (x_{k}) + o (\sqrt{s})$ . Consequently, (Equation5(5) $\frac{x_{k + 1} - x_{k}}{\sqrt{s}} = \frac{(1 - γ \sqrt{s}) k - α γ \sqrt{s}}{k + α} \cdot \frac{x_{k} - x_{k - 1}}{\sqrt{s}} - \sqrt{s} \nabla g (y_{k}) \forall k \geq 1$ (5) ) can be written as $\begin{aligned} \dot{x} (t) + \frac{1}{2} \ddot{x} (t) \sqrt{s} + o (\sqrt{s}) & = \frac{(1 - γ \sqrt{s}) t - α γ s}{t + α \sqrt{s}} (\dot{x} (t) - \frac{1}{2} \ddot{x} (t) \sqrt{s} + o (\sqrt{s})) \\ - \sqrt{s} \nabla g (x (t)) + o (\sqrt{s}) \end{aligned}$ or, equivalently $\begin{aligned} (t + α \sqrt{s}) (\dot{x} (t) + \frac{1}{2} \ddot{x} (t) \sqrt{s} + o (\sqrt{s})) & = ((1 - γ \sqrt{s}) t - α γ s) (\dot{x} (t) - \frac{1}{2} \ddot{x} (t) \sqrt{s} + o (\sqrt{s})) \\ - \sqrt{s} (t + α \sqrt{s}) \nabla g (x (t)) + o (\sqrt{s}) . \end{aligned}$ Hence, $\frac{1}{2} (2 t + α \sqrt{s} - γ t \sqrt{s} - α γ s) \ddot{x} (t) \sqrt{s} + (γ t \sqrt{s} + α \sqrt{s} + α γ s) \dot{x} (t) + \sqrt{s} (t + α \sqrt{s}) \nabla g (x (t)) = o (\sqrt{s}) .$ After dividing by $\sqrt{s}$ and letting $s \to 0$ , we obtain $t \ddot{x} (t) + (γ t + α) \dot{x} (t) + t \nabla g (x (t)) = 0,$ which, after division by t, gives (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ), namely $\ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0.$

2. Existence and uniqueness of the trajectory

We consider on the finite-dimensional space $R^{n}$ the Euclidean topology. If $x \in R^{n}$ is a local minimizer of g, then $\nabla g (x) = 0$ . We denote by $crit (g) = {x \in R^{n} : \nabla g (x) = 0}$ the set of critical points of g.

We are considering in the asymptotic analysis of the dynamical system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) strong global solutions.

Definition 2.1

We say that $x : [t_{0}, + \infty) \to R^{n}$ is a strong global solution of (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ), if the following properties are satisfied:

$x, \dot{x} : [t_{0}, + \infty) \to R^{n}$ are locally absolutely continuous, in other words, absolutely continuous on each interval $[t_{0}, T]$ for $t_{0} < T < + \infty$ ;
$\ddot{x} (t) + (α / t + γ) \dot{x} (t) + \nabla g (x (t)) = 0$ for almost every $t \geq t_{0}$ ;
$x (t_{0}) = u_{0}$ and $\dot{x} (t_{0}) = v_{0}$ .

Recall that a function $x : [t_{0}, + \infty) \to R^{n}$ is absolutely continuous on an interval $[t_{0}, T]$ , if there exists an integrable function $y : [t_{0}, T] \to R^{n}$ such that $x (t) = x (0) + \int_{t_{0}}^{t} y (s) d s \forall t \in [t_{0}, T] .$ It follows from the definition that an absolutely continuous function is differentiable almost everywhere, its derivative coincides with its distributional derivative almost everywhere and one can recover the function from its derivative $\dot{x} = y$ by the integration formula above. On the other hand, if $x : [t_{0}, T] \to R^{n}$ (where $T > t_{0}$ ) is absolutely continuous and $B : R^{n} \to R^{n}$ is L-Lipschitz continuous (where $L \geq 0$ ), then the function $B \circ x$ is absolutely continuous, too. Moreover, $B \circ x$ is almost everywhere differentiable and the inequality $∥ (d / d t) B (x (t)) ∥ \leq L ∥ \dot{x} (t) ∥$ holds for almost every $t \geq t_{0}$ [see [Citation14,Citation15]].

We prove existence and uniqueness of a strong global solution of (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) by making use of the Cauchy–Lipschitz–Picard Theorem for absolutely continues trajectories [see for example [Citation16, Proposition 6.2.1], [Citation17, Theorem 54]]. The key argument is that one can rewrite (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) as a particular first-order dynamical system in a suitably chosen product space [see also [Citation18]].

Theorem 2.1

For every starting points $u_{0}, v_{0} \in R^{n}$ there exists a unique strong global solution of the dynamical system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ).

Proof.

By making use of the notation $X (t) = (x (t), \dot{x} (t))$ , the system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) can be rewritten as a first-order dynamical system: (6) $\begin{aligned} \begin{aligned} \dot{X} (t) & = F (t, X (t)), \\ X (t_{0}) & = (u_{0}, v_{0}), \end{aligned} \end{aligned}$ (6) where $F : [t_{0}, + \infty) \times R^{n} \times R^{n} ⟶ R^{n} \times R^{n}, F (t, u, v) = (v, - (α / t + γ) v - \nabla g (u)) .$

First we show that $F (t, \cdot, \cdot)$ is $L (t)$ -Lipschitz continuous for every $t \geq t_{0}$ and that the Lipschitz constant is a function of time with the property that $L (\cdot) \in L_{loc}^{1} ([t_{0}, + \infty))$ . Indeed, for every $(u, v), (\bar{u}, \bar{v}) \in R^{n} \times R^{n}$ , we have $\begin{aligned} ∥ F (t, u, v) - F (t, \bar{u}, \bar{v}) ∥ & = \sqrt{∥ v - \bar{v} ∥^{2} + {∥(\frac{α}{t} + γ) (\bar{v} - v) + (\nabla g (\bar{u}) - \nabla g (u))∥}^{2}} \\ \leq \sqrt{(1 + 2 {(\frac{α}{t} + γ)}^{2}) ∥ v - \bar{v} ∥^{2} + 2 L_{g}^{2} ∥ u - \bar{u} ∥^{2}} \\ \leq \sqrt{1 + 2 L_{g}^{2} + 2 {(\frac{α}{t} + γ)}^{2}} \sqrt{∥ v - \bar{v} ∥^{2} + ∥ u - \bar{u} ∥^{2}} \\ = \sqrt{1 + 2 L_{g}^{2} + 2 {(\frac{α}{t} + γ)}^{2}} ∥ (u, v) - (\bar{u}, \bar{v}) ∥ . \end{aligned}$ Obviously, the Lipschitz constant function $t \mapsto L (t) := \sqrt{1 + 2 L_{g}^{2} + 2 (α / t + γ)^{2}}$ is continuous, hence integrable, on $[t_{0}, T]$ for all $t_{0} < T < + \infty$ , consequently, $L \in L_{loc}^{1} ([t_{0}, + \infty))$ .

Next we show that $F (\cdot, u, v) \in L_{loc}^{1} ([t_{0}, + \infty), R^{n} \times R^{n})$ for all $u, v \in R^{n}$ . Let $u, v \in R^{n}$ be fixed. For $t_{0} < T < + \infty$ , one has $\begin{aligned} \int_{t_{0}}^{T} ∥ F (t, u, v) ∥ d t & = \int_{t_{0}}^{T} \sqrt{∥ v ∥^{2} + {∥(\frac{α}{t} + γ) v + \nabla g (u)∥}^{2}} d t \\ \leq \int_{t_{0}}^{T} \sqrt{(1 + 2 {(\frac{α}{t} + γ)}^{2}) ∥ v ∥^{2} + 2 ∥ \nabla g (u) ∥^{2}} d t \\ \leq \sqrt{∥ v ∥^{2} + ∥ \nabla g (u) ∥^{2}} \int_{t_{0}}^{T} \sqrt{3 + 2 {(\frac{α}{t} + γ)}^{2}} d t \end{aligned}$ and the conclusion follows by the continuity of $t \mapsto \sqrt{3 + 2 (α / t + γ)^{2}}$ on $[t_{0}, T]$ .

The Cauchy–Lipschitz–Picard Theorem guarantees existence and uniqueness of the trajectory of the first-order dynamical system (Equation6(6) $\begin{aligned} \begin{aligned} \dot{X} (t) & = F (t, X (t)), \\ X (t_{0}) & = (u_{0}, v_{0}), \end{aligned} \end{aligned}$ (6) ) and thus of the second-order dynamical system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ).

The next result shows that the acceleration of the trajectory generated by (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) is also locally absolutely continuous on $[t_{0}, + \infty)$ .

Proposition 2.1

For the starting points $u_{0}, v_{0} \in R^{n},$ let x be the unique strong global solution of (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ). Then $\ddot{x}$ is locally absolutely continuous on $[t_{0}, + \infty),$ hence the third-order derivative $x^{(3)}$ exists almost everywhere on $[t_{0}, + \infty) .$

Proof.

Let T>0 be fixed. According to Theorem 2.1, $X (t) := (x (t), \dot{x} (t))$ is absolutely continuous on $[t_{0}, T]$ . We endow the product space $R^{n} \times R^{n}$ with the 1-norm. For arbitrary $s, t \in [t_{0}, T]$ , we have $\begin{aligned} ∥ \dot{X} (s) - \dot{X} (t) ∥_{1} & = ∥ F (s, X (s)) - F (t, X (t)) ∥_{1} \\ = {∥(\dot{x} (s) - \dot{x} (t), - (\frac{α}{s} + γ) \dot{x} (s) + (\frac{α}{t} + γ) \dot{x} (t) - \nabla g (x (s)) + \nabla g (x (t)))∥}_{1} \\ \leq (1 + γ) ∥ \dot{x} (s) - \dot{x} (t) ∥ + ∥\frac{α}{s} \dot{x} (s) - \frac{α}{t} \dot{x} (t)∥ + ∥ \nabla g (x (s)) - \nabla g (x (t)) ∥ \\ \leq (1 + γ) ∥ \dot{x} (s) - \dot{x} (t) ∥ + \frac{| α |}{s} ∥ \dot{x} (s) - \dot{x} (t) ∥ + ∥\frac{α}{s} \dot{x} (t) - \frac{α}{t} \dot{x} (t)∥ + L_{g} ∥ x (s) - x (t) ∥ \\ \leq L_{1} ∥ \dot{x} (s) - \dot{x} (t) ∥ + L_{2} |\frac{α}{s} - \frac{α}{t}| + L_{g} ∥ x (s) - x (t) ∥, \end{aligned}$ where $L_{1} := max_{t \in [t_{0}, T]} (1 + γ + \frac{| α |}{t}) and L_{2} := max_{t \in [t_{0}, T]} ∥ \dot{x} (t) ∥ .$

Let be $ε > 0$ . Since the functions $\dot{x} (\cdot), t \mapsto α / t$ and $x (\cdot)$ are absolutely continuous on $[t_{0}, T]$ , there exists $η > 0$ such that for any finite family of intervals $I_{k} = (a_{k}, b_{k}) \subseteq [t_{0}, T]$ , the implication $\begin{aligned} (I_{k} \cap I_{j} = \emptyset and \sum_{k} | b_{k} - a_{k} | < η) \\ ⟹ \sum_{k} ∥ \dot{x} (b_{k}) - \dot{x} (a_{k}) ∥ < \frac{ε}{3 L_{1}}, \sum_{k} |\frac{α}{b_{k}} - \frac{α}{a_{k}}| < \frac{ε}{3 L_{2}} and \sum_{k} ∥ x (b_{k}) - x (a_{k}) ∥ < \frac{ε}{3 L_{g}} \end{aligned}$ holds. Consequently, $\sum_{k} ∥ \dot{X} (a_{k}) - \dot{X} (b_{k}) ∥ < \frac{ε}{3} + \frac{ε}{3} + \frac{ε}{3} = ε,$ hence $\dot{X} (\cdot) = (\dot{x} (\cdot), \ddot{x} (\cdot))$ is absolutely continuous on $[t_{0}, T]$ , which shows that $\ddot{x}$ is absolutely continuous $[t_{0}, T]$ . This proves that $\ddot{x}$ is locally absolutely continuous on $[t_{0}, + \infty)$ , which means that the third-order derivative $x^{(3)}$ exists almost everywhere on $[t_{0}, + \infty) .$

The following results provides an estimate for the third-order derivative of the strong global solution of the dynamical system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) in terms its first- and second-order derivatives.

Lemma 2.1

For the starting points $u_{0}, v_{0} \in R^{n},$ let x be the unique strong global solution of (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ). Then, for almost every $t \in [t_{0}, + \infty),$ it holds (7) $∥ x^{(3)} (t) ∥ \leq (L_{g} + \frac{| α |}{t^{2}}) ∥ \dot{x} (t) ∥ + (γ + \frac{| α |}{t}) ∥ \ddot{x} (t) ∥ .$ (7)

Proof.

Let $t \in [t_{0}, + \infty)$ be such that $\dot{X} (t) = F (t, X (t))$ . We have for almost every h>0 that $\begin{aligned} ∥ \dot{X} (t + h) - \dot{X} (t) ∥_{1} = ∥ F (t + h, X (t + h)) - F (t, X (t)) ∥_{1} \\ = {∥(\dot{x} (t + h) - \dot{x} (t), - (\frac{α}{t + h} + γ) \dot{x} (t + h) + (\frac{α}{t} + γ) \dot{x} (t) - \nabla g (x (t + h)) + \nabla g (x (t)))∥}_{1} \\ = ∥ \dot{x} (t + h) - \dot{x} (t) ∥ + ∥- (\frac{α}{t + h} + γ) \dot{x} (t + h) + (\frac{α}{t} + γ) \dot{x} (t) - \nabla g (x (t + h)) + \nabla g (x (t))∥ \\ \leq (1 + γ) ∥ \dot{x} (t + h) - \dot{x} (t) ∥ + ∥\frac{α}{t + h} \dot{x} (t + h) - \frac{α}{t} \dot{x} (t)∥ + ∥ \nabla g (x (t + h)) - \nabla g (x (t)) ∥ \\ \leq (1 + γ) ∥ \dot{x} (t + h) - \dot{x} (t) ∥ + ∥\frac{α}{t + h} \dot{x} (t + h) - \frac{α}{t} \dot{x} (t)∥ + L_{g} ∥ x (t + h) - x (t) ∥ . \end{aligned}$ Hence, $\begin{aligned} {∥\frac{\dot{X} (t + h) - \dot{X} (t)}{h}∥}_{1} & \leq (1 + γ) ∥\frac{\dot{x} (t + h) - \dot{x} (t)}{h}∥ + ∥\frac{(α / (t + h)) \dot{x} (t + h) - (α / t) \dot{x} (t)}{h}∥ \\ + L_{g} ∥\frac{x (t + h) - x (t)}{h}∥ . \end{aligned}$ By taking the limit as $h \to 0$ , we obtain $∥ \ddot{X} (t) ∥_{1} \leq (1 + γ) ∥ \ddot{x} (t) ∥ + ∥{(\frac{α}{t} \dot{x} (t))}^{'}∥ + L_{g} ∥ \dot{x} (t) ∥ .$ Since $∥ \ddot{X} (t) ∥_{1} = ∥ x^{(3)} (t) ∥ + ∥ \ddot{x} (t) ∥$ , we conclude $∥ x^{(3)} (t) ∥ \leq (L_{g} + \frac{| α |}{t^{2}}) ∥ \dot{x} (t) ∥ + (γ + \frac{| α |}{t}) ∥ \ddot{x} (t) ∥ .$

Remark 2.1

For $N := max_{t \geq t_{0}} (L_{g} + \frac{| α |}{t^{2}}, γ + \frac{| α |}{t}),$ we have that $∥ x^{(3)} (t) ∥ \leq N (∥ \ddot{x} (t) ∥ + ∥ \dot{x} (t) ∥)$ for almost every $t \in [t_{0}, + \infty)$ .

3. Convergence of trajectories

In this section, we study the convergence of the trajectory of the dynamical system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ). We denote by $ω (x) := {\bar{x} \in R^{n} : \exists t_{k} ⟶ + \infty such that x (t_{k}) ⟶ \bar{x} as k ⟶ + \infty}$ the set of limit points of the trajectory x.

Before proving a first result in this sense, we recall two technical lemmas which will play an essential role in the asymptotic analysis.

Lemma 3.1

Suppose that $F : [0, + \infty) \to R$ is locally absolutely continuous and bounded below and that there exists $G \in L^{1} ([0, + \infty))$ such that for almost every $t \in [0, + \infty)$ $\frac{d}{d t} F (t) \leq G (t) .$ Then there exists $lim_{t \to + \infty} F (t) \in R$ .

Lemma 3.2

Suppose that $F : [0, + \infty) \to [0, + \infty)$ is locally absolutely continuous and $F \in L^{p} ([0, + \infty)), 1 \leq p < \infty,$ and that there exists $G : [0, + \infty) \to R, G \in L^{r} ([0, + \infty)), 1 \leq r \leq \infty,$ such that for almost every $t \in [0, + \infty)$ $\frac{d}{d t} F (t) \leq G (t) .$ Then it holds $lim_{t \to + \infty} F (t) = 0$ .

Theorem 3.1

$\dot{x}, \ddot{x} \in L^{2} ([t_{0}, + \infty), R^{n});$
there exists $β > 0$ such that the limit $lim_{t ⟶ + \infty} g (β \dot{} x (t) + x (t))$ exists and is finite;
$lim_{t ⟶ + \infty} \ddot{x} (t) = 0$ and $lim_{t ⟶ + \infty} \dot{x} (t) = 0;$
$ω (x) \subseteq crit (g) .$

Proof.

Choose $0 < β < min (2 / L_{g}, (\sqrt{L_{g}^{2} + 2 γ L_{g}} - L_{g}) / L_{g})$ . By using the $L_{g}$ -Lipschitz continuity of $\nabla g$ , for almost every $t \in [t_{0}, + \infty)$ it holds $\begin{aligned} \frac{d}{d t} g (β \dot{x} (t) + x (t)) & = ⟨ β \ddot{x} (t) + \dot{x} (t), \nabla g (β \dot{x} (t) + x (t)) ⟩ \\ = ⟨ β \ddot{x} (t) + \dot{x} (t), \nabla g (β \dot{x} (t) + x (t)) - \nabla g (x (t)) ⟩ \\ + ⟨β \ddot{x} (t) + \dot{x} (t), - \ddot{x} (t) - (\frac{α}{t} + γ) \dot{x} (t)⟩ \\ \leq - β ∥ \ddot{x} (t) ∥^{2} - (1 + β γ + \frac{α β}{t}) ⟨ \ddot{x} (t), \dot{x} (t) ⟩ - (γ + \frac{α}{t}) ∥ \dot{x} (t) ∥^{2} \\ + L_{g} ∥ β \ddot{x} (t) + \dot{x} (t) ∥ ∥ β \dot{x} (t) ∥ . \end{aligned}$ Taking into account that $∥ β \ddot{x} (t) + \dot{x} (t) ∥ ∥ β \dot{x} (t) ∥ \leq β^{2} ∥ \ddot{x} (t) ∥ ∥ \dot{x} (t) ∥ + β ∥ \dot{x} (t) ∥^{2} \leq \frac{1}{2} β^{2} ∥ \ddot{x} (t) ∥^{2} + (β + \frac{1}{2} β^{2}) ∥ \dot{x} (t) ∥^{2}$ and $- (1 + β γ + \frac{α β}{t}) ⟨ \ddot{x} (t), \dot{x} (t) ⟩ = - \frac{1}{2} \frac{d}{d t} [(1 + β γ + \frac{α β}{t}) ∥ \dot{x} (t) ∥^{2}] - \frac{α β}{2 t^{2}} ∥ \dot{x} (t) ∥^{2},$ we obtain for almost every $t \in [t_{0}, + \infty)$ (8) $\begin{aligned} \frac{d}{d t} (g (β \dot{x} (t) + x (t)) + \frac{1}{2} (1 + β γ + \frac{α β}{t}) ∥ \dot{x} (t) ∥^{2}) \\ \leq (- β + \frac{L_{g} β^{2}}{2}) ∥ \ddot{x} (t) ∥^{2} + (- γ + L_{g} β + \frac{L_{g} β^{2}}{2} - \frac{α}{t} - \frac{α β}{2 t^{2}}) ∥ \dot{x} (t) ∥^{2} . \end{aligned}$ (8) Since $0 < β < min (2 / L_{g}, (\sqrt{L_{g}^{2} + 2 γ L_{g}} - L_{g}) / L_{g})$ , there exists $t_{1} > 0$ such that for every $t \geq t_{1}$ it holds (9) $1 + β γ + \frac{α β}{t} > 0, - β + \frac{L_{g} β^{2}}{2} < 0 and - γ + L_{g} β + \frac{L_{g} β^{2}}{2} - \frac{α}{t} - \frac{α β}{2 t^{2}} < 0.$ (9) For simplicity, we denote $A := - β + \frac{L_{g} β^{2}}{2} and B (t) := - γ + L_{g} β + \frac{L_{g} β^{2}}{2} - \frac{α}{t} - \frac{α β}{2 t^{2}} \forall t \in [t_{0}, + \infty) .$

Let be $T > t_{1} .$ Since $x \in C^{2} ([t_{1}, T], R^{n})$ , we have $x, \dot{x}, \ddot{x} \in L^{2} ([t_{1}, T], R^{n}) .$ Furthermore, by the $L_{g}$ -Lipschitz property of $\nabla g$ , it holds $\nabla g \in L^{2} ([t_{1}, T], R^{n}) .$ By integrating (Equation8(8) $\begin{aligned} \frac{d}{d t} (g (β \dot{x} (t) + x (t)) + \frac{1}{2} (1 + β γ + \frac{α β}{t}) ∥ \dot{x} (t) ∥^{2}) \\ \leq (- β + \frac{L_{g} β^{2}}{2}) ∥ \ddot{x} (t) ∥^{2} + (- γ + L_{g} β + \frac{L_{g} β^{2}}{2} - \frac{α}{t} - \frac{α β}{2 t^{2}}) ∥ \dot{x} (t) ∥^{2} . \end{aligned}$ (8) ) on $[t_{1}, T]$ , we obtain (10) $\begin{aligned} g (β \dot{x} (T) + x (T)) + \frac{1}{2} (1 + β γ + \frac{α β}{T}) ∥ \dot{x} (T) ∥^{2} - \int_{t_{1}}^{T} A ∥ \ddot{x} (t) ∥^{2} d t - \int_{t_{1}}^{T} B (t) ∥ \dot{x} (t) ∥^{2} d t \\ \leq g (β \dot{x} (t_{1}) + x (t_{1})) + \frac{1}{2} (1 + β γ + \frac{α β}{t_{1}}) ∥ \dot{x} (t_{1}) ∥^{2} . \end{aligned}$ (10) Taking into account that g is bounded from bellow, by letting $T ⟶ + \infty$ , we obtain $\int_{t_{1}}^{\infty} (- A ∥ \ddot{x} (t) ∥^{2}) d t < + \infty and \int_{t_{1}}^{\infty} (- B (t) ∥ \dot{x} (t) ∥^{2}) d t < + \infty$ Consequently $∥ \ddot{x} (\cdot) ∥^{2}, B (\cdot) ∥ \dot{x} (\cdot) ∥^{2} \in L^{1} ([t_{0}, + \infty), R)$ , hence $\ddot{x}, \dot{x} \in L^{2} ([t_{0}, + \infty), R^{n}) .$ On the other hand, (Equation8(8) $\begin{aligned} \frac{d}{d t} (g (β \dot{x} (t) + x (t)) + \frac{1}{2} (1 + β γ + \frac{α β}{t}) ∥ \dot{x} (t) ∥^{2}) \\ \leq (- β + \frac{L_{g} β^{2}}{2}) ∥ \ddot{x} (t) ∥^{2} + (- γ + L_{g} β + \frac{L_{g} β^{2}}{2} - \frac{α}{t} - \frac{α β}{2 t^{2}}) ∥ \dot{x} (t) ∥^{2} . \end{aligned}$ (8) ) and Lemma 3.1 ensure that the limit (11) $lim_{t ⟶ + \infty} (g (β \dot{x} (t) + x (t)) + \frac{1}{2} (1 + β γ + \frac{α β}{t}) ∥ \dot{x} (t) ∥^{2})$ (11) exists and is finite.

As for almost every $t \in [t_{0}, + \infty)$ $\frac{d}{d t} (∥ \dot{x} (t) ∥^{2}) = 2 ⟨ \ddot{x} (t), \dot{x} (t) ⟩ \leq ∥ \dot{x} (t) ∥^{2} + ∥ \ddot{x} (t) ∥^{2}$ and $∥ \dot{x} (\cdot) ∥^{2} + ∥ \ddot{x} (\cdot) ∥^{2} \in L^{1} ([t_{0}, + \infty))$ , according to Lemma 3.2, it follows that $lim_{t ⟶ + \infty} \dot{x} (t) = 0.$

As for almost every $t \in [t_{0}, + \infty)$ $\frac{d}{d t} (∥ \ddot{x} (t) ∥^{2}) = 2 ⟨ x^{(3)} (t), \ddot{x} (t) ⟩ \leq ∥ x^{(3)} (t) ∥^{2} + ∥ \ddot{x} (t) ∥^{2}$ and $∥ x^{(3)} (\cdot) ∥^{2} + ∥ \ddot{x} (\cdot) ∥^{2} \in L^{1} ([t_{0}, + \infty))$ (see Remark 2.1 and (i)), according to Lemma 3.2, it follows that $lim_{t ⟶ + \infty} \ddot{x} (t) = 0.$

Finally, by using that $lim_{t ⟶ + \infty} \dot{x} (t) = 0$ , (Equation11(11) $lim_{t ⟶ + \infty} (g (β \dot{x} (t) + x (t)) + \frac{1}{2} (1 + β γ + \frac{α β}{t}) ∥ \dot{x} (t) ∥^{2})$ (11) ) becomes (12) $\exists lim_{t ⟶ + \infty} (g (β \dot{x} (t) + x (t)) + \frac{1}{2} (1 + β γ + \frac{α β}{t}) ∥ \dot{x} (t) ∥^{2}) = lim_{t ⟶ + \infty} g (β \dot{x} (t) + x (t)) \in R .$ (12)

Let $\bar{x} \in ω (x) .$ Then there exists a sequence $t_{k} ⟶ + \infty, k ⟶ + \infty$ such that $x (t_{k}) ⟶ \bar{x}$ as $k ⟶ + \infty .$ By using the continuity of $\nabla g$ , we have $0 = lim_{k ⟶ + \infty} (\ddot{x} (t_{k}) + (\frac{α}{t_{k}} + γ) \dot{x} (t_{k}) + \nabla g (x (t_{k}))) = \nabla g (\bar{x}),$ which shows that $\bar{x} \in crit (g) .$

In the following result, we use the distance function to a set, defined for $A \subseteq R^{n}$ as $dist (x, A) = inf_{y \in A} ∥ x - y ∥$ for all $x \in R^{n}$ . The following result is a direct consequence of Theorem 3.1.

Lemma 3.3

Assume that g is bounded from below and, for $u_{0}, v_{0} \in R^{n},$ let x be the unique strong global solution of the dynamical system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ). Define $H : R^{n} \times R^{n} ⟶ R, H (x, y) = g (x) + \frac{1}{2} ∥ x - y ∥^{2} .$ Let be $0 < β < min (2 / L_{g}, (\sqrt{L_{g}^{2} + 2 γ L_{g}} - L_{g}) / L_{g})$ and $t_{1} > 0$ such that for every $t \geq t_{1}$ the inequalities (Equation9(9) $1 + β γ + \frac{α β}{t} > 0, - β + \frac{L_{g} β^{2}}{2} < 0 and - γ + L_{g} β + \frac{L_{g} β^{2}}{2} - \frac{α}{t} - \frac{α β}{2 t^{2}} < 0.$ (9) ) hold. For every $t \in [t_{1}, + \infty),$ define $\begin{aligned} u (t) & := β \dot{x} (t) + x (t), v (t) := (\sqrt{1 + β γ + \frac{α β}{t}} + β) \dot{x} (t) + x (t), \\ A & = - β + \frac{L_{g} β^{2}}{2}, B (t) := - γ + L_{g} β + \frac{L_{g} β^{2}}{2} - \frac{α}{t} - \frac{α β}{2 t^{2}} a n d \\ p (t) & := L_{g} β + γ + \frac{| α |}{t} + 2 \sqrt{1 + β γ + \frac{α β}{t}} . \end{aligned}$ Then the following statements are true:

$ω (u) = ω (v) = ω (x);$
$\frac{d}{d t} H (u (t), v (t)) \leq A ∥ \ddot{x} (t) ∥^{2} + B (t) ∥ \dot{x} (t) ∥^{2} \leq 0 for almost every t \geq t_{1};$
the limit $lim_{t ⟶ + \infty} H (u (t), v (t)) = lim_{t ⟶ + \infty} g (β \dot{} x (t) + x (t))$ exists and is finite;
H is finite and constant on $ω (u, v) = {(\bar{x}, \bar{x}) \in R^{n} \times R^{n} : \bar{x} \in ω (x)};$
$∥ \nabla H (u (t), v (t)) ∥ \leq ∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥ for almost every t \geq t_{1};$
$ω (u, v) \subseteq crit (H) .$

If x is bounded, then

$ω (u, v)$ is nonempty and compact;
$lim_{t ⟶ + \infty} dist ((u (t), v (t)), ω (u, v)) = 0.$

Proof.

(i) According to Theorem 3.1(iii), $lim_{t ⟶ + \infty} β \dot{x} (t) = lim_{t ⟶ + \infty} (\sqrt{1 + β γ + \frac{α β}{t}} + β) \dot{x} (t) = 0,$ hence $ω (u) = ω (v) = ω (x) .$

(ii) and (iii) are nothing else than (Equation8(8) $\begin{aligned} \frac{d}{d t} (g (β \dot{x} (t) + x (t)) + \frac{1}{2} (1 + β γ + \frac{α β}{t}) ∥ \dot{x} (t) ∥^{2}) \\ \leq (- β + \frac{L_{g} β^{2}}{2}) ∥ \ddot{x} (t) ∥^{2} + (- γ + L_{g} β + \frac{L_{g} β^{2}}{2} - \frac{α}{t} - \frac{α β}{2 t^{2}}) ∥ \dot{x} (t) ∥^{2} . \end{aligned}$ (8) ) and (Equation12(12) $\exists lim_{t ⟶ + \infty} (g (β \dot{x} (t) + x (t)) + \frac{1}{2} (1 + β γ + \frac{α β}{t}) ∥ \dot{x} (t) ∥^{2}) = lim_{t ⟶ + \infty} g (β \dot{x} (t) + x (t)) \in R .$ (12) ), respectively.

(iv) follows directly from (iii).

(v) Since $\nabla H (x, y) = (\nabla g (x) + x - y, y - x)$ for every $(x, y) \in R^{n} \times R^{n}$ , by using (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ), we have for almost every $t \in [t_{1}, + \infty)$ $\begin{aligned} ∥ \nabla H (u (t), v (t)) ∥ \leq ∥ \nabla g (u (t)) ∥ + 2 ∥ u (t) - v (t) ∥ \\ \leq ∥ \nabla g (u (t)) - \nabla g (x (t)) ∥ + ∥ \nabla g (x (t)) ∥ + 2 ∥ u (t) - v (t) ∥ \\ \leq L_{g} β ∥ \dot{x} (t) ∥ + ∥- \ddot{x} (t) - (γ + \frac{α}{t}) \dot{x} (t)∥ + 2 ∥ u (t) - v (t) ∥ \\ \leq ∥ \ddot{x} (t) ∥ + (L_{g} β + γ + \frac{| α |}{t} + 2 \sqrt{1 + β γ + \frac{α β}{t}}) ∥ \dot{x} (t) ∥ = ∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥ . \end{aligned}$

(vi) Since $crit H = {(x, y) \in R^{n} \times R^{n} : \nabla H (x, y) = (0, 0)} = {(\bar{x}, \bar{x}) \in R^{n} \times R^{n} : \bar{x} \in crit (g)}$ and (see (i)) $ω (u, v) = {(\bar{x}, \bar{x}) \in R^{n} \times R^{n} : \bar{x} \in ω (x)},$ by Theorem 3.1(iv) one obtains $ω (u, v) \subseteq crit (H) .$

Assume that x is bounded.

(vii) Since $\dot{x} (t) ⟶ 0, t ⟶ + \infty,$ we obtain that u and v are bounded, too. Thus, the set of limit points $ω (u, v)$ is nonempty. Furthermore, since $ω (u, v) = {(\bar{x}, \bar{x}) \in R^{n} \times R^{n} : \bar{x} \in ω (x)}$ and $ω (x)$ is bounded, it is enough to show that $ω (x)$ is closed.

Let be $({\bar{x}}_{n})_{n \geq 1} \subseteq ω (x)$ and assume that $lim_{n ⟶ + \infty} {\bar{x}}_{n} = x^{*} .$ We show that $x^{*} \in ω (x) .$ Obviously, for every $n \geq 1$ , there exists a sequence $t_{k}^{n} ⟶ + \infty, k ⟶ + \infty$ , such that $lim_{k ⟶ + \infty} x (t_{k}^{n}) = {\bar{x}}_{n} .$

Let be $ϵ > 0$ . Since $lim_{n ⟶ + \infty} {\bar{x}}_{n} = x^{*},$ there exists $N (ϵ) \in N$ such that for every $n \geq N (ϵ)$ it holds $∥ {\bar{x}}_{n} - x^{*} ∥ < \frac{ϵ}{2} .$ Let $n \geq 1$ be fixed. Since $lim_{k ⟶ + \infty} x (t_{k}^{n}) = {\bar{x}}_{n},$ there exists $k (n, ϵ) \in N$ such that for every $k \geq k (n, ϵ)$ it holds $∥ x (t_{k}^{n}) - {\bar{x}}_{n} ∥ < \frac{ϵ}{2} .$ Let be $k_{n} \geq k (n, ε)$ such that $t_{k_{n}}^{n} > n$ . Obviously $t_{k_{n}}^{n} ⟶ \infty$ as $n ⟶ + \infty$ and for every $n \geq N (ϵ)$ $∥ x (t_{k_{n}}^{n}) - x^{*} ∥ < ϵ .$ Hence, $lim_{n ⟶ + \infty} x (t_{k_{n}}^{n}) = x^{*},$ thus $x^{*} \in ω (x) .$

(viii) follows from the definition of the set $ω (u, v)$ .

Remark 3.1

Combining (iii) and (iv) in Lemma 3.3, it follows that for every $\bar{x} \in ω (x)$ and $t_{k} ⟶ + \infty$ such that $x (t_{k}) ⟶ \bar{x}$ as $k ⟶ + \infty$ we have $lim_{k ⟶ + \infty} H (u (t_{k}), v (t_{k})) = H (\bar{x}, \bar{x}) .$

The convergence of the trajectory generated by the dynamical system (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ) will be shown in the framework of functions satisfying the Kurdyka–Łojasiewicz property. For $η \in (0, + \infty]$ , we denote by $Θ_{η}$ the class of concave and continuous functions $ϕ : [0, η) \to [0, + \infty)$ such that $ϕ (0) = 0$ , ϕ is continuously differentiable on $(0, η)$ , continuous at 0 and $ϕ^{'} (s) > 0$ for all $s \in (0, η)$ .

Definition 3.1

Kurdyka–Łojasiewicz property

Let $f : R^{n} \to R$ be a differentiable function. We say that f satisfies the Kurdyka–Łojasiewicz (KL) property at $\bar{x} \in R^{n}$ if there exist $η \in (0, + \infty]$ , a neighbourhood U of $\bar{x}$ and a function $ϕ \in Θ_{η}$ such that for all x in the intersection $U \cap {x \in R^{n} : f (\bar{x}) < f (x) < f (\bar{x}) + η},$ the following inequality holds $ϕ^{'} (f (x) - f (\bar{x})) ∥ \nabla f (x)) ∥ \geq 1.$ If f satisfies the KL property at each point in $R^{n}$ , then f is called a KL function.

The origins of this notion go back to the pioneering work of Łojasiewicz [Citation19], where it is proved that for a real-analytic function $f : R^{n} \to R$ and a critical point $\bar{x} \in R^{n}$ (that is $\nabla f (\bar{x}) = 0$ ), there exists $θ \in [1 / 2, 1)$ such that the function $| f - f (\bar{x}) |^{θ} ∥ \nabla f ∥^{- 1}$ is bounded around $\bar{x}$ . This corresponds to the situation when $ϕ (s) = C (1 - θ)^{- 1} s^{1 - θ}$ . The result of Łojasiewicz allows the interpretation of the KL property as a re-parametrization of the function values in order to avoid flatness around the critical points. Kurdyka [Citation20] extended this property to differentiable functions definable in an o-minimal structure. Further extensions to the nonsmooth setting can be found in [Citation21–24].

To the class of KL functions belong semi-algebraic, real sub-analytic, semiconvex, uniformly convex and convex functions satisfying a growth condition. We refer the reader to [Citation21–27] and the references therein for more details regarding all the classes mentioned above and illustrating examples.

An important role in our convergence analysis will be played by the following uniformized KL property given in [Citation27, Lemma 6].

Lemma 3.4

Let $Ω \subseteq R^{n}$ be a compact set and let $f : R^{n} \to R$ be a differentiable function. Assume that f is constant on Ω and f satisfies the KL property at each point of Ω. Then there exist $ϵ, η > 0$ and $ϕ \in Θ_{η}$ such that for all $\bar{x} \in Ω$ and for all x in the intersection (13) ${x \in R^{n} : dist (x, Ω) < ε} \cap {x \in R^{n} : f (\bar{x}) < f (x) < f (\bar{x}) + η},$ (13) the following inequality holds (14) $ϕ^{'} (f (x) - f (\bar{x})) ∥ \nabla f (x) ∥ \geq 1.$ (14)

The following theorem is the main result of the paper and concerns the global asymptotic convergence of the trajectory generated by (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ).

Theorem 3.2

$\dot{x}, \ddot{x} \in L^{1} ([t_{0}, + \infty), R^{n});$
there exists $\bar{x} \in crit (g)$ such that $lim_{t ⟶ + \infty} x (t) = \bar{x} .$

Proof.

Case I. There exists $\bar{t} \geq t_{1}$ such that $H (u (\bar{t}), v (\bar{t})) = H (\tilde{x}, \tilde{x}) .$ From Lemma 3.3(ii), we have $\frac{d}{d t} H (u (t), v (t)) \leq A ∥ \ddot{x} (t) ∥^{2} + B (t) ∥ \dot{x} (t) ∥^{2} \leq 0 for almost every t \geq t_{1},$ hence $H (u (t), v (t)) \leq H (\tilde{x}, \tilde{x}) for every t \geq \bar{t} .$ On the other hand, $H (u (t), v (t)) \geq lim_{t ⟶ + \infty} H (u (t), v (t)) = H (\tilde{x}, \tilde{x}) for every t \geq t_{1},$ hence $H (u (t), v (t)) = H (\tilde{x}, \tilde{x}) for every t \geq \bar{t} .$ Hence, $(d / d t) H (u (t), v (t)) = 0$ , which leads to $0 \leq A ∥ \ddot{x} (t) ∥^{2} + B (t) ∥ \dot{x} (t) ∥^{2} \leq 0 for almost every t \geq \bar{t} .$ Since A<0 and $B (t) < 0$ for every $t \geq t_{1}$ , the latter inequality can hold only if $\dot{x} (t) = \ddot{x} (t) = 0 for almost every t \geq \bar{t} .$ Consequently, $\dot{x}, \ddot{x} \in L^{1} ([t_{0}, + \infty), R^{n})$ and x is constant on $[\bar{t}, + \infty) .$

Case II. We assume that $H (u (t), v (t)) > H (\tilde{x}, \tilde{x})$ holds for every $t \geq t_{1}$ . Let $Ω := ω (u, v) .$ According to Lemma 3.3, Ω is nonempty and compact and H is constant on Ω. Since H is a KL function, according to Lemma 3.4, there exist $ε, η > 0$ and $ϕ \in Θ_{η}$ such that for every $(\tilde{z}, \tilde{w})$ in the intersection ${(z, w) \in R^{n} \times R^{n} : dist ((z, w), Ω) < ε} \cap {(z, w) \in R^{n} \times R^{n} : H (\bar{x}, \bar{x}) < H (z, w) < H (\tilde{x}, \tilde{x}) + η}$ one has $ϕ^{'} (H (\tilde{z}, \tilde{w}) - H (\tilde{x}, \tilde{x})) ∥ \nabla H (\tilde{z}, \tilde{w}) ∥ \geq 1.$

Since $lim_{t ⟶ + \infty} dist (u (t), v (t), Ω) = 0$ , there exists $t_{2} \geq t_{1}$ such that $dist (u (t), v (t)), Ω) < ϵ$ for every $t \geq t_{2}$ . Since $lim_{t ⟶ + \infty} H (u (t), v (t)) = H (\bar{x}, \bar{x})$ , there exists $t_{3} \geq t_{1}$ such that $H (\bar{x}, \bar{x}) < H (u (t), v (t)) < H (\bar{x}, \bar{x}) + η$ for every $t \geq t_{3}$ . Hence, for every $t \geq T := max (t_{2}, t_{3})$ , we have $ϕ^{'} (H (u (t), v (t)) - H (\bar{x}, \bar{x})) \cdot ∥ \nabla H (u (t), v (t)) ∥ \geq 1.$

According to Lemma 3.3 (ii) and (v), we have $(d / d t) H (u (t), v (t)) \leq A ∥ \ddot{x} (t) ∥^{2} + B (t) ∥ \dot{x} (t) ∥^{2} \leq 0$ and $∥ \nabla H (u (t), v (t)) ∥ \leq ∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥,$ hence $\begin{aligned} \frac{d}{d t} ϕ (H (u (t), v (t)) - H (\tilde{x}, \tilde{x})) & = ϕ^{'} (H (u (t), v (t)) - H (\tilde{x}, \tilde{x})) \frac{d}{d t} H (u (t), v (t)) \\ \leq \frac{A ∥ \ddot{x} (t) ∥^{2} + B (t) ∥ \dot{x} (t) ∥^{2}}{∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥} \end{aligned}$ for almost every $t \in [T, + \infty) .$

By integrating on the interval $[T, \bar{T}]$ , for $\bar{T} > T$ , we obtain $ϕ (H (u (\bar{T}), v (\bar{T})) - H (\tilde{x}, \tilde{x})) - \int_{T}^{\bar{T}} \frac{A ∥ \ddot{x} (s) ∥^{2} + B (s) ∥ \dot{x} (s) ∥^{2}}{∥ \ddot{x} (s) ∥ + p (s) ∥ \dot{x} (s) ∥} d s \leq ϕ (H (u (T), v (T)) - H (\tilde{x}, \tilde{x})) .$

Since ϕ is bounded from below, $A < 0, B (s) < 0$ and $p (s) > 0$ for every $s \geq T$ and $\bar{T}$ was arbitrarily chosen, we obtain that $0 \leq \int_{T}^{+ \infty} \frac{- A ∥ \ddot{x} (s) ∥^{2} - B (s) ∥ \dot{x} (s) ∥^{2}}{∥ \ddot{x} (s) ∥ + p (s) ∥ \dot{x} (s) ∥} d s \leq ϕ (H (u (T), v (T)) - H (\bar{x}, \bar{x})),$ which leads to $t \mapsto \frac{∥ \ddot{x} (t) ∥^{2}}{∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥}, t \mapsto \frac{∥ \dot{x} (t) ∥^{2}}{∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥} \in L^{1} ([t_{0}, + \infty), R^{n})$ and further to $t \mapsto \frac{∥ \ddot{x} (t) ∥ ∥ \dot{x} (t) ∥}{∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥} \in L^{1} ([t_{0}, + \infty), R^{n}) .$ Since p is bounded from above on $[t_{0}, + \infty)$ and $∥ \dot{x} (t) ∥ + ∥ \ddot{x} (t) ∥ = \frac{∥ \ddot{x} (t) ∥^{2}}{∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥} + \frac{p (t) ∥ \dot{x} (t) ∥^{2}}{∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥} + \frac{(1 + p (t)) ∥ \ddot{x} (t) ∥ ∥ \dot{x} (t) ∥}{∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥},$ we obtain that $\dot{x}, \ddot{x} \in L^{1} ([t_{0}, + \infty), R^{n}) .$

Finally, since $\dot{x} \in L^{1} ([t_{0}, + \infty), R^{n})$ , the limit $lim_{t ⟶ + \infty} x (t)$ exists and it is finite. In conclusion, there exists $\bar{x} \in crit (g)$ such that $lim_{t ⟶ + \infty} x (t) = \bar{x} .$

Remark 3.2

According to Remark 2.1, there exists N>0 such that $∥ x^{(3)} (t) ∥ \leq N (∥ \ddot{x} (t) ∥ + ∥ \dot{x} (t) ∥)$ for almost every $t \geq t_{0},$ hence, under the hypotheses of Theorem 3.2, one has $x^{(3)} \in L^{1} ([t_{0}, + \infty), R^{n}) .$

Remark 3.3

Since the class of semi-algebraic functions is closed under addition [see, for example, [Citation27]] and $(x, y) \mapsto \frac{1}{2} ∥ x - y ∥^{2}$ is semi-algebraic, the conclusion of the previous theorem holds, if, instead of asking that H is a KL function, we ask that g is semi-algebraic.

Remark 3.4

Assume that g is coercive, that is $lim_{∥ u ∥ \to + \infty} g (u) = + \infty .$ For $u_{0}, v_{0} \in R^{n}$ , let $x \in C^{2} ([0, + \infty), R^{n})$ be the unique global solution of (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ). Then x is bounded.

Indeed, notice that g is bounded from below, being a continuous and coercive function [see [Citation28]]. From (Equation10(10) $\begin{aligned} g (β \dot{x} (T) + x (T)) + \frac{1}{2} (1 + β γ + \frac{α β}{T}) ∥ \dot{x} (T) ∥^{2} - \int_{t_{1}}^{T} A ∥ \ddot{x} (t) ∥^{2} d t - \int_{t_{1}}^{T} B (t) ∥ \dot{x} (t) ∥^{2} d t \\ \leq g (β \dot{x} (t_{1}) + x (t_{1})) + \frac{1}{2} (1 + β γ + \frac{α β}{t_{1}}) ∥ \dot{x} (t_{1}) ∥^{2} . \end{aligned}$ (10) ), it follows that $β \dot{x} (T) + x (T)$ is contained for every $T \geq t_{1}$ in a lower level set of g, which is bounded. According to Theorem 3.1, $β \dot{x} (t) ⟶ 0, t ⟶ + \infty$ , which implies that x is bounded.

4. Convergence rates

In this section, we will assume that the regularized function H satisfies the Lojasiewicz property, which, as noted in the previous section, corresponds to a particular choice of the desingularizing function ϕ [see [Citation19,Citation22,Citation25]].

Definition 4.1

Let $f : R^{n} ⟶ R$ be a differentiable function. The function f is said to fulfil the Łojasiewicz property, if for every $\bar{x} \in crit f$ there exist $K, ϵ > 0$ and $θ \in (0, 1)$ such that $| f (x) - f (\bar{x}) |^{θ} \leq K ∥ \nabla f (x) ∥ for every x fulfilling ∥ x - \bar{x} ∥ < ϵ .$ The number θ is called the Łojasiewicz exponent of f at the critical point $\bar{x} .$

In the following theorem, we provide convergence rates for the trajectory generated by (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ), its velocity and acceleration in terms of the Łojasiewicz exponent of H [see, also, [Citation22,Citation25]].

Theorem 4.1

Assume that g is bounded from below and, for $u_{0}, v_{0} \in R^{n},$ let x be the unique strong global solution of (Equation2(2) $\begin{aligned} \begin{aligned} \ddot{x} (t) + (\frac{α}{t} + γ) \dot{x} (t) + \nabla g (x (t)) = 0, \\ x (t_{0}) = u_{0}, \dot{x} (t_{0}) = v_{0}, \end{aligned} \end{aligned}$ (2) ). Suppose that x is bounded, let $\bar{x} \in crit (g)$ be such that $lim_{t ⟶ + \infty} x (t) = \bar{x}$ and suppose that $H : R^{n} \times R^{n} ⟶ R, H (x, y) = g (x) + \frac{1}{2} ∥ x - y ∥^{2}$ fulfils the Łojasiewicz property at $(\bar{x}, \bar{x}) \in crit H$ with Łojasiewicz exponent θ. Let be (see Remark 2.1) $N := max_{t \geq t_{0}} (L_{g} + \frac{| α |}{t^{2}}, γ + \frac{| α |}{t}) .$ Then, there exist $a_{1}, a_{2}, a_{3}, a_{4} > 0$ and T>0 such that for almost every $t \in [T, + \infty)$ , the following statements are true:

if $θ \in (0, \frac{1}{2}),$ then x converges in finite time;
if $θ = \frac{1}{2},$ then $∥ x (t) - \bar{x} ∥ \leq a_{1} e^{- a_{2} t},$ $∥ \dot{x} (t) ∥ \leq a_{1} e^{- a_{2} t}$ and $∥ \ddot{x} (t) ∥ \leq N a_{1} e^{- a_{2} t};$
if $θ \in (\frac{1}{2}, 1),$ then $∥ x (t) - \bar{x} ∥ \leq (a_{3} t + a_{4})^{- (1 - θ) / (2 θ - 1)},$ $∥ \dot{x} (t) ∥ \leq (a_{3} t + a_{4})^{- (1 - θ) / (2 θ - 1)}$ and $∥ \ddot{x} (t) ∥ \leq N (a_{3} t + a_{4})^{- (1 - θ) / (2 θ - 1)} .$

Proof.

Let be $0 < β < min (2 / L_{g}, (\sqrt{L_{g}^{2} + 2 γ L_{g}} - L_{g}) / L_{g})$ and $t_{1} > 0$ such that for every $t \geq t_{1}$ the inequalities (Equation9(9) $1 + β γ + \frac{α β}{t} > 0, - β + \frac{L_{g} β^{2}}{2} < 0 and - γ + L_{g} β + \frac{L_{g} β^{2}}{2} - \frac{α}{t} - \frac{α β}{2 t^{2}} < 0.$ (9) ) hold. We define for every $t \in [t_{1}, + \infty)$ $σ (t) := \int_{t}^{+ \infty} (∥ \dot{x} (s) ∥ + ∥ \ddot{x} (s) ∥) d s .$ Let $t \in [t_{1}, + \infty)$ be fixed. For $T \geq t$ , we have $∥ x (t) - \bar{x} ∥ = ∥x (T) - \bar{x} - \int_{t}^{T} \dot{x} (s) d s∥ \leq ∥ x (T) - \bar{x} ∥ + \int_{t}^{T} ∥ \dot{x} (s) ∥ d s .$ By taking the limit as $T ⟶ + \infty$ , we obtain (15) $∥ x (t) - \bar{x} ∥ \leq \int_{t}^{+ \infty} ∥ \dot{x} (s) ∥ d s \leq σ (t) .$ (15)

Furthermore, for T>t, we have $∥ \dot{x} (t) ∥ = ∥\dot{x} (T) - \int_{t}^{T} \ddot{x} (s) d s∥ \leq ∥ \dot{x} (T) ∥ + \int_{t}^{T} ∥ \ddot{x} (s) ∥ d s .$ By taking the limit as $T ⟶ + \infty$ , we obtain (16) $∥ \dot{x} (t) ∥ \leq \int_{t}^{+ \infty} ∥ \ddot{x} (s) ∥ d s \leq σ (t) .$ (16)

According to Remark 2.1, it holds $∥ x^{(3)} (t) ∥ \leq N (∥ \ddot{x} (t) ∥ + ∥ \dot{x} (t) ∥)$ for almost every $t \geq t_{1}$ , $\begin{aligned} ∥ \ddot{x} (t) ∥ & = ∥\ddot{x} (T) - \int_{t}^{T} x^{(3)} (s) d s∥ \leq ∥ \ddot{x} (T) ∥ + \int_{t}^{T} ∥ x^{(3)} (s) ∥ d s \leq ∥ \ddot{x} (T) ∥ \\ + N \int_{t}^{T} (∥ \ddot{x} (s) ∥ + ∥ \dot{x} (s) ∥) d s \forall T > t . \end{aligned}$ By taking the limit as $T ⟶ + \infty$ , we obtain (17) $∥ \ddot{x} (t) ∥ \leq N σ (t) .$ (17) Next, we show that there exists m<0 such that (18) $\frac{A ∥ \ddot{x} (t) ∥^{2} + B (t) ∥ \dot{x} (t) ∥^{2}}{∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥} \leq m (∥ \dot{x} (t) ∥ + ∥ \ddot{x} (t) ∥) .$ (18)

Indeed, $\begin{aligned} (∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥) (∥ \dot{x} (t) ∥ + ∥ \ddot{x} (t) ∥) = ∥ \ddot{x} (t) ∥^{2} + (1 + p (t)) ∥ \dot{x} (t) ∥ ∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥^{2} \\ \leq (\frac{3}{2} + \frac{p (t)}{2}) ∥ \ddot{x} (t) ∥^{2} + (\frac{1}{2} + \frac{3 p (t)}{2}) ∥ \dot{x} (t) ∥^{2} \leq \frac{A}{m} ∥ \ddot{x} (t) ∥^{2} + \frac{B (t)}{m} ∥ \dot{x} (t) ∥^{2}, \end{aligned}$ where $m := max (max_{t \geq t_{1}} \frac{A}{3 / 2 + p (t) / 2}, max_{t \geq t_{1}} \frac{B (t)}{(3 / 2) p (t) + 1 / 2}) .$ It is an easy verification that $m < 0.$

As we have seen in the proof of Theorem 3.2, if there exists $\bar{t} \geq t_{1}$ such that $H (u (\bar{t}), v (\bar{t})) = H (\bar{x}, \bar{x})$ , then x is constant on $[\bar{t}, + \infty)$ and the conclusion follows.

On the other hand, if for every $t \geq t_{1}$ one has that $H (u (t), v (t)) > H (\bar{x}, \bar{x})$ , then according to the proof of Theorem 3.2, there exist $ϵ > 0$ and $T \geq t_{1}$ such that $∥ (u (t), v (t)) - (\bar{x}, \bar{x}) ∥ < ϵ$ and $\frac{d}{d t} (H (u (t), v (t)) - H (\bar{x}, \bar{x}))^{1 - θ} \leq \frac{A ∥ \ddot{x} (t) ∥^{2} + B (t) ∥ \dot{x} (t) ∥^{2}}{∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥} for almost every t \geq T .$ Busing (Equation18(18) $\frac{A ∥ \ddot{x} (t) ∥^{2} + B (t) ∥ \dot{x} (t) ∥^{2}}{∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥} \leq m (∥ \dot{x} (t) ∥ + ∥ \ddot{x} (t) ∥) .$ (18) ), we obtain that $M (∥ \dot{x} (t) ∥ + ∥ \ddot{x} (t) ∥) + \frac{d}{d t} (H (u (t), v (t)) - H (\bar{x}, \bar{x}))^{1 - θ} \leq 0 for almost every t \geq T,$ where $M := - m > 0.$

For $t \geq T$ , we integrate the last relation on the interval $[t, \tilde{T}]$ , where $\tilde{T} > t$ , which yields $M \int_{t}^{\tilde{T}} (∥ \dot{x} (s) ∥ + ∥ \ddot{x} (s) ∥) d s + (H (u (\tilde{T}), v (\tilde{T})) - H (\bar{x}, \bar{x}))^{1 - θ} \leq (H (u (t), v (t)) - H (\bar{x}, \bar{x}))^{1 - θ} .$ By taking the limits as $\tilde{T} ⟶ + \infty$ , we get $M σ (t) \leq (H (u (t), v (t)) - H (\bar{x}, \bar{x}))^{1 - θ} .$ On the other hand, according to the KL property for H and Lemma 3.3 (v), we have $(H (u (t), v (t)) - H (\bar{x}, \bar{x}))^{θ} \leq K ∥ \nabla H (u (t), v (t)) ∥ \leq K (∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥) for almost every t \geq T,$ hence $M σ (t) \leq K^{(1 - θ) / θ} (∥ \ddot{x} (t) ∥ + p (t) ∥ \dot{x} (t) ∥)^{(1 - θ) / θ} for almost every t \geq T .$ By denoting $a := max_{t \geq T} p (t) \in (0, + \infty)$ , one can easily see that a>1 and so $M σ (t) \leq (a K)^{(1 - θ) / θ} (∥ \dot{x} (t) ∥ + ∥ \ddot{x} (t) ∥)^{(1 - θ) / θ} for almost every t \geq T .$ Taking into account that $∥ \dot{x} (t) ∥ + ∥ \ddot{x} (t) ∥ = - \dot{σ} (t)$ , the previous inequality is nothing else than (19) $- c σ^{θ / (1 - θ)} (t) \geq \dot{σ} (t) for almost every t \geq T,$ (19) where $c := M^{θ / (1 - θ)} / a K > 0.$

If $θ = \frac{1}{2}$ , then (Equation19(19) $- c σ^{θ / (1 - θ)} (t) \geq \dot{σ} (t) for almost every t \geq T,$ (19) ) becomes $c σ (t) + \dot{σ} (t) \leq 0 for almost every t \geq T .$ By multiplying with $e^{c t}$ and integrating on $[T, t]$ , it follows that there exists $a_{1} > 0$ such that $σ (t) \leq a_{1} e^{- a_{2} t} for every t \geq T,$ where $a_{2} = c .$ Using (Equation15(15) $∥ x (t) - \bar{x} ∥ \leq \int_{t}^{+ \infty} ∥ \dot{x} (s) ∥ d s \leq σ (t) .$ (15) )–(Equation17(17) $∥ \ddot{x} (t) ∥ \leq N σ (t) .$ (17) ), we obtain $∥ x (t) - \bar{x} ∥ \leq a_{1} e^{- a_{2} t}, ∥ \dot{x} (t) ∥ \leq a_{1} e^{- a_{2} t} and ∥ \ddot{x} (t) ∥ \leq N a_{1} e^{- a_{2} t} for every t \geq T,$ which proves (b).

Assume now that $0 < θ < \frac{1}{2}$ . In this case, (Equation19(19) $- c σ^{θ / (1 - θ)} (t) \geq \dot{σ} (t) for almost every t \geq T,$ (19) ) leads to $\frac{d}{d t} σ^{(1 - 2 θ) / (1 - θ)} (t) = \frac{1 - 2 θ}{1 - θ} σ^{- θ / (1 - θ)} (t) \dot{σ} (t) \leq - c \frac{1 - 2 θ}{1 - θ} for almost every t \geq T .$ By integrating on $[T, t]$ we obtain $σ^{(1 - 2 θ) / (1 - θ)} (t) \leq - \bar{α} t + \bar{β}, for every t \geq T,$ where $\bar{α} > 0.$ Then there exists $\hat{T} \geq T$ such that $σ (t) \leq 0$ for every $t \geq \hat{T}$ , thus, x is constant on $[\hat{T}, + \infty)$ and (a) follows.

Assume now that $\frac{1}{2} < θ < 1.$ In this case, (Equation19(19) $- c σ^{θ / (1 - θ)} (t) \geq \dot{σ} (t) for almost every t \geq T,$ (19) ) leads to $\frac{d}{d t} σ^{(1 - 2 θ) / (1 - θ)} (t) = \frac{1 - 2 θ}{1 - θ} σ^{- θ / (1 - θ)} (t) \dot{σ} (t) \geq c \frac{2 θ - 1}{1 - θ} for almost every t \geq T .$ By integrating on $[T, t]$ we obtain $σ^{(1 - 2 θ) / (1 - θ)} (t) \geq a_{3} t + a_{4} for every t \geq T,$ where $a_{3}, a_{4} > 0,$ or, equivalently, $σ (t) \leq (a_{3} t + a_{4})^{- (1 - θ) / (2 θ - 1)} for every t \geq T .$ Using again (Equation15(15) $∥ x (t) - \bar{x} ∥ \leq \int_{t}^{+ \infty} ∥ \dot{x} (s) ∥ d s \leq σ (t) .$ (15) )–(Equation17(17) $∥ \ddot{x} (t) ∥ \leq N σ (t) .$ (17) ), we obtain $\begin{aligned} ∥ x (t) - \bar{x} ∥ \leq (a_{3} t + a_{4})^{- (1 - θ) / (2 θ - 1)}, ∥ \dot{x} (t) ∥ \leq (a_{3} t + a_{4})^{- (1 - θ) / (2 θ - 1)} and \\ ∥ \ddot{x} (t) ∥ \leq N (a_{3} t + a_{4})^{- (1 - θ) / (2 θ - 1)} for every t \geq T, \end{aligned}$ which proves (c).

Acknowledgements

The authors are thankful to an anonymous reviewer for comments and remarks which were helpful to gain a better insight into the asymptotic behaviour of the trajectories of the studied dynamical system.

Disclosure statement

No potential conflict of interest was reported by the authors.

ORCID

Radu Ioan Bo http://orcid.org/0000-0002-4469-314X

Additional information

Funding

Radu Ioan Bo's research was partially supported by FWF (Austrian Science Fund) [project I 2419-N32] ; Ernö Robert Csetnek's research was supported by FWF (Austrian Science Fund) [project P 29809-N32] and Szilárd Csaba László's research was supported by a grant of Ministry of Research and Innovation - Unitatea Executiva pentru Finantarea Invatamantului Superior, a Cercetarii, Dezvoltarii si Inovarii (CNCS-UEFISCDI) [project number PN-III-P1-1.1-TE-2016-0266] within PNCDI III.

References

Su W, Boyd S, Candes EJ. A differential equation for modeling Nesterov's accelerated gradient method: theory and insights. J Mach Learn Res. 2016;17:1–43.
Web of Science ®Google Scholar
Nesterov YE. A method for solving the convex programming problem with convergence rate O(1/k2). (Russian) Dokl Akad Nauk SSSR. 1983;269(3):543–547.
Web of Science ®Google Scholar
Attouch H, Chbani Z, Peypouquet J, et al. Fast convergence of inertial dynamics and algorithms with asymptotic vanishing viscosity. Math Program. 2018;168(1–2, Ser. B):123–175. doi: 10.1007/s10107-016-0992-8
Web of Science ®Google Scholar
Attouch H, Peypouquet J, Redont P. Fast convex optimization via inertial dynamics with Hessian driven damping. J Differ Equat. 2016;261(10):5734–5783. doi: 10.1016/j.jde.2016.08.020
Web of Science ®Google Scholar
Chambolle A, Dossal C. On the convergence of the iterates of the “fast iterative shrinkage/thresholding algorithm”. J Optim Theory Appl. 2015;166(3):968–982. doi: 10.1007/s10957-015-0746-4
Web of Science ®Google Scholar
Beck A, Teboulle M. A fast iterative shrinkage- thresholding algorithm for linear inverse problems. SIAM J Imag Sci. 2009;2(1):183–202. doi: 10.1137/080716542
Web of Science ®Google Scholar
Attouch H, Chbani Z, Riahi H. Rate of convergence of the Nesterov accelerated gradient method in the subcritical case α≤3. 2017. arXiv:1706.05671.
Google Scholar
Haraux A, Jendoubi MA. Asymptotics for a second order differential equation with a linear, slowly time-decaying damping term. Evol Equat Contr Theor. 2013;2(3):461–470. doi: 10.3934/eect.2013.2.461
Web of Science ®Google Scholar
Balti M. Asymptotic behavior for second-order differential equations with nonlinear slowly time-decaying damping and integrable source. Electron J Differ Equat. 2015;302:11pp.
Google Scholar
Bégout P, Bolte J, Jendoubi MA. On damped second-order gradient systems. J Differ Equat. 2015;259:3115–3143. doi: 10.1016/j.jde.2015.04.016
Web of Science ®Google Scholar
Haraux A, Jendoubi MA. Convergence of solutions of second-order gradient-like systems with analytic nonliniearities. J Differ Equat. 1998;144(2):313–320. doi: 10.1006/jdeq.1997.3393
Web of Science ®Google Scholar
Chill R, Jendoubi MA. Asymptotics for a second order differential equation with a linear, slowly time-decaying damping term. Evol Equat Contr Theor. 2013;2(3):461–470. doi: 10.3934/eect.2013.2.461
Web of Science ®Google Scholar
Boţ RI, Csetnek ER, László SC. Approaching nonsmooth nonconvex minimization through second-order proximal-gradient dynamical systems. J Evol Equat. 2018. doi:10.1007/s00028-018-0441-7.
Web of Science ®Google Scholar
Abbas B, Attouch H, Svaiter BF. Newton-like dynamics and forward-backward methods for structured monotone inclusions in Hilbert spaces. J Optim Theory Appl. 2014;161(2):331–360. doi: 10.1007/s10957-013-0414-5
Web of Science ®Google Scholar
Attouch H, Svaiter BF. A continuous dynamical Newton-like approach to solving monotone inclusions. SIAM J Control Optim. 2011;49(2):574–598. doi: 10.1137/100784114
Web of Science ®Google Scholar
Haraux A. Systèmes Dynamiques Dissipatifs et Applications, Recherches en Mathématiques Appliquées 17, Masson, Paris, 1991.
Google Scholar
Sontag ED. Mathematical control theory. Deterministic finite-dimensional systems. 2nd ed. New York: Springer-Verlag; 1998. (Texts in Applied Mathematics 6).
Google Scholar
Alvarez F, Attouch H, Bolte J, et al. A second-order gradient-like dissipative dynamical system with Hessian-driven damping. Application to optimization and mechanics. J Math Pures et Appl. 2002;81(8):747–779. doi: 10.1016/S0021-7824(01)01253-3
Web of Science ®Google Scholar
Łojasiewicz S. Une propriété topologique des sous-ensembles analytiques réels, Les Équations aux Dérivées Partielles, Éditions du Centre National de la Recherche Scientifique Paris, 87–89, 1963.
Google Scholar
Kuryka K. On gradients of functions definable in o-minimal structures. Ann l Fourier (Grenoble). 1998;48(3):769–783. doi: 10.5802/aif.1638
Web of Science ®Google Scholar
Attouch H, Bolte J, Redont P, et al. Proximal alternating minimization and projection methods for nonconvex problems: an approach based on the Kurdyka–Łojasiewicz inequality. Math Oper Res. 2010;35(2):438–457. doi: 10.1287/moor.1100.0449
Web of Science ®Google Scholar
Bolte J, Daniilidis A, Lewis A. The Łojasiewicz inequality for nonsmooth subanalytic functions with applications to subgradient dynamical systems. SIAM J Optim. 2006;17(4):1205–1223. doi: 10.1137/050644641
Web of Science ®Google Scholar
Bolte J, Daniilidis A, Lewis A, et al. Clarke subgradients of stratifiable functions. SIAM J Optim. 2007;18(2):556–572. doi: 10.1137/060670080
Web of Science ®Google Scholar
Bolte J, Daniilidis A, Ley O, et al. Characterizations of Łojasiewicz inequalities: subgradient flows, talweg, convexity. Trans Am Math Soc. 2010;362(6):3319–3363. doi: 10.1090/S0002-9947-09-05048-X
Web of Science ®Google Scholar
Attouch H, Bolte J. On the convergence of the proximal algorithm for nonsmooth functions involving analytic features. Math Program. 2009;116(1–2, Series B):5–16. doi: 10.1007/s10107-007-0133-5
Web of Science ®Google Scholar
Attouch H, Bolte J, Svaiter BF. Convergence of descent methods for semi-algebraic and tame problems: proximal algorithms, forward-backward splitting, and regularized Gauss-Seidel methods. Math Program. 2013;137(1–2, Series A):91–129. doi: 10.1007/s10107-011-0484-9
Web of Science ®Google Scholar
Bolte J, Sabach S, Teboulle M. Proximal alternating linearized minimization for nonconvex and nonsmooth problems. Math Program Ser A. 2014;146(1–2):459–494. doi: 10.1007/s10107-013-0701-9
Web of Science ®Google Scholar
Rockafellar RT, Wets RJ-B. Variational analysis, fundamental principles of mathematical sciences, 317. Berlin: Springer-Verlag; 1998.
Google Scholar

A second-order dynamical approach with variable damping to nonconvex smooth minimization

ABSTRACT

1. Introduction

2. Existence and uniqueness of the trajectory

3. Convergence of trajectories

Kurdyka–Łojasiewicz property

4. Convergence rates

Acknowledgements

Disclosure statement

References

Information for

Open access

Opportunities

Help and information

A second-order dynamical approach with variable damping to nonconvex smooth minimization

ABSTRACT

1. Introduction

2. Existence and uniqueness of the trajectory

3. Convergence of trajectories

Kurdyka–Łojasiewicz property

4. Convergence rates

Acknowledgements

Disclosure statement

ORCID

Additional information

Funding

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date