Research Article

Sampled visual feedback pose estimation and regulation based on camera frame rates

Tatsuya Ibuki, Satoshi Nakano, Shunsuke Shigaki & Takeshi Hatanaka
Pages 297-309 | Received 27 Jan 2023, Accepted 07 Aug 2023, Published online: 20 Aug 2023

ABSTRACT

This paper presents visual feedback 3D pose estimation/regulation methodologies in the sampled-data setting induced by camera frame rates. Vision-based estimation/control problems have been studied by a number of research groups. While most works focus on the limitation of measured outputs, they conduct convergence/performance analysis under the assumption that visual measurements extracted from a camera are continuously available. However, the camera frame rate, including image processing time, often cannot be neglected compared with other computation times. In view of this fact, this paper newly proposes visual feedback pose estimation/regulation techniques under the situation that visual measurements are sampled due to the frame rates. The problem settings are first provided. Then, the pose estimation/regulation methods with sampled visual measurements are proposed. The convergence/performance analysis is conducted by fusing a Lyapunov-based approach and an event-triggered control technique. The present analysis scheme provides guidelines for the design of estimation/control gains guaranteeing desired convergence/performance. The effectiveness of the present technique is verified via simulation and an experiment with real hardware.

1. Introduction

Vision sensors have been widely leveraged for situation recognition since they provide rich 2D information projected from 3D relative states [Citation1]. Thanks to this utility, vision sensors are also utilized in the robotics and control engineering communities to develop vision-based autonomous control methods [Citation2–12]. In vision-based control, one of the main issues is how to deal with 2D visual information to estimate/control a 3D pose (position and orientation) of a target object (relative to a camera) [Citation5,Citation10–12]. On the other hand, compared with other computation times, the sampling time for extracting desired information from the image, i.e. the camera frame rate, might not be small enough to be neglected, especially when complex image processing algorithms are employed. Nonetheless, most research studies, including the references above, focus only on the limitation of measured outputs for estimation/control under the premise that visual information is continuously available.

In view of this fact, this paper investigates visual feedback 3D pose estimation/regulation problems in the sampled data setting. Here, the objectives are (i) to estimate the 3D target object pose relative to the camera, and (ii) to drive the relative pose to a desired one, by using only sampled 2D visual information extracted from a monocular camera with a certain frame rate. To achieve these goals, we introduce a visual feedback 3D pose estimation mechanism, called a visual motion observer, presented by [Citation13,Citation14] and also utilized for some extended estimation/control objectives [Citation15–17]. In the visual motion observer, a passivity property of rigid body motion plays a central role and estimation/control performance is analysed through a Lyapunov-based approach. However, similarly to the references above, the observer assumes continuous measurements of visual information. This assumption leads to the convergence/performance analysis allowing any large positive estimation/control gains, but high gain estimation and its control application often do not work in real experiments with sampled measurements. Therefore, this paper extends the visual motion observer to that with sampled visual measurements.

Since the observer input is formed by the visual measurements, the input is also sampled in the case of sampled measurements. For conducting convergence/performance analysis even in the sampled data setting, this paper employs an event-triggered control technique [Citation18–20]. The merit of event-triggered control is the possibility to reduce the number of re-computing inputs and transmissions (i.e. events) while guaranteeing desired performance. Specifically, inter-event time is investigated to guarantee a monotonic decrease of Lyapunov-like functions for convergence. This technique is suited to the visual motion observer because its convergence/performance analysis is also based on non-increasing properties of Lyapunov-like functions. As a result of the event-triggered approach, we can obtain gain limitations for desired convergence/performance based on given certain camera frame rates.

In summary, the main contribution of this work is to propose a novel vision-based pose observer and its control application in the sampled data setting by camera frame rates. We first provide problem settings: a 3D relative motion model between a camera and a target object; a visual measurement model; and definitions of the estimation and control objectives. Then, we newly propose a sampled visual motion observer and give convergence analysis for the stationary target object case and tracking analysis for the moving object case. A pose regulation method based on the proposed sampled visual motion observer is next proposed, and the regulation/tracking analysis is also conducted for the stationary/moving object. Here, we provide the relationship between the frame rates and estimation/control gains to achieve desired convergence/performance. Specifically, we show that estimation and control errors are ultimately bounded by a function of the camera frame rate, estimation/control gains and target object velocity. This analysis provides us with the guidelines for gain settings. The effectiveness of the proposed methods is demonstrated via simulation and an experiment with real hardware.

The conference versions of this paper are reported in [Citation21,Citation22]. While [Citation21] considers only estimation problems, this work also tackles pose regulation problems. Compared with [Citation22], we greatly reduce the conservativeness of the gain condition by modifying the control law structure, and we newly carry out an experimental demonstration with real hardware.

2. Problem settings

This section formulates 3D visual feedback pose estimation/regulation problems consisting of two rigid bodies (a camera robot and a target object robot) and visual measurements by a monocular camera.

2.1. Relative rigid body motion

Throughout this paper, we consider the visual feedback system shown in Figure 1. Let the world frame, camera frame, and object frame be $\Sigma_w$, $\Sigma_c$, and $\Sigma_o$, respectively (see Figure 1(a)). Then, the pose of the origin of $\Sigma_c$ relative to $\Sigma_w$ is represented by $g_{wc}=(p_{wc},e^{\hat\xi_{wc}\theta_{wc}})\in SE(3)$, where $SE(3):=\mathbb{R}^3\times SO(3)$ and $SO(3):=\{R\in\mathbb{R}^{3\times3}\mid R^TR=I_3,\ \det R=1\}$. ($I_n\in\mathbb{R}^{n\times n}$ is the $n$-dimensional identity matrix.) The orientation is represented by the exponential coordinates of the rotation matrix with the unit axis $\xi_{wc}\in\mathbb{R}^3$ ($\xi_{wc}^T\xi_{wc}=1$) and the angle $\theta_{wc}\in(-\pi,\pi]$. The operator $\wedge:\mathbb{R}^3\to so(3)$, $so(3):=\{S\in\mathbb{R}^{3\times3}\mid S^T=-S\}$, provides $\hat a b=a\times b$, $a,b\in\mathbb{R}^3$, for the vector cross product $\times$, and $\vee:so(3)\to\mathbb{R}^3$ is its inverse operator. For ease of representation, $\xi_{wc}\theta_{wc}$ is simply written as $\xi\theta_{wc}$ in this paper. Similarly, the pose of $\Sigma_o$ relative to $\Sigma_w$ is represented by $g_{wo}=(p_{wo},e^{\hat\xi\theta_{wo}})\in SE(3)$.
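As a quick numerical reference for these representations, the following minimal sketch (not part of the original formulation; the NumPy-based function names are ours) builds a rotation matrix from the exponential coordinates via Rodrigues' formula and assembles the homogeneous form of a pose:

```python
import numpy as np

def hat(a):
    """Wedge operator ^: R^3 -> so(3), so that hat(a) @ b equals np.cross(a, b)."""
    return np.array([[0.0, -a[2], a[1]],
                     [a[2], 0.0, -a[0]],
                     [-a[1], a[0], 0.0]])

def exp_so3(xi, theta):
    """Rotation matrix e^{xi^ theta} from the unit axis xi and angle theta (Rodrigues' formula)."""
    K = hat(xi)
    return np.eye(3) + np.sin(theta) * K + (1.0 - np.cos(theta)) * K @ K

def homogeneous(p, R):
    """Pose g = (p, e^{xi^ theta}) in SE(3) written as a 4x4 homogeneous matrix."""
    g = np.eye(4)
    g[:3, :3] = R
    g[:3, 3] = p
    return g

# example: pose of the object frame relative to the world frame
R_wo = exp_so3(np.array([0.0, 1.0, 0.0]), np.pi / 6)
g_wo = homogeneous(np.array([1.0, 0.0, 2.0]), R_wo)
```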

Figure 1. Visual feedback system. (a) Coordinate frames and (b) Perspective projection model.


Let us next introduce the body velocity of $\Sigma_c$ relative to $\Sigma_w$ as $V_{wc}^b=[v_{wc}^T\ \omega_{wc}^T]^T\in\mathbb{R}^6$ and that of $\Sigma_o$ as $V_{wo}^b=[v_{wo}^T\ \omega_{wo}^T]^T\in\mathbb{R}^6$. Here, $v\in\mathbb{R}^3$ and $\omega\in\mathbb{R}^3$ are the body translational and angular velocities, respectively. The pose $g$ and the body velocity $V^b$ can also be written in the homogeneous representation form as follows:
\[
g=\begin{bmatrix} e^{\hat\xi\theta} & p\\ O_{1\times3} & 1\end{bmatrix}\in\mathbb{R}^{4\times4},\qquad
\hat V^b=\begin{bmatrix} \hat\omega & v\\ O_{1\times3} & 0\end{bmatrix}\in\mathbb{R}^{4\times4}.
\]
($O_{m\times n}\in\mathbb{R}^{m\times n}$ represents the $m\times n$ zero matrix.) Notice here that another definition of $\wedge$ is used for the 6D vector $V^b$. Then, $V_{wc}^b$ and $V_{wo}^b$ are respectively given by $\hat V_{wc}^b=g_{wc}^{-1}\dot g_{wc}$ and $\hat V_{wo}^b=g_{wo}^{-1}\dot g_{wo}$ [Citation1].

Similarly, the pose of $\Sigma_o$ relative to $\Sigma_c$ and its body velocity are denoted by $g_{co}=(p_{co},e^{\hat\xi\theta_{co}})\in SE(3)$ and $V_{co}^b:=(g_{co}^{-1}\dot g_{co})^\vee\in\mathbb{R}^6$, respectively. Then, $g_{co}=g_{wc}^{-1}g_{wo}$ holds, and thus the following relative rigid body motion is obtained from $\hat V_{co}^b=g_{co}^{-1}\dot g_{co}$:
\[
\dot g_{co}=-\hat V_{wc}^b g_{co}+g_{co}\hat V_{wo}^b. \tag{1}
\]
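For illustration only, the relative rigid body motion (1) can be stepped forward numerically as sketched below, reusing the hat helper from the sketch above (a plain forward-Euler step; in practice one would re-project onto SE(3) or use the matrix exponential to avoid drift):

```python
import numpy as np

def hat6(V):
    """^ operator for a body velocity V = [v; w] in R^6, giving the 4x4 matrix [[w^, v], [0, 0]]."""
    v, w = V[:3], V[3:]
    M = np.zeros((4, 4))
    M[:3, :3] = hat(w)
    M[:3, 3] = v
    return M

def step_relative_motion(g_co, V_wc, V_wo, dt):
    """One forward-Euler step of the relative rigid body motion (1)."""
    g_dot = -hat6(V_wc) @ g_co + g_co @ hat6(V_wo)
    return g_co + dt * g_dot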

2.2. Visual measurements

2D visual measurements extracted from a monocular camera are introduced as the measured outputs for 3D pose estimation/regulation laws. Although the extraction by perspective projection is only introduced in this paper, it can be easily extended to the panoramic camera case as in [Citation15].

Consider the target object with $m\ (\geq 4)$ feature points. The positions of the feature points relative to the object frame $\Sigma_o$ are represented by $p_{oi}\in\mathbb{R}^3$, $i\in\{1,\dots,m\}$. Then, these positions relative to the camera frame $\Sigma_c$ are given by $p_{ci}=g_{co}p_{oi}$ from the coordinate transformation (Footnote 1). We next denote the $m$ feature points on the 2D image plane by $f=[f_1^T\ \cdots\ f_m^T]^T\in\mathbb{R}^{2m}$. Then, the well-known perspective projection [Citation1] yields the following relationship for each $f_i\in\mathbb{R}^2$ (see Figure 1(b)):
\[
f_i=\frac{\sigma}{z_{ci}}\begin{bmatrix}x_{ci}\\ y_{ci}\end{bmatrix},\qquad p_{ci}=[x_{ci}\ y_{ci}\ z_{ci}]^T,\qquad i\in\{1,\dots,m\}. \tag{2}
\]
Here, $\sigma>0$ is the focal length of the camera.
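A minimal sketch of the measurement model (2), assuming the illustrative NumPy helpers introduced above, is:

```python
import numpy as np

def project_features(g_co, p_o, sigma):
    """Perspective projection (2): map feature points given in the object frame to the image plane.

    p_o is an (m, 3) array of feature positions in Sigma_o; returns (m, 2) image coordinates."""
    p_o_h = np.hstack([p_o, np.ones((p_o.shape[0], 1))])   # homogeneous coordinates [p_oi; 1]
    p_c = (g_co @ p_o_h.T).T[:, :3]                        # positions p_ci in the camera frame
    return sigma * p_c[:, :2] / p_c[:, 2:3]                # (sigma / z_ci) * [x_ci, y_ci]
```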

This paper considers the situation where only the visual measurements $f$ are available to the pose estimation/regulation laws, and supposes that the positions of the feature points in $\Sigma_o$ (i.e. $p_{oi}$) are known a priori. Then, the visual measurements are given by a function of the relative pose $g_{co}$ only, i.e. $f(g_{co})$. Figure 2 illustrates the block diagram of the relative rigid body motion (1) with the perspective projection (2).

Figure 2. Block diagram of relative rigid body motion (RRBM) with camera model.


2.3. Research objectives

For the present visual feedback system, the objectives of this paper are

  1. to propose a pose estimation mechanism for the relative pose $g_{co}$;

  2. to propose a pose regulation mechanism to drive $g_{co}$ to the fixed desired pose $g_d=(p_d,e^{\hat\xi\theta_d})\in SE(3)$ (see Figure 1(a));

under the situation that only the visual measurements $f(g_{co})$ are available and the camera frame rate, including the image processing time, is non-negligible.

3. Sampled visual feedback pose estimation

We first propose a sampled visual feedback pose estimation mechanism taking account of camera frame rates for the objective (i).

3.1. Estimation error system

Since only the 2D visual measurements (2) are available, we consider the estimation of the 3D relative pose $g_{co}$ by a nonlinear observer. The estimate of $g_{co}$ is represented by $\bar g_{co}=(\bar p_{co},e^{\hat{\bar\xi}\bar\theta_{co}})\in SE(3)$. Similarly to the Luenberger-type observer [Citation23], we build the copy model of the relative rigid body motion (1) as follows:
\[
\dot{\bar g}_{co}=-\hat V_{wc}^b\bar g_{co}+\bar g_{co}\hat u_e. \tag{3}
\]
Here, $u_e=[u_{ep}^T\ u_{eR}^T]^T\in\mathbb{R}^6$ is the observer input for the estimation of $g_{co}$. We note that the model (3) does not include the target object velocity $V_{wo}^b$, because it is unavailable (Footnote 2). Notice also that the estimated visual measurements $\bar f\in\mathbb{R}^{2m}$ can be computed from $\bar g_{co}$ and (2).

Let us define the estimation error $g_{ee}=(p_{ee},e^{\hat\xi\theta_{ee}})\in SE(3)$ between $g_{co}$ and $\bar g_{co}$ and its vector form $e_e\in\mathbb{R}^6$ as follows:
\[
g_{ee}:=\bar g_{co}^{-1}g_{co},\qquad e_e:=\begin{bmatrix}p_{ee}\\ \mathrm{sk}(e^{\hat\xi\theta_{ee}})^\vee\end{bmatrix}.
\]
Here, $\mathrm{sk}(e^{\hat\xi\theta}):=\frac{1}{2}\bigl(e^{\hat\xi\theta}-e^{-\hat\xi\theta}\bigr)\in so(3)$. The estimation error vector has the important property that, for $\theta_{ee}\in(-\pi,\pi)$, $e_e=0$ holds (Footnote 3) if and only if $g_{ee}=I_4$, which is equivalent to $\bar g_{co}=g_{co}$, i.e. objective (i) is achieved. It should also be noted that $e_e$ can be approximately reconstructed from the measurement error $f_e:=f-\bar f\in\mathbb{R}^{2m}$ (refer to [Citation13] for the details). Then, the time derivative of $g_{ee}$ along the trajectories of (1) and (3) yields the following estimation error system:
\[
\dot g_{ee}=-\hat u_e g_{ee}+g_{ee}\hat V_{wo}^b. \tag{4}
\]
This is also given in the vector form with the adjoint transformation $\mathrm{Ad}_{(g)}\in\mathbb{R}^{6\times6}$ as
\[
V_{ee}^b:=(g_{ee}^{-1}\dot g_{ee})^\vee=-\mathrm{Ad}_{(g_{ee}^{-1})}u_e+V_{wo}^b,\qquad
\mathrm{Ad}_{(g)}:=\begin{bmatrix}e^{\hat\xi\theta} & \hat p\,e^{\hat\xi\theta}\\ O_{3\times3} & e^{\hat\xi\theta}\end{bmatrix}. \tag{5}
\]
Then, it is shown in [Citation13] that if $V_{wo}^b=0$, the estimation error system (4) has a passivity-like property from the input $u_e$ to the output $-e_e$ with the storage function $U_e\geq0$, i.e. $\dot U_e=u_e^T(-e_e)$ holds. Here,
\[
U_e:=\frac{1}{2}\|p_{ee}\|^2+\phi(e^{\hat\xi\theta_{ee}}),\qquad
\phi(e^{\hat\xi\theta_{ee}}):=\frac{1}{4}\|I_3-e^{\hat\xi\theta_{ee}}\|_F^2=\frac{1}{2}\mathrm{tr}(I_3-e^{\hat\xi\theta_{ee}})\geq0,
\]
and notice that $U_e=0$ is equivalent to $e_e=0$, that is, the pose estimation (objective (i)) is achieved. ($\|\cdot\|_F$ represents the Frobenius norm.) Based on this property, Fujita et al. [Citation13] propose the following negative feedback law to achieve the pose estimation:
\[
u_e=-k_e(-e_e)=k_ee_e,\qquad k_e>0,
\]
where the achievement of the estimation is proved by the direct use of the storage function $U_e$ as a potential function in a Lyapunov-based energy approach. However, this observer input assumes that the visual measurements $f$ are continuously available, although general cameras run at 60, 30, 15, or fewer fps when the image processing time to extract the feature points is included. Therefore, this paper tackles a new visual feedback pose estimation problem explicitly taking the camera frame rate into account.
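The following sketch (illustrative names; the true $g_{co}$ is used only for exposition) computes the estimation error vector $e_e$ and the storage function $U_e$ from the pose estimate. In the actual observer, $e_e$ is reconstructed approximately from the measurement error $f_e$ through the image Jacobian as noted above:

```python
import numpy as np

def vee(S):
    """Inverse of the ^ operator on so(3)."""
    return np.array([S[2, 1], S[0, 2], S[1, 0]])

def estimation_error(g_bar_co, g_co):
    """Error vector e_e = [p_ee; sk(e^{xi^ theta_ee})^vee] for g_ee = g_bar_co^{-1} g_co."""
    g_ee = np.linalg.inv(g_bar_co) @ g_co
    R_ee, p_ee = g_ee[:3, :3], g_ee[:3, 3]
    return np.concatenate([p_ee, vee(0.5 * (R_ee - R_ee.T))])

def potential_Ue(g_bar_co, g_co):
    """Storage/potential function U_e = 0.5*||p_ee||^2 + 0.5*tr(I_3 - R_ee)."""
    g_ee = np.linalg.inv(g_bar_co) @ g_co
    R_ee, p_ee = g_ee[:3, :3], g_ee[:3, 3]
    return 0.5 * p_ee @ p_ee + 0.5 * np.trace(np.eye(3) - R_ee)
```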

3.2. Pose estimation mechanism

Let us first consider the case where the camera has a fixed frame rate $\tau>0$ [fps] which, for ease of representation, includes the image processing time to extract the feature points. The variable frame rate case is handled at the end of the main results. We assume that the computation time to calculate the estimation input is small enough to be neglected compared with $1/\tau$. Then, based on the frame rate $\tau$, we introduce the sampling time sequence $\{t_0,t_1,t_2,\dots\}$ such that $t_{i+1}-t_i=1/\tau$ holds for all $i\in\mathbb{N}_0$. ($\mathbb{N}_0$ represents the set of natural numbers including $0$.) Then, the visual measurements (2) are extracted at each time instant $t_i$.

We now propose the following sampled observer input for the relative rigid body motion model (3):
\[
u_e(t)=k_ee_e(t_i),\qquad t\in[t_i,t_{i+1}),\qquad k_e>0. \tag{6}
\]
Notice here that the present input remains constant until it is re-computed at the next sampling instant. Figure 3 illustrates the block diagram of the present estimation mechanism, called the sampled visual motion observer. In the present observer, since the image Jacobian consists of the estimates $\bar f$ and $\bar g_{co}$ as well as the visual measurements $f$ [Citation13], we employ the same samplers for these estimates as for the camera.
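A minimal simulation sketch of the sampled visual motion observer (3), (6) is given below; the zero-order hold of the observer input between camera frames is the essential difference from the continuous observer. The integration scheme and function names are our own assumptions, and the helpers from the earlier sketches are reused:

```python
import numpy as np

def run_sampled_observer(g_bar_co, g_co_of_t, V_wc_of_t, k_e, tau, t_end, dt):
    """Simulate the sampled visual motion observer (3) with the sampled input (6).

    g_co_of_t / V_wc_of_t: callables returning the true relative pose and the camera body
    velocity at time t (in a real system e_e would be reconstructed from the image data).
    The observer input u_e is re-computed only at the camera instants t_i = i / tau and
    held constant (zero-order hold) in between."""
    t, u_e, next_sample = 0.0, np.zeros(6), 0.0
    while t < t_end:
        if t >= next_sample:                               # a new frame is available
            u_e = k_e * estimation_error(g_bar_co, g_co_of_t(t))
            next_sample += 1.0 / tau
        # copy model (3): g_bar_dot = -V_wc^ g_bar + g_bar u_e^
        g_bar_dot = -hat6(V_wc_of_t(t)) @ g_bar_co + g_bar_co @ hat6(u_e)
        g_bar_co = g_bar_co + dt * g_bar_dot               # forward-Euler update
        t += dt
    return g_bar_co
```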

Figure 3. Block diagram of sampled visual motion observer.


4. Analysis of pose estimation

This section provides the convergence analysis for the stationary target object (i.e. $V_{wo}^b\equiv0$ holds) and the tracking performance analysis for the moving target.

4.1. Convergence analysis

Let us define the sampling error $\epsilon(t)\in\mathbb{R}^6$ between the actual estimation error $e_e(t)$ and the sampled one $e_e(t_i)$ as
\[
\epsilon(t):=e_e(t)-e_e(t_i),\qquad t\in[t_i,t_{i+1})\ \ \forall i\in\mathbb{N}_0.
\]
Then, we obtain the following theorem:

Theorem 4.1

Suppose that the target object is static (i.e. $V_{wo}^b\equiv0$). Then, if the camera frame rate satisfies the condition
\[
\tau\geq\frac{2k_e(1+\delta)}{\delta},\qquad \delta\in(0,1), \tag{7}
\]
there exist a finite time $T_a\geq t_0$ and positive scalars $\alpha_1,\beta_1>0$ such that
\[
\|e_e(t)\|\leq\alpha_1e^{-\beta_1(t-T_a)}\quad\forall t\geq T_a \tag{8}
\]
for the closed-loop system (4) and (6). In other words, the equilibrium point $e_e=0$ is exponentially stable after time $T_a$.

Proof.

When $V_{wo}^b=0$ holds, the time derivative of the potential function $U_e$ along the trajectories of (4) and (6) for $t\in[t_i,t_{i+1})$ yields
\[
\dot U_e=-e_e^Tu_e=-k_ee_e^Te_e(t_i)=-k_ee_e^T(e_e-\epsilon)\leq-k_e\|e_e\|^2+k_e\|e_e\|\|\epsilon\|.
\]
Therefore, if the inequality
\[
\|\epsilon\|\leq\delta\|e_e\|,\qquad\delta\in(0,1) \tag{9}
\]
is satisfied, we obtain
\[
\dot U_e\leq-k_e(1-\delta)\|e_e\|^2\leq0 \tag{10}
\]
for $t\in[t_i,t_{i+1})$.

We next derive the frame rate condition guaranteeing the inequality (9). Notice first that it is enough to consider each time interval $[t_i,t_{i+1})$ since $\epsilon$ becomes $0$ at the next time step $t_{i+1}$. Motivated by the analysis of event-triggered control [Citation18–20], we investigate the dynamics of $\|\epsilon\|/\|e_e\|$ for $t\in[t_i,t_{i+1})$ as follows:
\[
\frac{d}{dt}\frac{\|\epsilon\|}{\|e_e\|}
=\frac{\epsilon^T\dot\epsilon}{\|\epsilon\|\|e_e\|}-\frac{\|\epsilon\|\,e_e^T\dot e_e}{\|e_e\|^3}
\leq\frac{\|\dot e_e\|}{\|e_e\|}+\frac{\|\epsilon\|\|\dot e_e\|}{\|e_e\|^2}
=\Bigl(1+\frac{\|\epsilon\|}{\|e_e\|}\Bigr)\frac{\|\dot e_e\|}{\|e_e\|}. \tag{11}
\]
Here, we use the fact that $\dot\epsilon=\dot e_e$ for $t\in[t_i,t_{i+1})$. Let us now consider $\dot e_e$. From the estimation error system (4) with $V_{wo}^b=0$, we first get the following position term:
\[
\dot p_{ee}=-u_{ep}-\hat u_{eR}p_{ee}.
\]
Before obtaining the orientation term, we note that the following equality holds for any vector $a\in\mathbb{R}^3$ and any matrix $A\in\mathbb{R}^{3\times3}$:
\[
\hat aA+A^T\hat a=\bigl((\mathrm{tr}(A)I_3-A)a\bigr)^\wedge.
\]
From this property and the estimation error system (4) with $V_{wo}^b=0$, we get
\[
\frac{d}{dt}\bigl(\mathrm{sk}(e^{\hat\xi\theta_{ee}})\bigr)^\vee
=\frac{1}{2}\bigl(e^{\hat\xi\theta_{ee}}\hat\omega_{ee}+\hat\omega_{ee}e^{-\hat\xi\theta_{ee}}\bigr)^\vee
=\frac{1}{2}\bigl((1+2\cos\theta_{ee})I_3-e^{-\hat\xi\theta_{ee}}\bigr)\omega_{ee}
=-\frac{1}{2}e^{-\hat\xi\theta_{ee}}\bigl((1+2\cos\theta_{ee})I_3-e^{-\hat\xi\theta_{ee}}\bigr)u_{eR}.
\]
Here, we use the notation $V_{ee}^b=[v_{ee}^T\ \omega_{ee}^T]^T$, and the representation $\epsilon=[\epsilon_p^T\ \epsilon_R^T]^T$ is also utilized hereafter. Therefore, using the properties $\|e^{\hat\xi\theta}\|=1$ and $\|\mathrm{sk}(e^{\hat\xi\theta})^\vee\|\leq1$, and the estimation input $u_e(t)=k_e(e_e(t)-\epsilon(t))$, we obtain
\[
\|\dot e_e\|=\left\|\begin{bmatrix}-u_{ep}-\hat u_{eR}p_{ee}\\[0.5ex] -\frac{1}{2}e^{-\hat\xi\theta_{ee}}\bigl((1+2\cos\theta_{ee})I_3-e^{-\hat\xi\theta_{ee}}\bigr)u_{eR}\end{bmatrix}\right\|
\leq 2k_e\|e_e\|+2k_e\|\epsilon\|. \tag{12}
\]
Substituting this inequality into (11) gives
\[
\frac{d}{dt}\frac{\|\epsilon\|}{\|e_e\|}\leq 2k_e\Bigl(1+\frac{\|\epsilon\|}{\|e_e\|}\Bigr)^2.
\]
Let us now denote $\|\epsilon\|/\|e_e\|$ by $\chi\geq0$, i.e. we consider $\dot\chi\leq 2k_e(1+\chi)^2$, $\chi(t_i)=0$, $t\in[t_i,t_{i+1})$. Then, if $\eta(t)$ is the solution of $\dot\eta=2k_e(1+\eta)^2$, $\eta(t_i)=0$, we have $\chi(t)\leq\eta(t)$ for all $t\in[t_i,t_{i+1})$. This means that the time it takes for $\chi$ to evolve from $0$ to $\delta$ is larger than or equal to the time for $\eta$ (see Figure 4). The time for $\eta$ is given by $\delta/(2k_e(1+\delta))$, which is obtained from the solution $T>0$ of $\eta(t_i+T)=\delta$. Therefore, if $1/\tau\leq T$ holds, (9) is guaranteed for all time because $\epsilon$ is reset to $0$ before $\|\epsilon\|/\|e_e\|$ reaches $\delta$ (see Figure 4).

Figure 4. Image of time evolution of χ and η.


We finally show the exponential stability of $e_e=0$ after some time, using the Gronwall–Bellman inequality [Citation24]. Notice that if $\tau$ satisfies (7), then the inequality (10) holds, i.e. $\dot U_e$ is negative definite for all time. It should also be noted that $U_e$ is continuous and positive definite. Therefore, there exists a finite time $T_a>t_0$ in the time sequence $t_i$, $i\in\mathbb{N}_0$, satisfying $U_e(t)<1\ \forall t\geq T_a$. Then, $\phi(e^{\hat\xi\theta_{ee}})<1$ holds for all $t\geq T_a$ from the definition of $U_e$. The property $\phi(e^{\hat\xi\theta})\leq\|\mathrm{sk}(e^{\hat\xi\theta})^\vee\|^2$ also holds for $\phi(e^{\hat\xi\theta})<1$ (Footnote 4). In summary, (10) provides the following inequality for every interval $t\in[t_i,t_{i+1})$, $t_i\geq T_a$:
\[
\dot U_e(t)\leq-k_e(1-\delta)U_e(t).
\]
Then, from the Gronwall–Bellman inequality, we obtain
\[
U_e(t)\leq U_e(t_i)e^{-k_e(1-\delta)(t-t_i)},\qquad t\in[t_i,t_{i+1}). \tag{13}
\]
We next consider $t_{i+1}=t_i+1/\tau$. Similarly to the above discussion, we obtain
\[
U_e(t)\leq U_e(t_{i+1})e^{-k_e(1-\delta)(t-t_i-1/\tau)},\qquad t\in[t_{i+1},t_{i+2}).
\]
Then, since $U_e(t_{i+1})\leq U_e(t_i)e^{-k_e(1-\delta)/\tau}$ holds from (13) and the continuity of $U_e$, we get
\[
U_e(t)\leq U_e(t_i)e^{-k_e(1-\delta)(t-t_i)},\qquad t\in[t_{i+1},t_{i+2}).
\]
Furthermore, induction from time $T_a$ gives
\[
U_e(t)\leq U_e(T_a)e^{-k_e(1-\delta)(t-T_a)}\quad\forall t\geq T_a. \tag{14}
\]
Finally, we derive the inequality (8) from (14). Remember that $\frac{1}{2}\|e_e\|^2\leq U_e\leq\|e_e\|^2$ holds after time $T_a$. Then, substituting these inequalities into (14) yields
\[
\frac{1}{2}\|e_e(t)\|^2\leq\|e_e(T_a)\|^2e^{-k_e(1-\delta)(t-T_a)}\quad\forall t\geq T_a.
\]
Therefore, by taking the square root of both sides of this inequality and defining $\alpha_1,\beta_1>0$ as $\alpha_1:=\sqrt{2}\|e_e(T_a)\|$ and $\beta_1:=k_e(1-\delta)/2$, we obtain (8).
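The inter-sample bound used in this proof can be checked numerically; the sketch below integrates the comparison system $\dot\eta=2k_e(1+\eta)^2$ from $\eta=0$ and confirms that $\eta$ reaches $\delta$ no earlier than $\delta/(2k_e(1+\delta))$ (the gain and $\delta$ values are arbitrary test values, not taken from the paper):

```python
# Comparison system used in the proof: eta_dot = 2*k_e*(1 + eta)^2 with eta(t_i) = 0
# reaches delta exactly at T = delta / (2*k_e*(1 + delta)).
k_e, delta = 1.7, 0.95
T_bound = delta / (2 * k_e * (1 + delta))

eta, t, dt = 0.0, 0.0, 1e-6
while eta < delta:
    eta += dt * 2 * k_e * (1 + eta) ** 2   # forward-Euler integration
    t += dt

print(T_bound, t)   # the two times agree up to the integration error
```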

Remark 4.1

The condition (7) implies that the choice of a large feedback gain $k_e$ requires a fast frame rate $\tau$ (a small sampling interval). This property is intuitive because a large gain amplifies the influence of the sampling error $\epsilon$, which might have a detrimental impact on the estimation. In other words, once a camera with a certain frame rate is chosen, the gain cannot be made arbitrarily large. Although the condition (7) is only sufficient, this analysis provides the significant insight that camera frame rates, or image processing time, are not negligible for pose estimation.
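In practice, condition (7) can be read as an upper bound on the estimation gain for a given camera; a small helper like the following (the $\delta$ value is illustrative) makes the trade-off explicit:

```python
def max_estimation_gain(tau, delta):
    """Largest k_e admitted by the (sufficient) condition (7): tau >= 2*k_e*(1 + delta)/delta."""
    return tau * delta / (2 * (1 + delta))

for fps in (15, 30, 60):
    print(fps, max_estimation_gain(fps, delta=0.95))   # e.g. 60 fps admits k_e up to about 14.6
```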

4.2. Tracking performance analysis

We next analyse the tracking performance for the moving target. Suppose that $\|V_{wo}^b(t)\|\leq\kappa\ \forall t\geq t_0$ holds for a positive scalar $\kappa>0$, i.e. the target velocity is bounded. We also assume that the value of $\kappa$ is known a priori, e.g. simply as prior information or due to hardware limitations of the target vehicle. Then, the time derivative of the potential function $U_e$ along the trajectories of (4) and (6) yields
\[
\dot U_e=-e_e^Tu_e+e_e^T\mathrm{Ad}_{(e^{\hat\xi\theta_{ee}})}V_{wo}^b
\leq-k_ee_e^Te_e(t_i)+\|e_e\|\|V_{wo}^b\|
\leq-k_e\|e_e\|^2+k_e\|e_e\|\|\epsilon\|+\kappa\|e_e\|.
\]
Here, for ease of representation, we employ the following notation, differently from $\mathrm{Ad}_{(g)}$ in (5):
\[
\mathrm{Ad}_{(e^{\hat\xi\theta})}:=\begin{bmatrix}e^{\hat\xi\theta} & O_{3\times3}\\ O_{3\times3} & e^{\hat\xi\theta}\end{bmatrix}\in\mathbb{R}^{6\times6}.
\]
Let us now introduce a performance indicator $\gamma>0$ to evaluate the sampling error $\epsilon$. Then, if $\|\epsilon\|\leq\gamma$ is satisfied, we get
\[
\dot U_e\leq-k_e\|e_e\|^2+(k_e\gamma+\kappa)\|e_e\|. \tag{15}
\]
Since the right-hand side of (15) consists only of $\|e_e\|$, we obtain the following theorem by employing ultimate boundedness analysis [Citation24]:

Theorem 4.2

Suppose that the norm of the target object velocity $V_{wo}^b$ is upper bounded by $\kappa$. Then, for every initial estimation error $e_e(t_0)$, there exists $T_b\geq t_0$ such that the solution $e_e(t)$ of the closed-loop system (4) and (6) satisfies
\[
\|e_e(t)\|\leq\frac{\sqrt{2}(k_e\gamma+\kappa)}{k_e\delta}\quad\forall t\geq T_b \tag{16}
\]
if $|\theta_{ee}|\leq\pi/2$ and the following frame rate condition hold:

\[
\tau\geq\begin{cases}
\dfrac{k_e}{\ln\Bigl(1+\dfrac{k_e\gamma}{\sqrt{2}\kappa+2k_e\sqrt{2U_e(t_0)}}\Bigr)} & \text{if }\ \|e_e(t_0)\|\geq\dfrac{k_e\gamma+\kappa}{k_e\delta},\\[3ex]
\dfrac{k_e}{\ln\Bigl(1+\dfrac{k_e\gamma\delta}{\sqrt{2}\kappa\delta+2\sqrt{2}(k_e\gamma+\kappa)}\Bigr)} & \text{otherwise}.
\end{cases} \tag{17}
\]

Proof.

If $\|\epsilon\|\leq\gamma$ is satisfied for all time, we get the following inequality for $\delta\in(0,1)$ from (15):
\[
\dot U_e\leq-k_e(1-\delta)\|e_e\|^2-k_e\delta\|e_e\|^2+(k_e\gamma+\kappa)\|e_e\|
\leq-k_e(1-\delta)\|e_e\|^2\quad\forall\,\|e_e\|\geq\frac{k_e\gamma+\kappa}{k_e\delta}. \tag{18}
\]
Therefore, from Theorem 4.18 of [Citation24] and the property that $\frac{1}{2}\|e_e\|^2\leq U_e\leq\|e_e\|^2$ holds for $|\theta_{ee}|<\pi/2$, we can conclude that for every $e_e(t_0)$, there exists $T_b\geq t_0$ such that $e_e(t)$ satisfies (16).

We next derive the frame rate condition guaranteeing the inequality $\|\epsilon\|\leq\gamma$. Similarly to the proof of Theorem 4.1, we investigate the dynamics of $\|\epsilon\|$ in $t\in[t_i,t_{i+1})$ as follows:
\[
\frac{d}{dt}\|\epsilon\|\leq\|\dot\epsilon\|=\|\dot e_e\|\leq 2k_e\|e_e(t_i)\|+k_e\|\epsilon\|+\sqrt{2}\kappa. \tag{19}
\]
The differences from (12) are that we have to handle $V_{wo}^b$, which appears in (4), and that we consider not $e_e(t)$ but $e_e(t_i)$ to reduce the conservativeness. Let us now derive the upper bound of $\|e_e(t_i)\|$. We first consider the case $\|e_e(t_0)\|\geq(k_e\gamma+\kappa)/(k_e\delta)$. Then, the following inequality holds from $\frac{1}{2}\|e_e\|^2\leq U_e$ and (18):
\[
\|e_e(t)\|\leq\sqrt{2U_e(t)}\leq\sqrt{2U_e(t_0)}\quad\forall t\geq t_0,
\]
which also means $\|e_e(t_i)\|\leq\sqrt{2U_e(t_0)}\ \forall i\in\mathbb{N}_0$. We thus obtain
\[
\frac{d}{dt}\|\epsilon\|\leq k_e\|\epsilon\|+\sqrt{2}\kappa+2k_e\sqrt{2U_e(t_0)}
\]
from (19), and the first frame rate condition in (17) follows via the same analysis as in Theorem 4.1.

On the other hand, when the initial estimation error satisfies $\|e_e(t_0)\|<(k_e\gamma+\kappa)/(k_e\delta)$, $U_e(t)$ might increase and become larger than $U_e(t_0)$. However, since (18) holds, $U_e(t)$ never goes beyond the value associated with $\|e_e\|=(k_e\gamma+\kappa)/(k_e\delta)$, denoted by $\bar U_e$. Then, this fact provides
\[
\|e_e(t)\|\leq\sqrt{2U_e(t)}\leq\sqrt{2\bar U_e}\leq\frac{\sqrt{2}(k_e\gamma+\kappa)}{k_e\delta}\quad\forall t\geq t_0
\]
for $|\theta_{ee}|<\pi/2$. Therefore, we obtain
\[
\frac{d}{dt}\|\epsilon\|\leq k_e\|\epsilon\|+\sqrt{2}\kappa+\frac{2\sqrt{2}(k_e\gamma+\kappa)}{\delta}
\]
from (19), and the second frame rate condition in (17) follows.

Remark 4.2

The performance evaluation (16) can be rewritten as follows:
\[
\|e_e(t)\|\leq\frac{\sqrt{2}\gamma}{\delta}+\frac{\sqrt{2}\kappa}{k_e\delta}\quad\forall t\geq T_b.
\]
This means that a smaller $\gamma$ and a larger $k_e$ achieve better performance. However, both of them require faster frame rates because the right-hand sides of (17) are monotonically decreasing in $\gamma$ and monotonically increasing in $k_e$. Therefore, as expected, $\gamma$ can be regarded as an indicator of the tracking performance and of the (sufficiently) allowable gains for a designer. For example, choosing a camera with a certain frame rate enables us to design $k_e$ for desired performance related to $\gamma$ from (16) and (17). A design example is provided in Section 7.
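A possible gain-design sketch along the lines of this remark is shown below; it evaluates the ultimate bound from (16) and the second frame rate condition of (17) as stated above. The numerical values of $k_e$, $\kappa$, and $\delta$ are illustrative assumptions, not design recommendations from the paper:

```python
import math

def ultimate_bound(k_e, gamma, kappa, delta):
    """Ultimate bound on ||e_e|| from (16): sqrt(2)*(k_e*gamma + kappa)/(k_e*delta)."""
    return math.sqrt(2) * (k_e * gamma + kappa) / (k_e * delta)

def required_fps(k_e, gamma, kappa, delta):
    """Second frame rate condition of (17) (the case of a small initial estimation error)."""
    x = k_e * gamma * delta / (math.sqrt(2) * kappa * delta
                               + 2 * math.sqrt(2) * (k_e * gamma + kappa))
    return k_e / math.log(1 + x)

# trade-off: a smaller gamma tightens the error bound but demands a faster camera
k_e, kappa, delta = 1.7, 1.4, 0.95
for gamma in (0.5, 0.83, 1.5):
    print(gamma, ultimate_bound(k_e, gamma, kappa, delta), required_fps(k_e, gamma, kappa, delta))
```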

Remark 4.3

Theorem 4.2 provides two frame rate conditions depending on the initial estimation error. However, if we first run the present estimation law (6) before the target moves, we only need to consider the second condition.

5. Sampled visual feedback pose regulation

We next propose a sampled visual feedback pose regulation mechanism based on the present sampled visual motion observer for the objective (ii).

5.1. Control error system

Similarly to the estimation error system (4) presented in Section 3.1, we build the control error system. The control error $g_{ce}=(p_{ce},e^{\hat\xi\theta_{ce}})\in SE(3)$ and its vector form $e_c\in\mathbb{R}^6$ are defined as follows:
\[
g_{ce}:=g_d^{-1}\bar g_{co},\qquad e_c:=\begin{bmatrix}p_{ce}\\ \mathrm{sk}(e^{\hat\xi\theta_{ce}})^\vee\end{bmatrix}.
\]
Notice that for $\theta_{ce}\in(-\pi,\pi)$, $e_c=0$ holds if and only if $g_{ce}=I_4$, i.e. $\bar g_{co}=g_d$. Then, the time derivative of $g_{ce}$ along the trajectories of (3) provides the following control error system:
\[
\dot g_{ce}=-g_d^{-1}\hat V_{wc}^bg_dg_{ce}+g_{ce}\hat u_e. \tag{20}
\]
This is also written in the vector form as
\[
V_{ce}^b:=(g_{ce}^{-1}\dot g_{ce})^\vee=-\mathrm{Ad}_{(g_{ce}^{-1})}\mathrm{Ad}_{(g_d^{-1})}V_{wc}^b+u_e.
\]
Combining the estimation error system (4) with the control error system (20) yields the following total error system in the vector form:
\[
\begin{bmatrix}V_{ce}^b\\ V_{ee}^b\end{bmatrix}
=\begin{bmatrix}-\mathrm{Ad}_{(g_{ce}^{-1})} & I_6\\ O_{6\times6} & -\mathrm{Ad}_{(g_{ee}^{-1})}\end{bmatrix}u_{ce}
+\begin{bmatrix}0\\ V_{wo}^b\end{bmatrix},\qquad
u_{ce}:=\begin{bmatrix}\mathrm{Ad}_{(g_d^{-1})}V_{wc}^b\\ u_e\end{bmatrix}\in\mathbb{R}^{12}. \tag{21}
\]
It is shown in [Citation13] that if $V_{wo}^b=0$, the total error system (21) also has a passivity-like property from the input $u_{ce}$ to the output $-\nu_{ce}\in\mathbb{R}^{12}$ defined through
\[
\nu_{ce}:=Ne_{ce},\qquad
N:=\begin{bmatrix}I_6 & O_{6\times6}\\ -\mathrm{Ad}_{(e^{\hat\xi\theta_{ce}})} & I_6\end{bmatrix}\in\mathbb{R}^{12\times12},\qquad
e_{ce}:=\begin{bmatrix}e_c\\ e_e\end{bmatrix}\in\mathbb{R}^{12}.
\]
Here, $e_{ce}$ is the total control and estimation error vector, and the corresponding storage function $U\geq0$ is defined as
\[
U:=\frac{1}{2}\|p_{ce}\|^2+\phi(e^{\hat\xi\theta_{ce}})+\frac{1}{2}\|p_{ee}\|^2+\phi(e^{\hat\xi\theta_{ee}}),
\]
which yields $\dot U=u_{ce}^T(-\nu_{ce})$, i.e. a passivity-like property from the input $u_{ce}$ to the output $-\nu_{ce}$ [Citation13]. We note that $U=0$ is equivalent to $e_{ce}=0$ for $\theta_{ce},\theta_{ee}\in(-\pi,\pi)$, and $e_{ce}=0$ means $g_{co}=g_d$, that is, the pose regulation (objective (ii)) is achieved.

Based on the passivity-like property, Fujita et al. [Citation13] propose the following negative feedback law to achieve the pose regulation as in Figure 1(a):
\[
u_{ce}=-K(-\nu_{ce})=K\nu_{ce},\qquad K>0.
\]
Here, $K\in\mathbb{R}^{12\times12}$ is a positive definite gain matrix, and the achievement of the regulation is proved by the direct use of the storage function $U$ as a potential function in a Lyapunov-based energy approach. However, this technique also assumes the continuous availability of the visual measurements $f$, which allows any positive definite matrix $K$.

5.2. Pose regulation mechanism

Consider the same settings as in Section 3, i.e. the fixed frame rate $\tau$ and the sampling time sequence $t_i$, $i\in\mathbb{N}_0$. Then, motivated by the passivity-like property of the total error system (21), we propose the following visual feedback pose regulation input based on the sampled visual motion observer:
\[
u_{ce}(t)=K\begin{bmatrix}e_c(t)\\ e_e(t_i)-\mathrm{Ad}_{(e^{\hat\xi\theta_{ce}})}e_c(t)\end{bmatrix},\qquad t\in[t_i,t_{i+1}),\qquad
K:=\begin{bmatrix}k_cI_6 & O_{6\times6}\\ O_{6\times6} & k_eI_6\end{bmatrix}\in\mathbb{R}^{12\times12},\qquad k_c,k_e>0. \tag{22}
\]
Notice here that only the estimation error $e_e(t_i)$ is held constant until the next sampling time, since we consider the case where the computation time to obtain the estimate $\bar g_{co}$ is small enough to be neglected. This structure enables us to greatly reduce the conservativeness of the frame rate condition provided by the conference version of this paper [Citation22]. In [Citation22], $\bar g_{co}$ is also sampled, i.e. $e_c(t_i)$ and $e^{\hat\xi\theta_{ce}(t_i)}$ are used in the regulation input (22) instead of $e_c(t)$ and $e^{\hat\xi\theta_{ce}(t)}$.

This paper assumes $k_e\leq k_c$ for better performance in the subsequent discussion. The reason for employing this gain relationship is that only the observer input for the estimation is sampled; as a result, a large gain $k_e$ amplifies the influence of the sampling error $\epsilon(t)=e_e(t)-e_e(t_i)$, which degrades the control performance. The block diagram of the present sampled visual feedback system is illustrated in Figure 5.
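To make the structure of (22) concrete, a sketch of one evaluation of the regulation input is given below (NumPy, illustrative function names, reusing the vee helper defined earlier). The first returned block corresponds to $\mathrm{Ad}_{(g_d^{-1})}V_{wc}^b$, from which the camera body velocity command is recovered, and the second block is the observer input $u_e$ driving (3):

```python
import numpy as np

def Ad_rot(R):
    """Block-diagonal Ad(e^{xi^ theta}) = diag(R, R) used in the regulation law (22)."""
    A = np.zeros((6, 6))
    A[:3, :3] = R
    A[3:, 3:] = R
    return A

def regulation_input(g_d, g_bar_co, e_e_sampled, k_c, k_e):
    """One evaluation of the sampled pose regulation input (22).

    e_e_sampled is the estimation error held from the last camera frame (zero-order hold);
    the control error e_c is recomputed continuously from the current estimate g_bar_co."""
    g_ce = np.linalg.inv(g_d) @ g_bar_co
    R_ce, p_ce = g_ce[:3, :3], g_ce[:3, 3]
    e_c = np.concatenate([p_ce, vee(0.5 * (R_ce - R_ce.T))])
    u_cam = k_c * e_c                                   # first block: Ad(g_d^{-1}) V_wc^b
    u_e = k_e * (e_e_sampled - Ad_rot(R_ce) @ e_c)      # second block: observer input u_e
    return u_cam, u_e
```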

Figure 5. Block diagram of sampled visual feedback system.


6. Analysis of pose regulation

Similarly to Section 4, this section provides the convergence analysis for the stationary target object and the tracking performance analysis for the moving target.

6.1. Convergence analysis

Using the same definition of the sampling error $\epsilon(t)=e_e(t)-e_e(t_i)$, $t\in[t_i,t_{i+1})\ \forall i\in\mathbb{N}_0$, as in Section 4, we have the following theorem:

Theorem 6.1

Suppose that the target object is static (i.e. $V_{wo}^b\equiv0$). Then, if the camera frame rate satisfies the condition

\[
\tau\geq\frac{\sqrt{\bigl(k_c+4k_e+\sqrt{k_c^2+4k_e^2}\bigr)^2-16\sqrt{5}k_e^2}}{\ln\left(\dfrac{8\sqrt{5}k_e+(3-\sqrt{5})\delta\Bigl(k_c+4k_e+\sqrt{k_c^2+4k_e^2}+\sqrt{\bigl(k_c+4k_e+\sqrt{k_c^2+4k_e^2}\bigr)^2-16\sqrt{5}k_e^2}\Bigr)}{8\sqrt{5}k_e+(3-\sqrt{5})\delta\Bigl(k_c+4k_e+\sqrt{k_c^2+4k_e^2}-\sqrt{\bigl(k_c+4k_e+\sqrt{k_c^2+4k_e^2}\bigr)^2-16\sqrt{5}k_e^2}\Bigr)}\right)},\qquad\delta\in(0,1), \tag{23}
\]
there exist a finite time $T_c\geq t_0$ and positive scalars $\alpha_2,\beta_2>0$ such that
\[
\|e_{ce}(t)\|\leq\alpha_2e^{-\beta_2(t-T_c)}\quad\forall t\geq T_c
\]
for the closed-loop system (21) and (22).

Proof.

When $V_{wo}^b=0$ holds, the time derivative of the potential function $U$ along the trajectories of (21) and (22) for $t\in[t_i,t_{i+1})$ yields
\[
\dot U=-e_{ce}^TN^Tu_{ce}
=-e_{ce}^TN^TKNe_{ce}+k_e\bigl(e_e-\mathrm{Ad}_{(e^{\hat\xi\theta_{ce}})}e_c\bigr)^T\epsilon
\leq-\frac{3-\sqrt{5}}{2}k_e\|e_{ce}\|^2+\sqrt{2}k_e\|e_{ce}\|\|\epsilon\|.
\]
Here, we use the property $e_{ce}^TN^TKNe_{ce}\geq\lambda_{\min}(N^TKN)\|e_{ce}\|^2\geq(3-\sqrt{5})k_e\|e_{ce}\|^2/2$. ($\lambda_{\min}(A)\in\mathbb{R}$ is the smallest eigenvalue of a symmetric matrix $A\in\mathbb{R}^{n\times n}$.) Therefore, if the inequality
\[
\|\epsilon\|\leq\frac{3-\sqrt{5}}{2\sqrt{2}}\delta\|e_{ce}\|
\]
is satisfied, we obtain
\[
\dot U\leq-\frac{3-\sqrt{5}}{2}k_e(1-\delta)\|e_{ce}\|^2
\]
for $t\in[t_i,t_{i+1})$.

Then, by the same techniques as in the proof of Theorem 4.1, we get
\[
\frac{d}{dt}\frac{\|\epsilon\|}{\|e_{ce}\|}\leq\frac{\|\dot e_e\|}{\|e_{ce}\|}+\frac{\|\epsilon\|}{\|e_{ce}\|^2}\|\dot e_{ce}\| \tag{24}
\]
and
\[
\|\dot e_e\|\leq\sqrt{10}k_e\|e_{ce}\|+\sqrt{2}k_e\|\epsilon\|,\qquad
\|\dot e_{ce}\|\leq\bigl(k_c+2k_e+\sqrt{k_c^2+4k_e^2}\bigr)\|e_{ce}\|+2\sqrt{2}k_e\|\epsilon\|. \tag{25}
\]
Substituting (25) into (24) provides
\[
\frac{d}{dt}\frac{\|\epsilon\|}{\|e_{ce}\|}\leq 2\sqrt{2}k_e\Bigl(\frac{\|\epsilon\|}{\|e_{ce}\|}\Bigr)^2+\bigl(k_c+4k_e+\sqrt{k_c^2+4k_e^2}\bigr)\frac{\|\epsilon\|}{\|e_{ce}\|}+\sqrt{10}k_e.
\]
The remaining analysis to obtain the frame rate condition (23) is the same as in the proof of Theorem 4.1.

The condition (23) also implies that large feedback gains $k_c$ and $k_e$ require fast camera frame rates $\tau$.

6.2. Tracking performance analysis

Using the same assumption $\|V_{wo}^b\|\leq\kappa\ \forall t\geq t_0$ as in Section 4.2, the time derivative of the potential function $U$ along the trajectories of (21) and (22) is given as follows:
\[
\dot U=-e_{ce}^TN^Tu_{ce}+e_e^T\mathrm{Ad}_{(e^{\hat\xi\theta_{ee}})}V_{wo}^b
\leq-\frac{3-\sqrt{5}}{2}k_e\|e_{ce}\|^2+\sqrt{2}k_e\|e_{ce}\|\|\epsilon\|+\kappa\|e_{ce}\|.
\]
Here, we note that $V_{wo}^b$ appears only in the estimation part of the total error system (21). Therefore, if $\|\epsilon\|\leq\gamma$ is satisfied for a certain performance indicator $\gamma>0$, we get
\[
\dot U\leq-\frac{3-\sqrt{5}}{2}k_e\|e_{ce}\|^2+(\sqrt{2}k_e\gamma+\kappa)\|e_{ce}\|. \tag{26}
\]
Then, we have the following theorem similar to Theorem 4.2:

Theorem 6.2

Suppose that the norm of the target object velocity $V_{wo}^b$ is upper bounded by $\kappa$. Then, for every initial control and estimation error $e_{ce}(t_0)$, there exists $T_d\geq t_0$ such that the solution $e_{ce}(t)$ of the closed-loop system (21) and (22) satisfies
\[
\|e_{ce}(t)\|\leq\frac{4k_e\gamma+2\sqrt{2}\kappa}{(3-\sqrt{5})k_e\delta}\quad\forall t\geq T_d \tag{27}
\]
if $|\theta_{ee}|,|\theta_{ce}|\leq\pi/2$ and the following frame rate condition hold:

\[
\tau\geq\begin{cases}
\dfrac{\sqrt{2}k_e}{\ln\Bigl(1+\dfrac{k_e\gamma}{\kappa+k_e\sqrt{5U(t_0)}}\Bigr)} & \text{if }\ \|e_{ce}(t_0)\|\geq\dfrac{2(\sqrt{2}k_e\gamma+\kappa)}{(3-\sqrt{5})k_e\delta},\\[3ex]
\dfrac{\sqrt{2}k_e}{\ln\Bigl(1+\dfrac{(6-2\sqrt{5})k_e\gamma\delta}{(6-2\sqrt{5})\kappa\delta+4\sqrt{5}(2k_e\gamma+\kappa)}\Bigr)} & \text{otherwise}.
\end{cases} \tag{28}
\]

Proof.

If $\|\epsilon\|\leq\gamma$ is satisfied for all time, the following inequality is obtained from (26):
\[
\dot U\leq-\frac{3-\sqrt{5}}{2}k_e(1-\delta)\|e_{ce}\|^2\quad\forall\,\|e_{ce}\|\geq\frac{2(\sqrt{2}k_e\gamma+\kappa)}{(3-\sqrt{5})k_e\delta},\qquad\delta\in(0,1).
\]
Therefore, from Theorem 4.18 of [Citation24] and the property that $\frac{1}{2}\|e_{ce}\|^2\leq U\leq\|e_{ce}\|^2$ holds for $|\theta_{ee}|,|\theta_{ce}|<\pi/2$, we can conclude that for every $e_{ce}(t_0)$, there exists $T_d\geq t_0$ such that $e_{ce}(t)$ satisfies (27).

We next derive the frame rate condition guaranteeing the inequality $\|\epsilon\|\leq\gamma$. We get the following dynamics of $\|\epsilon\|$ from (21) and (25):
\[
\frac{d}{dt}\|\epsilon\|\leq\|\dot\epsilon\|=\|\dot e_e\|\leq\sqrt{10}k_e\|e_{ce}\|+\sqrt{2}k_e\|\epsilon\|+\sqrt{2}\kappa.
\]
Then, by the same approach as in the proof of Theorem 4.2, we get the frame rate condition (28).

The condition (28) also implies that a smaller $\gamma$ and a larger $k_e$ require faster frame rates. Here, a larger $k_e$ also requires a larger $k_c$ from the gain relationship $k_c\geq k_e$.

6.3. Variable camera frame rate case

So far, we have considered the situation where the visual measurements (2) can be extracted at every fixed sampling interval $1/\tau$. However, the actual sampling time for frame rates, including image processing time, is variable. To deal with this issue, we consider the worst case (i.e. the maximum sampling interval).

Suppose that the visual measurements (2) are extracted at time instants $\{t_0,t_1,t_2,\dots\}$, where $t_{i+1}-t_i=1/\tau_i$, $\tau_i>0$, $i\in\mathbb{N}_0$. Instead of the fixed camera frame rate case, we now suppose that the worst frame rate, denoted by $\tau_{\min}>0$, is known a priori via pre-experiments of image processing. Then, by simply replacing $\tau$ with $\tau_{\min}$ in the conditions (7), (17), (23), and (28), the non-increasing properties of $U_e$ and $U$ are always guaranteed. We thus have the following corollary:

Corollary 6.3

Suppose that $\tau_i\geq\tau_{\min}$ is satisfied for all $i\in\mathbb{N}_0$. Then, the same statements as in Theorems 4.1, 4.2, 6.1, and 6.2 hold by replacing $\tau$ with $\tau_{\min}$.
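In an implementation, $\tau_{\min}$ can be obtained from time-stamped frames in a pre-experiment, as in the following sketch (the timestamps are made up purely for illustration):

```python
def worst_frame_rate(frame_times):
    """Estimate tau_min [fps] from time-stamped frames (image processing time included),
    to substitute for tau in (7), (17), (23) and (28) as in Corollary 6.3."""
    intervals = [t1 - t0 for t0, t1 in zip(frame_times, frame_times[1:])]
    return 1.0 / max(intervals)

# hypothetical timestamps [s] logged during a pre-experiment
print(worst_frame_rate([0.000, 0.031, 0.068, 0.124, 0.158]))   # -> about 17.9 fps
```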

7. Verification

This section demonstrates the effectiveness of the present sampled visual feedback pose regulation mechanism (22) via simulation and an experiment. The verification of the estimation alone is omitted because it has already been shown in our previous work [Citation21].

7.1. Simulation

Consider a pinhole camera pointing in the $z$-axis direction of $\Sigma_c$ with the focal length $\sigma=0.003$ [m], and suppose that its frame rate is variable with the minimum rate $\tau_{\min}=60$ [fps]. The initial relative pose of the target object to the camera is set as $p_{co}(0)=[3\ 5\ 7]^T$ [m] and $\xi\theta_{co}(0)=[0\ \pi/4\ 0]^T$ [rad], where we set $t_0=0$. The positions of the four feature points $p_{o1},\dots,p_{o4}$ in $\Sigma_o$ are set at four corners of the cube whose coordinates have magnitude $0.5$ [m]. The target object velocity is set as follows:

\[
V_{wo}^b=\begin{cases}
[0\ \ 0\ \ 2\ \ 0\ \ 0\ \ 0]^T & t\in[0,30)\\[0.5ex]
[0.5\ \ 0\ \ 1\ \ 0.5\sin(2t)\ \ 0\ \ 0.5\cos(t)]^T & t\in[30,40)\\[0.5ex]
[0\ \ \cos(t)\ \ 0\ \ 0\ \ \sin(0.5t)\ \ 0]^T & t\in[40,50)\\[0.5ex]
0 & t\in[50,60),
\end{cases}
\]
where the units of $v_{wo}$ and $\omega_{wo}$ are [m/s] and [rad/s], respectively. We artificially set the velocity to $0$ after 50 s. This setting enables us to verify both tracking (until 50 s) and convergence (after 50 s) in a single demonstration.

The present sampled visual feedback pose regulation mechanism (22) with $k_c=6.5$, $k_e=1.7$, $\delta=0.95$, $\gamma=0.83$, $\bar p_{co}(0)=[0\ 0\ 1]^T$ [m], and $\bar\xi\bar\theta_{co}(0)=0$ [rad] is applied to the camera to achieve the desired relative pose $p_d=[0\ 0\ 0.5]^T$ [m] and $\xi\theta_d=0$ [rad], which yields $\|e_{ce}(0)\|=8.4$. In this setting, the frame rate condition (23) for convergence is $\tau\geq30.4$ and (28) for tracking becomes $\tau\geq59.6$, that is, both frame rate conditions are satisfied by $\tau_{\min}=60$ [fps]. As a result, Corollary 6.3 (together with Theorem 6.2) provides the tracking performance guarantee that there exists $T_d\geq0$ such that $\|e_{ce}(t)\|\leq7.8\ \forall t\geq T_d$.
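For reference, the value $\tau\geq30.4$ can be reproduced by evaluating the condition (23) as written above with the gains of this simulation; the following sketch performs that computation (a check under our reading of (23), not code from the paper):

```python
import math

def min_fps_regulation(k_c, k_e, delta):
    """Frame rate condition (23) for convergence of the sampled pose regulation law."""
    b = k_c + 4 * k_e + math.sqrt(k_c**2 + 4 * k_e**2)
    d = math.sqrt(b**2 - 16 * math.sqrt(5) * k_e**2)
    num = 8 * math.sqrt(5) * k_e + (3 - math.sqrt(5)) * delta * (b + d)
    den = 8 * math.sqrt(5) * k_e + (3 - math.sqrt(5)) * delta * (b - d)
    return d / math.log(num / den)

print(min_fps_regulation(k_c=6.5, k_e=1.7, delta=0.95))   # approximately 30.4
```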

The simulation results are depicted in Figures 6 and 7. Figure 6 shows the time response of the potential function $U$, and the time response of the norm of the total control and estimation error $\|e_{ce}\|$ is depicted in Figure 7. The exponential stability of $e_{ce}=0$ is seen from the behaviour after 50 s. We also see good tracking performance in the behaviour until 50 s, where the potential function $U$ sometimes increases due to the non-zero target object velocity $V_{wo}^b$, but this occurs within the bound derived in the present analysis. In summary, the present sampled visual feedback pose regulation mechanism works successfully.

Figure 6. Potential function (simulation).


Figure 7. Norm of total error (simulation).


7.2. Experiment

We next show the validity of the present sampled visual feedback pose regulation mechanism (22) via an experiment. Here, two self-developed 2D omnidirectional robots are employed as the camera robot and the target object robot (see Figure 8). In this experiment, the pose estimation is conducted in three-dimensional space, but the pose regulation is demonstrated on a two-dimensional plane (the $x$-$z$ plane) by projecting the 6D translational/angular velocity input onto a 3D one (i.e. $V_{wc}^b=[v_{wc,x}\ v_{wc,z}\ \omega_{wc,y}]^T\in\mathbb{R}^3$).

Figure 8. Experimental testbed.


Each omnidirectional robot mainly consists of three motors (RA250100-58Y91 from Daisen Electronics Industrial, Co., Ltd.) with omnidirectional wheels (4571398310089 from Vstone, Co., Ltd.) and a single camera (FL3-U3-13S2C-CS from FLIR Systems, Inc.). The focal length of the camera is tuned to $\sigma=0.0193$ [m], and the camera natively runs at 120 fps. However, when we explicitly account for the image processing time to extract the four feature points attached to the target object robot (see Figure 8), the actual minimum frame rate becomes 18 fps in this experiment. We also introduce a motion capture camera system (with OptiTrack Flex 13 from NaturalPoint, Inc.) to obtain ground-truth experimental data for the evaluation of the convergence/tracking, but this information is not used for the control inputs of the camera robot.

The present sampled visual feedback pose regulation mechanism (22) with $k_c=2$, $k_e=1.4$, $\delta=0.9$, $\bar p_{co}=[0\ 0\ 0.4]^T$ [m], and $\bar\xi\bar\theta_{co}=0$ [rad] is applied to the camera robot to achieve the desired relative pose $p_d=[0\ 0\ 0.33]^T$ [m] and $\xi\theta_d=0$ [rad]. Here, the initial relative pose is set as $p_{co}(0)\approx[0\ 0\ 0.37]^T$ [m] and $\xi\theta_{co}(0)\approx0$ [rad], and the feature points are attached at $p_{o1}\approx[0.02\ 0.12\ 0.10]^T$, $p_{o2}\approx[0.02\ 0.08\ 0.10]^T$, $p_{o3}\approx[0.02\ 0.08\ 0.14]^T$, and $p_{o4}\approx[0.02\ 0.12\ 0.14]^T$ [m] in $\Sigma_o$. The reference body velocity commands of the target robot are given as follows:
\[
\begin{bmatrix}v_{wo,x}\\ v_{wo,z}\\ \omega_{wo,y}\end{bmatrix}=
\begin{cases}
0 & t\in[0,10)\\[1ex]
\bigl[0.025\sin\bigl(\tfrac{(t-10)\pi}{10}\bigr)\ \ 0.07\sin\bigl(\tfrac{(t-10)\pi}{10}\bigr)\ \ 0\bigr]^T & t\in[10,20)\\[1ex]
\bigl[0\ \ 0\ \ 0.4\sin\bigl(\tfrac{(t-20)\pi}{5}\bigr)\bigr]^T & t\in[20,25)\\[1ex]
0 & t\in[25,40).
\end{cases}
\]
Here, we artificially set the velocity to $0$ until 10 s and after 25 s to verify tracking and convergence in a single demonstration. We also confirmed that the observer was updated at about 270 Hz, i.e. its update period was small enough to be neglected compared with the camera sampling interval corresponding to 18 fps.

The experimental results are shown in Figures 9–13. Figures 9 and 10 respectively depict the time responses of the potential function $U$ and of the norm of the total control and estimation error $\|e_{ce}\|$. On the other hand, Figures 11–13 show the input/state behaviour focussed on the pose regulation on the 2D experimental field. We see from these figures that the present visual feedback pose regulation mechanism successfully achieves tracking of the moving target, and convergence to the desired relative pose with respect to the stationary target is almost achieved. The remaining convergence errors and the tracking delay are due to physical elements such as the actual dynamics of the robots, the friction between the wheels and the field, and the distortion of the camera image.

Figure 9. Potential function (experiment).


Figure 10. Norm of total error (experiment).


Figure 11. Velocity input command (experiment).


Figure 12. Relative position (experiment).


Figure 13. Relative orientation (experiment).


8. Conclusion

This paper presented a vision-based pose observer and its control application in the sampled-data setting induced by camera frame rates. In the convergence/tracking analysis of the proposed methodologies, we provided the relationship between the frame rates and the estimation/control gains. Specifically, we showed that the estimation and control errors are ultimately bounded by a function of the camera frame rate, the estimation/control gains, and the target object velocity, which provides guidelines for gain settings. The utility of the proposed technique was demonstrated via simulation and an experiment with real hardware.

One of our future directions is to consider robot dynamics. In this regard, previous works [Citation13,Citation16] have already presented passivity-based visual feedback pose regulation mechanisms for rigid body dynamics in the Euler-Lagrange equation form and the Newton-Euler one. We thus plan to extend the current results with the assistance of these techniques.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work is supported by the Japan Society for the Promotion of Science (JSPS) KAKENHI [grant number 21K04113].

Notes on contributors

Tatsuya Ibuki

Tatsuya Ibuki received his B.Eng., M.Eng., and Ph.D. degrees in mechanical and control engineering from Tokyo Institute of Technology in 2008, 2010, and 2013, respectively. He was a research fellow with the Japan Society for the Promotion of Science from 2012 to 2013, an assistant professor with the Department of Systems and Control Engineering, Tokyo Institute of Technology from 2013 to 2020, and a visiting scholar with the School of Electrical and Computer Engineering, Georgia Institute of Technology in 2019. Since 2020, he has been a senior assistant professor with the Department of Electronics and Bioinformatics, Meiji University. His research interests include cooperative control of robotic networks, fusion of control theory and machine learning, and vision-based estimation and control.

Satoshi Nakano

Satoshi Nakano received a B.Eng. degree from Nagoya Institute of Technology, Japan, in 2013 and the M.Eng. and Ph.D. degrees in mechanical and control engineering from Tokyo Institute of Technology, Japan, in 2015 and 2019, respectively. Since 2019, he has been an assistant professor in the department of engineering, Nagoya Institute of Technology, Japan. His research interests include nonlinear control, constrained control, and vision-based estimation and control.

Shunsuke Shigaki

Shunsuke Shigaki received his B. Eng., M. Eng., and Ph.D. degrees in mechanical and control system engineering from Tokyo Institute of Technology, Tokyo, Japan in 2013, 2015, and 2018, respectively. He was a JSPS Research Fellowship for Young Scientist (DC1) from 2015 to March 2018, an assistant professor in the division of systems research, Yokohama National University from 2018 to 2019, an assistant professor in the department of system innovation, Osaka University from 2019 to 2023, and a visiting scientist with Max Planck Institute for Chemical Ecology in 2022. He is currently an assistant professor in the principles of informatics research division, National Institute of Informatics, Japan since 2023. His research interests include bio-inspired robotics and algorithms, soft robotics, machine learning, and neuroethology.

Takeshi Hatanaka

Takeshi Hatanaka received a Ph.D. degree in applied mathematics and physics from Kyoto University in 2007. He then held faculty positions at Tokyo Institute of Technology and Osaka University. Since April 2020, he has been an associate professor at Tokyo Institute of Technology. He is the coauthor of “Passivity-Based Control and Estimation in Networked Robotics” (Springer, 2015) and the coeditor of “Economically-enabled Energy Management: Interplay between Control Engineering and Economics” (Springer Nature, 2020). His research interests include cyber-physical-human systems and networked robotics. He received the Kimura Award (2017), Pioneer Award (2014), Outstanding Book Award (2016), Control Division Conference Award (2018), Takeda Prize (2020), and Outstanding Paper Awards (2009, 2015, 2020, 2021, and 2023) all from SICE. He also received 3rd IFAC CPHS Best Research Paper Award (2020) and 10th Asian Control Conference Best Paper Prize Award (2015). He is serving/served as an AE for IEEE TSCT, Advanced Robotics, and SICE JCMSI, and is a member of the Conference Editorial Board of IEEE CSS. He is a senior member of IEEE.

Notes

1 In this equality, we use the homogeneous representation $[p_{ci}^T\ 1]^T,[p_{oi}^T\ 1]^T\in\mathbb{R}^4$.

2 Pose estimation/regulation problems with the target object velocity estimation have been tackled in [Citation16].

3 For ease of representation, we often simply use ‘0’ to denote zero vectors with appropriate dimensions.

4 This is easily shown by the properties that $\phi(e^{\hat\xi\theta})$ and $\mathrm{sk}(e^{\hat\xi\theta})^\vee$ can be rewritten as $1-\cos\theta$ and $\xi\sin\theta$, respectively, and thus $\phi(e^{\hat\xi\theta})<1$ means $|\theta|<\pi/2$. In this case, $\frac{1}{2}\|e_e\|^2\leq U_e\leq\|e_e\|^2$ also holds.

References

  • Ma Y, Soatto S, Košecká J, et al. An invitation to 3-D vision: from images to geometric models. New York (NY): Springer; 2004.
  • Hutchinson S, Hager GD, Corke PI. A tutorial on visual servo control. IEEE Trans Robot Autom. 1996 Oct;12(5):651–670. doi: 10.1109/70.538972
  • Chaumette F, Hutchinson S. Visual servo control, part I: basic approaches. IEEE Robot Autom Mag. 2006 Dec;13(4):82–90. doi: 10.1109/MRA.2006.250573
  • Chaumette F, Hutchinson S. Visual servo control, part II: advanced approaches. IEEE Robot Autom Mag. 2007 Mar;14(1):109–118. doi: 10.1109/MRA.2007.339609
  • AlBeladi A, Ripperger E, Hutchinson S, et al. Hybrid eye-in-hand/eye-to-hand image based visual servoing for soft continuum arms. IEEE Robot Autom Lett. 2022 Oct;7(4):11298–11305. doi: 10.1109/LRA.2022.3194690
  • Niu G, Yang Q, Gao Y, et al. Vision-based autonomous landing for unmanned aerial and ground vehicles cooperative systems. IEEE Robot Autom Lett. 2022 Jul;7(3):6234–6241. doi: 10.1109/LRA.2021.3101882
  • Zhou S, Miao Z, Zhao H, et al. Vision-based control of an industrial vehicle in unstructured environments. IEEE Trans Control Syst Technol. 2022 Mar;30(2):598–610. doi: 10.1109/TCST.2021.3073003
  • Guo D, Jin X, Shao D, et al. Image-based regulation of mobile robots without pose measurements. IEEE Control Syst Lett. 2022 Jan;6:2156–2161. doi: 10.1109/LCSYS.2021.3139288
  • Rotithor G, Trombetta D, Kamalapurkar R, et al. Full- and reduced-order observers for image-based depth estimation using concurrent learning. IEEE Trans Control Syst Technol. 2021 Nov;29(6):2647–2653. doi: 10.1109/TCST.2020.3036369
  • Bell ZI, Deptula P, Doucette EA, et al. Simultaneous estimation of Euclidean distances to a stationary object's features and the Euclidean trajectory of a monocular camera. IEEE Trans Autom Control. 2021 Sep;66(9):4252–4258. doi: 10.1109/TAC.2020.3035597
  • Li Y, Wang H, Xie Y, et al. Adaptive image-space regulation for robotic systems. IEEE Trans Control Syst Technol. 2021 Mar;29(2):850–857. doi: 10.1109/TCST.87
  • Keenan P, Janabi-Sharifi F, Assa A. Vision-based robotic traversal of textureless smooth surfaces. IEEE Trans Robot. 2020 Aug;36(4):1287–1306. doi: 10.1109/TRO.8860
  • Fujita M, Kawai H, Spong MW. Passivity-based dynamic visual feedback control for three dimensional target tracking: stability and L2-gain performance analysis. IEEE Trans Control Syst Technol. 2007 Jan;15(1):40–52. doi: 10.1109/TCST.2006.883236
  • Kawai H, Murao T, Fujita M. Passivity-based visual motion observer with panoramic camera for pose control. J Int Robot Syst. 2011 May;64(3–4):561–583. doi: 10.1007/s10846-011-9557-5
  • Ibuki T, Hatanaka T, Fujita M. Visual feedback pose synchronization with a generalized camera model. In: 50th IEEE Conference on Decision and Control and European Control Conference. Orlando, IEEE; 2011. p. 4999–5004.
  • Ibuki T, Hatanaka T, Fujita M. Passivity-based visual feedback pose regulation integrating a target motion model in three dimensions. SICE J Control Meas Syst Integr. 2013 Sep;6(5):322–330. doi: 10.9746/jcmsi.6.322
  • Yamauchi J, Saito M, Omainska M, et al. Cooperative visual pursuit control with learning of target motion via distributed Gaussian processes under varying visibility. SICE J Control Meas Syst Integr. 2022 Dec;15(2):228–240. doi: 10.1080/18824889.2022.2155454
  • Tabuada P. Event-triggered real-time scheduling of stabilizing control tasks. IEEE Trans Autom Control. 2007 Sep;52(9):1680–1685. doi: 10.1109/TAC.2007.904277
  • Bemporad A, Heemels M, Johansson M. Networked control systems. London: Springer; 2010.
  • Garcia E, Antsaklis PJ. Model-based event-triggered control for systems with quantization and time-varying network delays. IEEE Trans Autom Control. 2013 Feb;58(2):422–434. doi: 10.1109/TAC.2012.2211411
  • Ibuki T, Namba Y, Hatanaka T, et al. Passivity-based discrete visual motion observer taking account of camera frame rates. In: 52nd IEEE Conference on Decision and Control. Florence, IEEE; 2013. p. 7660–7665.
  • Ibuki T, Walter JR, Hatanaka T, et al. Frame rate-based discrete visual feedback pose regulation: a passivity approach. IFAC Proc Volumes. 2014;47(3):11171–11176. doi: 10.3182/20140824-6-ZA-1003.02768
  • Luenberger DG. An introduction to observers. IEEE Trans Autom Control. 1971 Dec;AC–16(6):596–602. doi: 10.1109/TAC.1971.1099826
  • Khalil HK. Nonlinear systems. 3rd ed. Upper Saddle River (NJ): Prentice Hall; 2002.