Full article: Real-time simulations of human tongue movements with a reduced order model of a non-linear dynamic biomechanical model

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Keywords:

1. Introduction

A 3D biomechanical model of the human tongue has been developed to study speech production (Hermant et al. Citation2017) and to help clinicians to predict the functional consequences of tongue cancer surgery (Buchaillard et al. Citation2007). To simulate movements in speech and in swallowing, such a Finite Element (FE) model requires a transient analysis in order to generate tongue trajectories over time in response to various patterns of muscles activations. It also requires a large deformation hyper-elastic framework to account for the complex non-linear geometrical and mechanical properties of tongue tissues. For all these reasons, any simulation of a one-second movement requires many hours of computation with a standard computer. This paper proposes to evaluate the Model Order Reduction (MOR) technique developed by ANSYS® called Dynamic ROM Builder (DRB), which is accessible in the ANSYS Twin Builder product. Contrary to other recent MOR techniques that propose real-time computations for quasi-static analysis (Lauzeral et al. Citation2019), the DRB is adapted to a transient analysis and should simulate, in real-time the non-linear dynamical behavior of the tongue model.

2. Methods

2.1. Machine learning-based MOR

Tongue deformation over time results from the time variations of m muscle commands (inputs variables) described in the model by n time varying outputs sampled at n_t regular time steps. Thus, the mechanical response of the model to muscle commands is described by the “excitation” and “output” matrices $G_{m, n_{t}}$ and $X_{n, n_{t}}$ characterizing respectively the command vectors and the output vectors at each of the n_t sampling time steps.

The objective of the MOR is to learn, from a limited number of sets ${{\hat{X}}_{n, n_{t}}, {\hat{G}}_{m, n_{t}}}$ (called “scenarios” henceforth) a functional model of the FE model. For this, the dynamical behavior of the tongue is approximated by a Reduced Order Model (ROM) defined by two equations, $\dot{X} (t) = f (X (t), G (t))$ $X (0) = X_{0},$ where f is a non-linear function which parameters must be learned by minimizing the average quadratic error E defined as $E = \sum_{1}^{N} \frac{(\frac{1}{n} \sum_{i = 1}^{n} (\frac{1}{n_{t}} \sum_{j = 1}^{n_{t}} {(X_{i, j} - \hat{X_{i, j}})}^{2}))}{N},$ with N being the number of scenarios.

The function f is implemented as a neural network similar to the one used in Siegelmann et al. (Citation1997). A standard optimization method such as gradient descent is used in the iterative optimization process of f, which stops when error E goes below a predefined threshold. If this threshold cannot be reached, a “free variable” i(t) is iteratively added to the model in the form of an additional cell in the inputs and in the outputs of the neural network. These free variables are the way to account for high order non-linearities over time without increasing the depth of the neural network, avoiding thus the “vanishing gradient” problem.

2.2. Biomechanical model of the human tongue

The tongue model is described in Hermant et al. (Citation2017). It is based on a FE mesh consisting of 7763 nodes and 8780 hexahedral elements. Mechanical properties of the tongue soft tissue are accounted for with a 2-parameter Mooney-Rivlin constitutive law ( $C_{10} = 192 P a;$ $C_{20} = 90 P a$ ). Tongue viscosity is approximated with a Rayleigh model ( $α = 20;$ $β = 0.0$ ). Tongue quasi-incompressibility is modelled with a Poisson ratio v of 0.4999 and “no-displacement boundaries conditions” are defined on the external nodes in contact with the jaw and on the lowest boundary of the mouth floor.

2.3. Learning and evaluation data

We focused on tongue movements resulting from the activations of two muscles, the Styloglossus (SG) and the Genioglossus Posterior (GG-P), which have been shown to be important for speech production and swallowing. In all the simulations the excitation associated with each muscle is an activation that lasts 0.35 s, with an amplitude that varies from 0 to α ∈ [0.2; 1.0] and with a transition time t_α ∈ [0.05 s; 0.11 s].

For the learning of the ROM, 20 scenarios are used for the SG alone and 100 for the co-activations SG + GG-P, in which α varies independently for each muscle. The evaluation of the ROM is then conducted with 20 and 32 scenarios, respectively.

3. Results and discussion

shows the mean RMS Error and standard deviation of the prediction of tongue trajectories made with the ROM computed over all the evaluation scenarios on the whole duration of the trajectories of the 1861 nodes of the tongue surface. It shows that the MOR algorithm can predict very accurately the movement due to single muscle activations but is less precise with two muscles. This seems coherent because tongue movement is known to be more complex when two muscles are activated (high non-linearities). This is confirmed by looking at the sagittal trajectories of some FE nodes located at the tip, the top and the back of the tongue during SG + GG-P activation ().

Figure 1. Sagittal displacement of nodes along the horizontal and vertical axes during SG + GG-P activations. (dotted line: ROM, solid line: ground-truth: full model).

Table 1. Mean RMS error and standard deviations of the predictions of the ROM. (cf. Trajectories in ).

Download CSV Display Table

superimposes the trajectories generated by the ROM and by the full model. To better analyse where most of the errors of the ROM are located, plots the 3 D error map all around the tongue surface. The highest errors are located around the tip and dorsum. This is not surprising since these regions are the most affected ones by SG and GGP activations. This is consistent with the difference in complexity of the ROM when one or two muscles are activated. For single activation, no free variable is needed whereas one free variable is added for double activation. As could be expected on a theoretical level, more non-linearity induces more complexity in the ROM.

Figure 2. 3 D error map of the ROM for SG + GG-P.(Tongue tip on the left – Back on the right).

4. Conclusions

The ANSYS MOR algorithm has been used to approximate a non-linear FE model of the human tongue, whose modeling of behavior and fast movements requires transient FE analyses.

ROM have been constructed for a “simple” case (SG activation) and for a “more complex” case (SG + GG-P). Model comparison shows the necessity to add a free variable to integrate higher non-linearities. This shows the robustness of the method for various levels of complexity. Next challenge is to be able to predict tongue movement during contact situations.

Additional information

Funding

This work was supported by the ANR (ANR-11-LABX-0004) and MIAI @ Grenoble Alpes, (ANR-19-P3IA-0003).

References

Buchaillard S, Brix M, Perrier P, Payan Y. 2007. Simulations of the consequences of tongue surgery on tongue mobility: implications for speech production in post-surgery conditions. Int J Med Robot. 3(3):252–261.
PubMed Web of Science ®Google Scholar
Hermant N, Perrier P, Yohan P. 2017. Human tongue biomechanical modeling. In: Payan Y, Ohayon J, editors. Biomechanics of living organs: hyperelastic constitutive laws for finite element modeling. Elsevier; Chap. 19; p. 395–411.
Google Scholar
Lauzeral N, Borzacchiello D, Kugler M, George D, Rémond Y, Hostettler A, Chinesta F. 2019. A model order reduction approach to create patient-specific mechanical models of human liver in computational medicine applications. Comput Methods Programs Biomed. 170:95–106.
PubMed Web of Science ®Google Scholar
Siegelmann HT, Horne BG, Giles CL. 1997. Computational capabilities of recurrent NARX neural networks. IEEE Trans Syst Man Cybern B: Cybern. 27(2):208–215.
PubMed Web of Science ®Google Scholar

Real-time simulations of human tongue movements with a reduced order model of a non-linear dynamic biomechanical model

1. Introduction

2. Methods

2.1. Machine learning-based MOR

2.2. Biomechanical model of the human tongue

2.3. Learning and evaluation data

3. Results and discussion

Table 1. Mean RMS error and standard deviations of the predictions of the ROM. (cf. Trajectories in ).

4. Conclusions

References

Information for

Open access

Opportunities

Help and information

Real-time simulations of human tongue movements with a reduced order model of a non-linear dynamic biomechanical model

1. Introduction

2. Methods

2.1. Machine learning-based MOR

2.2. Biomechanical model of the human tongue

2.3. Learning and evaluation data

3. Results and discussion

Table 1. Mean RMS error and standard deviations of the predictions of the ROM. (cf. Trajectories in Figure 1).

4. Conclusions

Additional information

Funding

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

Table 1. Mean RMS error and standard deviations of the predictions of the ROM. (cf. Trajectories in ).