Full article: Variational assimilation of surface wave data for bathymetry reconstruction. Part I: algorithm and test cases

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

Accurate mapping of ocean bathymetry is needed for effective modelling of ocean dynamics, such as tsunami prediction. Available bathymetry data does not always provide the resolution to model such nonlinear waves accurately, and collection of accurate data is logistically challenging. As an alternative, in this study we develop and evaluate a variational data assimilation scheme for the one-dimensional nonlinear shallow water equations that estimates bathymetry using a finite set of observations of surface wave height. We demonstrate that convergence to exact bathymetry is improved by including more observation locations and by implementing a low-pass filter in the data assimilation algorithm to remove small-scale noise. A necessary condition for convergence of the bathymetry reconstruction is that the amplitude of the initial conditions is less than 1% of the bathymetry height. We use density-based global sensitivity analysis (GSA) to assess the sensitivity of the surface wave and reconstruction error to model parameters. By demonstrating low sensitivity of the surface wave to the reconstruction error, we show that reconstructing the bathymetry with a relative error of about 10% is sufficiently accurate for surface wave modelling in most cases. These results can be used to guide the development of similar assimilation schemes in higher dimensions and more realistic geometries.

Keywords:

1. Introduction

The process of mapping ocean bathymetry from measurements of surface waves is ill-posed: it is characterized by sensitivity to small amounts of noise in the system and is susceptible to the instability inherent in the inversion process (Ozisik and Orlande, Citation2000). Data assimilation is one such inversion process, where observations of a true state are combined with a mathematical model in order to recover missing data governing the system evolution. The verification of a variational adjoint-based scheme of the kind developed in this paper is challenging, especially since there is no general analytical solution available for the nonlinear shallow water equations, even in the case of flat bathymetry. Despite this, accurate representation of bathymetry data is essential when predicting tsunamis, since bathymetry variation modifies speed, direction, and stability of the propagating wave (Craig and Sulem, Citation2009). Bathymetry data can be either static or dynamic, where the latter includes shifts in the ocean floor due to seismic activity.

The high number of degrees of freedom in this problem makes it difficult to determine the criteria for optimal bathymetry reconstruction. We must consider the shallow water system governing tsunami propagation, as well as parameters such as the amplitude and shape of the bathymetry, and the initial conditions of the surface waves. This is in addition to calibration of the optimization scheme, and deriving optimal configurations of the observation network.

In this study we address two complementary questions:

How accurately can bathymetry data be reconstructed from surface wave measurements, and what determines the accuracy?
How accurate does the bathymetry data need to be in order to model the surface waves accurately?

We attempt to address these questions for an idealised 1-D case, as a first step towards more realistic models. We quantify the key relationships between the initial conditions and the bathymetry amplitude relative to the average fluid depth. We also analyse the effect of the number and locations of the observations on the convergence properties of the assimilation. We implement a Sobolev gradient smoothing technique (effectively a low-pass filter) within our optimization scheme, and illustrate its ability to reduce the small-scale noise present in the bathymetry reconstruction. We then investigate the consequences of errors in the bathymetry data on the resulting surface wave by observing trends in the surface wave propagation error as the amplitude of the initial condition, the amplitude of the bathymetry, and the number of observation points is varied.

Section 2 provides a review of the effect of bathymetry on surface waves, and efforts to map ocean bathymetry, highlighting empirical, numerical and theoretical approaches. In Sec. 3, we provide a concise overview of the shallow water system and derive the first order adjoint data assimilation scheme using optimal control theory, and summarise the algorithm. Section 4 presents preliminary results for different choices of initial conditions and exact bathymetry for the data assimilation scheme. This investigation reveals small-scale noise in the optimal reconstruction. To reduce this noise, Sec. 5 proposes a low pass filter, which effectively removes higher frequencies in our reconstruction by increasing the regularity of our estimate at each iteration, raising it from the space $L^{2} (R)$ to $H^{2} (R) .$ We discuss results of the smoothed optimisation scheme and illustrate the removal of noise in the reconstructed bathymetry in several test cases. Section 6 analyses the relationships between the amplitudes of the initial conditions and bathymetry relative to the average depth, and attempts to formulate a relationship summarising certain necessary conditions for convergence. We also analyse the effect of number of observation points on the optimal reconstruction. Finally, we provide a sensitivity analysis of the surface wave to errors in the bathymetry reconstruction.

We observe that the low pass filter effectively reduces small-scale noise in the bathymetry reconstruction for different bathymetry shapes. Additionally, a necessary condition for convergence is that the amplitude of the initial conditions be at least two orders of magnitudes smaller than the amplitude of the bathymetry. Convergence is significantly improved by increasing the number of observation points, however this is not a sufficient condition for convergence. Finally, we show qualitatively that the surface wave exhibits low sensitivity to bathymetry reconstruction error, motivating further insight into the sensitivity of the reconstruction error and resulting surface wave error to model parameters. In Sec. 7 we use global sensitivity analysis (GSA) to derive sensitivity indices quantifying the influence of (i) bathymetry position relative to the observations, and (ii) the amplitudes of the initial condition and bathymetry, on the error in bathymetry reconstruction and the surface wave. We focus our analysis on a localised surface wave propagating over a compact bathymetry. Using density-based GSA, we quantify and subsequently rank the influence of these input factors on the bathymetry and surface wave errors, respectively.

We summarise our conclusions in Sec. 8, and provide insights that motivate further analyses for the full 2-D model and more realistic settings for tsunami models.

2. Review of bathymetry effects and previous work

Bathymetry can have a significant impact on propagating shallow water waves, by altering the depth-dependent wave-speed $c = \sqrt{g h (x, t)},$ where $h (x, t) = H + η (x, t) - β (x)$ is the total fluid depth as a function of mean depth H, free surface perturbation $η (x, t)$ bathymetry $β (x) .$ A demonstration of bathymetry effects is given in , where we show the surface wave with and without bathymetry. The speed and height of the surface wave are modified by the bathymetry. This is because a tsunami’s energy flux remains relatively constant, and so as the tsunami’s speed varies, so does its height (shoaling). In two dimensions the direction of propagation is also changed, since the characteristics are no longer straight lines. Thus bathymetry can significantly impact the arrival time of tsunami waves, and coastal communities can receive different amounts of tsunami energy based on local bathymetry effects. An accurate and detailed map of the bathymetry is therefore an essential component of accurate tsunami models.

Fig. 1. (a–c) Three test cases for bathymetry $β (x)$ and free surface perturbation initial conditions $η (x, 0)$ for the data assimilation scheme. The green circles represent the locations of the observations, with N_obs = 5 and $y_{1}^{(o)} = 0.1 L .$ Note that while the spatial distribution is correct, amplitude of the initial conditions $\hat{η},$ amplitude of the bathymetry $\hat{β},$ and average depth H are not to scale in these diagrams, as $\hat{η}$ was restricted to 1% of $\hat{β}$ across most of the numerical tests. Plots (d–f) show the propagating free surface wave at t = 1.95 with flat bottom (blue) and bathymetry (red) for each Cases I, II and III, respectively, to highlight the effect of bathymetry on surface wave propagation.

Attempts to create an accurate map of oceanic bathymetry have been made by direct measurements, and indirectly by using information from propagating surface waves. Direct measurement includes platforms like ship-based high frequency sonar. However many of these methods are often either too costly or have poor spatial resolution. Often it is easier to measure waves propagating on the free surface, and use this information to create a map of the bottom topography from classical wave theory. This is an inverse problem. With improvements in high resolution satellite imagery in recent years, there is increasing interest in using inverse methods to provide accurate reconstructions of bathymetry information.

Wunsch (Citation1996) provides a detailed review of inverse methods for ocean circulation models, describing methods for both deep and shallow water. In coastal regions, depth-inversion methods have been refined to account for observational data in areas with large interference from human activities and muddy water (Ge et al., Citation2020). Additionally, inverse methods are also routinely used in open channel flow modelling, where bed topography is approximated using surface measurements (Gessese and Sellier, Citation2012).

A longstanding approach to solving this inverse problem uses the dispersion relation of surface waves. Earlier works such Lubard et al. (Citation1980) used measurements of the frequency-wavenumber spectrum made via optical images, obtained using cameras mounted on an oceanographic research tower. Since then, various methods using dispersion relations have been investigated, where bathymetry is measured by fitting the theoretical dispersion relation for gravity waves (where depth is a system parameter), and derived using inversion formulas.

More recent works include Dugan (Citation1997), extended by Piotrowski and Dugan (Citation2002), where image sequences of shoaling ocean waves taken from an aircraft are used to retrieve maps of water depth via the linear dispersion relation. The accuracy of this method can be as high as 5% if the waves are reasonably linear. However, Grilli (Citation1998) builds on Dugan (Citation1997), arguing that the latter is limited due to neglect of amplitude dispersion effects, which accumulate through increasing nonlinearity as waves approach breaking in shallow water. He compares the linear frequency dispersion to a third order polynomial relationship between wave speed c as a function of wavenumber k and depth h, showing that due to amplitude dispersion effects, linear wave theory may greatly under estimate c, and lead to poorer estimates of bathymetry inversion formulas based on a linear dispersion relation.

These inversion algorithms are calibrated based on results of simulated periodic waves over mild slopes in a two-dimensional ‘numerical wave tank’. This fully nonlinear based on potential flow (FNPF) numerical wave tank methodology was developed by Grilli and Subramanya (Citation1996) with wave generation and absorption methods, to calculate speed and height variation for a number of shoaling waves over slopes ranging from $1 : 35$ to $1 : 70,$ as shown in Grilli (Citation1998). It was demonstrated in Grilli (Citation1998) to have higher accuracy in coastal simulations than the nonlinear shallow water equations.

Tsai and Yue (Citation1996) also demonstrated how FNPF numerical tanks allow calculation of ‘numerically exact’ properties of shoaling wave up to breaking point, and can provide accurate representation of surface waves independent of nonlinearity parameters.

While these methods are based on an empirical formula for the 2-D nonlinear inversion problem in idealised cases, Nicholls and Taber (Citation2009) derive an inversion formula for bathymetry analytically, using the nonlinearity of the 1-D governing Euler equations for ideal fluid flow to detect bathymetry information. The governing equations for the surface wave are expressed as a Hamiltonian system, and a Dirichlet-to-Neumann operator (DNO) is applied to the system in order to remove some implicit dependencies. The result is a single equation of the wave height at the surface in terms of the bathymetry, and subsequently an inversion is derived. However, because their inversion formula is linear, they are required to assume a small amplitude for the bathymetry and their numerical results only consider normalised bathymetry amplitudes of 0.07 and 0.025.

Additionally, surface waves can be also used to characterise a rapid dynamic change in bathymetry. Jang et al. (Citation2010) take a similar approach to Grilli (Citation1998) to the related problem of measuring a sudden shift in the sea floor, e.g. due to seismic activity, using measurements of the surface waves. Their inversion formula, however, is based on the same approach as Dugan (Citation1997): using the linear dispersion relation with bathymetry as a parameter, and using transforms to show that the problem becomes one of solving an integral equation involving the known surface wave data. The uniqueness of this solution is demonstrated. However, analysis shows that there is a lack of stability in the measurement of the bottom displacement, and a question as to whether it depends continuously on the wave elevation. They overcome this using regularisation methods iteratively as a stabilisation technique, and show numerical convergence to the integral solution.

Each of the aforementioned methodologies has its strengths and weaknesses. Piotrowski and Dugan (Citation2002) and Grilli (Citation1998) both discuss practical measurement techniques of surface waves, whereas theoretical approaches such as Nicholls and Taber (Citation2009), and Jang et al. (Citation2010) show high degree of convergence, but assume full knowledge of the surface wave, and do not address the complexities involved in obtaining accurate measurements in a real-world scenario, such as noise or incomplete measurements. Nicholls and Taber (Citation2009) state that their future objectives aim to find an effective way of extracting wave fields from full observational data.

In summary, while the theoretical results from such models are promising, their applicability to real-world measurements has not been established. In the present study we attempt to combine theoretical results with realistic assumptions. We consider 1-D geometry as a first step to validate the basic approach and investigate fundamental questions. Assuming a finite set of observations, we analyse the effects of different bathymetry features on convergence, as well as the effect of different amplitudes and shape of the initial condition. These effects have not been considered in detail in the reviewed works.

Another issue is the difference between the inversion formulas derived for linear and nonlinear systems. Grilli (Citation1998), and Nicholls and Taber (Citation2009) give results accounting for nonlinearity and the resulting dispersion effects, whereas Piotrowski and Dugan (Citation2002), and Jang et al. (Citation2010) are restricted to linear dispersion relations for gravity waves. However, there remains the question of whether the empirical formulation of the inversion as derived by Grilli (Citation1998) is as rigorous as the analytically derived solution of Nicholls and Taber (Citation2009). Both assume periodic waves, but the practical limitations on the accuracy of the free surface data make it difficult to assess the relative efficacy of these two methods. Ultimately, a complete evaluation of these approaches will depend on research which systematically compares inversion formulas based on the nonlinear governing equations for free surface wave propagation, as well as addresses the practical issues of collecting realistic field wave data.

In the present study, we approach the bathymetry estimation problem from a variational data assimilation perspective, with the goal of formulating an algorithm for the 1-D nonlinear shallow water system that predicts bathymetry from a small set of observations. We neglect rotational Coriolis effects and noise and assume static bathymetry with lateral boundary conditions. Our approach does not involve inversion of the dispersion relation. Instead, we formulate an optimisation problem, seeking to minimise the error between observations and forecasts of the free surface wave. The analytical derivation of the variational algorithm differs from the empirical techniques introduced by Grilli (Citation1998) and is derived for the infinite-dimensional case. The use of sparse observations aims to provide a relatively more realistic set of assumptions than Nicholls and Taber (Citation2009), who require complete observations of surface wave fields.

We use the conclusions of this idealised case to better understand the role of model parameters and the observation operator on bathymetry reconstruction. Our goal is to take the qualitative results observed here, and use them as a benchmark for more rigorous sensitivity analyses based on second order adjoint and GSA approaches. Consequently, we focus on highlighting results and techniques than can improve existing methods for accurate tsunami prediction.

3. Derivation of the adjoint based data assimilation scheme

The nonlinear shallow water equations (SWE) are a coupled system of equations for travelling free surface waves. They are derived from the two-dimensional Euler equations, under the assumption that the wavelength λ of free surface waves is much larger than the total ocean depth h, allowing us to average the Euler equations over the vertical dimension. The fluid column height becomes $h = H + η (x, t) - β (x),$ where H is the average depth, η is the perturbation of the free surface, and β is time-independent sea floor perturbation from zero, i.e. the ‘bathymetry’. Appropriate vertical averaging gives the irrotational, constant density incompressible one-dimensional nonlinear shallow water equations, (3.1a) $\frac{\partial η}{\partial t} + \frac{\partial}{\partial x} ((H + η - β) u) = 0,$ (3.1a) (3.1b) $\frac{\partial u}{\partial t} + \frac{\partial}{\partial x} (\frac{1}{2} u^{2} + g η) = 0,$ (3.1b) (3.1c) $η (x, 0) = ϕ (x),$ (3.1c) (3.1d) $u (x, 0) = 0.$ (3.1d)

(Note that in the absence of the Coriolis effect the nonlinear waves in a shallow water system are non-dispersive.) We assume that the initial conditions $ϕ (x)$ are compactly supported, and that the boundary conditions are periodic on some domain $Ω = {x; x \in [- L, L]} .$ Our objective is to implement a variational data assimilation scheme constrained by Equation(3.1)(3.1a) $\frac{\partial η}{\partial t} + \frac{\partial}{\partial x} ((H + η - β) u) = 0,$ (3.1a) in order to estimate the bathymetry $β (x) .$ We wish to derive an optimal estimate of the bottom topography using a finite number of observations of the free surface perturbation, for all times $t \in [0, T] .$ To simplify further, we normalise the system (3.1) by the average height H and the gravitational acceleration g such that the wave speed $c = \sqrt{g H} = 1 .$

We quantify our objective as the partial differential equation (PDE) constrained minimisation of some cost function $J,$ (3.2) $J (β) = \frac{1}{2} \int_{0}^{T} \sum_{i = 1}^{M} [η^{(f)} (x_{j}, t; β) - y_{j}^{(o)} (t)]^{2} d t,$ (3.2) where $y_{j}^{(o)} (t)$ are the observations of the true free surface perturbations taken at positions $x_{j}, j = 1, \dots, N_{obs},$ and $η^{(f)} (x_{j}, t; β)$ is the solution of our system at $x_{j}$ generated by the bathymetry β. We define the optimal bathymetry $β^{(b)}$ as (3.3) $β^{(b)} = {argmin}_{β \in L^{2} (Ω)} J (β) .$ (3.3)

This is equivalent to solving the gradient minimization problem (3.4) $\nabla^{L^{2}} J (β^{(b)}) = 0.$ (3.4)

As direct computation of this optimization problem is too computationally expensive, we formulate an algorithm that allows us to extract an explicit expression for the gradient $\nabla^{L^{2}} J (β^{(b)}),$ and subsequently find the minimiser $β^{(b)}$ more efficiently. As a first step, we formulate the first variation of $J,$ given some arbitrary perturbation $β'$ of size ε. This is given by the Gâteaux derivative, (3.5) $\begin{matrix} J' (β; β') = \lim_{ε \to 0} \frac{J (β + ε β') - J (β)}{ε} \\ = \frac{d}{d ε} J (β + ε β') |_{ε = 0} \end{matrix}$ (3.5)

Using a perturbation expansion in ε about $ε = 0$ on the term $J (β + ε β')$ and truncating to $O (ε),$ we can reformulate Equation(3.5)(3.5) $\begin{matrix} J' (β; β') = \lim_{ε \to 0} \frac{J (β + ε β') - J (β)}{ε} \\ = \frac{d}{d ε} J (β + ε β') |_{ε = 0} \end{matrix}$ (3.5) as (3.6) $J' (β; β') = - \int_{0}^{T} (η^{(f)} (x_{j}, t; β) - y^{(o)} (t)) η' d t,$ (3.6) where $(η', u')$ are the solutions of the perturbed system of Equation(3.1)(3.1a) $\frac{\partial η}{\partial t} + \frac{\partial}{\partial x} ((H + η - β) u) = 0,$ (3.1a) brought about by the perturbation $β \to ε β',$ (3.7) $u \to u + ε u',$ (3.7) (3.8) $η \to η + ε η',$ (3.8) and by subsequently extracting the $O (ε)$ system.

As the Gâteaux derivative is a directional derivative in the direction of the perturbation $β',$ we can express (3.6) as the inner product between $\nabla J$ and $β',$ (3.9) $J' (β; β') = {〈 \nabla J, β' 〉}_{L^{2} (Ω)} = \int_{- L}^{L} \nabla^{L_{2}} J β' d x .$ (3.9)

Then the following forms of $J (β; β')$ are equivalent, (3.10) $J' (β; β') = - \int_{0}^{T} (η^{(f)} (x_{j}, t; β) - y^{(o)} (t)) η' d t = \int_{- L}^{L} \nabla^{L^{2}} J β' d t .$ (3.10)

Therefore, finding an equivalent expression for the last term in Equation(3.10)(3.10) $J' (β; β') = - \int_{0}^{T} (η^{(f)} (x_{j}, t; β) - y^{(o)} (t)) η' d t = \int_{- L}^{L} \nabla^{L^{2}} J β' d t .$ (3.10) will enable us to extract an expression for $\nabla^{L^{2}} J .$ This is achieved by formulating the Lagrangian associated with the linearised system for $(η', u')$ with some arbitrary adjoint variables $(η^{*}, u^{*}),$ (3.11) $\int_{0}^{T} \int_{- L}^{L} η^{*} (x, t) [\frac{\partial η'}{\partial t} + \frac{\partial}{\partial x} ((η' - β') u + (η + 1 - β) u')] + u^{*} (x, t) [\frac{\partial u'}{\partial t} + \frac{\partial}{\partial x} (η' + u u')] d x d t = 0.$ (3.11)

Integrating by parts in time and space reduces Equation(3.11)(3.11) $\int_{0}^{T} \int_{- L}^{L} η^{*} (x, t) [\frac{\partial η'}{\partial t} + \frac{\partial}{\partial x} ((η' - β') u + (η + 1 - β) u')] + u^{*} (x, t) [\frac{\partial u'}{\partial t} + \frac{\partial}{\partial x} (η' + u u')] d x d t = 0.$ (3.11) to (3.12) $\begin{matrix} 0 = - \int_{0}^{T} \int_{0}^{L} η' {\frac{\partial η^{*}}{\partial t} + u \frac{\partial η^{*}}{\partial x} + \frac{\partial u^{*}}{\partial x}} + u' {\frac{\partial u^{*}}{\partial t} + (η + 1 - β) \frac{\partial η^{*}}{\partial x} + u \frac{\partial u^{*}}{\partial x}} - β' u \frac{\partial η^{*}}{\partial x} d x d t \\ + \int_{0}^{T} η^{*} [(η' - β') u + (η + 1 - β) u'] |_{- L}^{L} d t + \int_{0}^{T} u^{*} [η' + u u'] |_{- L}^{L} d t \\ + \int_{- L}^{L} η^{*} η' |_{t = T} d x - \int_{- L}^{L} η^{*} η' |_{t = 0} d x \\ + \int_{- L}^{L} u^{*} u' |_{t = T} d x - \int_{- L}^{L} u^{*} u' |_{t = 0} d x . \end{matrix}$ (3.12)

Due to periodicity, the boundary terms vanish. If we choose the adjoint variables $(η^{*}, u^{*})$ as the solution to (3.13a) $\frac{\partial η^{*}}{\partial t} + u \frac{\partial η^{*}}{\partial x} + \frac{\partial u^{*}}{\partial x} = \sum_{j = 1}^{M} [η^{(f)} (x_{j}, t; β) - y^{(o)} (t)] δ (x - x_{j}),$ (3.13a) (3.13b) $\frac{\partial u^{*}}{\partial t} + (1 + η - β) \frac{\partial η^{*}}{\partial x} + u \frac{\partial u^{*}}{\partial x} = 0,$ (3.13b) (3.13c) $η^{*} (x, T) = 0,$ (3.13c) (3.13d) $u^{*} (x, T) = 0,$ (3.13d)

(where the Dirac delta term in Equation3.13a(3.13a) $\frac{\partial η^{*}}{\partial t} + u \frac{\partial η^{*}}{\partial x} + \frac{\partial u^{*}}{\partial x} = \sum_{j = 1}^{M} [η^{(f)} (x_{j}, t; β) - y^{(o)} (t)] δ (x - x_{j}),$ (3.13a) is the observation operator mapping η onto the observation space), then (3.12) is reduced to (3.14) $\int_{0}^{T} \int_{0}^{L} (η^{(f)} (x_{j}, t; β) - y^{(o)} (t)) η' d x d t = \int_{0}^{T} \int_{0}^{L} β' u \frac{\partial η^{*}}{\partial x} d x d t .$ (3.14)

We refer to the system Equation(3.13)(3.13a) $\frac{\partial η^{*}}{\partial t} + u \frac{\partial η^{*}}{\partial x} + \frac{\partial u^{*}}{\partial x} = \sum_{j = 1}^{M} [η^{(f)} (x_{j}, t; β) - y^{(o)} (t)] δ (x - x_{j}),$ (3.13a) as the adjoint equations. We note that the normalised mean depth H = 1 is preserved in the adjoint system. Combining Equation(3.14)(3.14) $\int_{0}^{T} \int_{0}^{L} (η^{(f)} (x_{j}, t; β) - y^{(o)} (t)) η' d x d t = \int_{0}^{T} \int_{0}^{L} β' u \frac{\partial η^{*}}{\partial x} d x d t .$ (3.14) with the equivalence given by (3.10), we have (3.15) $\int_{- L}^{L} \int_{0}^{T} u \frac{\partial η^{*}}{\partial x} β' d t d x = \int_{- L}^{L} \nabla^{L^{2}} J β' d x,$ (3.15) and thus, since our functional is linear and bounded and belongs to the space of square-integrable functions, we can use the Riesz representation theorem to extract $\nabla^{L_{2}} J,$ giving (3.16) $\nabla^{L^{2}} J = \int_{0}^{T} u \frac{\partial η^{*}}{\partial x} d t .$ (3.16)

Losch and Wunsch (Citation2003) use a similar adjoint based minimisation for their bathymetry detection analysis, however they do not consider the infinite-dimensional case as we have here. The benefits of our approach is that it is independent of the discretisation used in its numerical implementation.

To verify that our formulation for $\nabla^{L^{2}} J$ is correct, we define the Kappa test (3.17) $κ (ε) = \lim_{ε \to 0} \frac{1}{ε} \frac{J (β + ε β') - J (β)}{{〈 \nabla^{L^{2}} J, β' 〉}_{L^{2} (Ω)}},$ (3.17) where $κ (ε)$ is the quotient of the two equivalent forms for the variation $J (β; β')$ we used in the above derivation. Given some perturbation $β',$ if we have correctly defined $\nabla^{L^{2}} J,$ then as $ε \to 0,$ we should see $κ (ε) \to 1 .$ The results of the kappa test for different cases are presented in Secs. 4 and 5.

The minimiser $β^{(b)}$ yielding $\nabla^{L^{2}} J = 0,$ is computed using a steepest descent algorithm given some starting guess $β^{(g)} .$ Using a line minimisation algorithm to find the optimal step size at each iteration, this can be summarised as (3.18) $β^{(n + 1)} = β^{(n)} - τ_{n} \nabla^{L_{2}} J (β^{(n)})$ (3.18) where the optimal step-size τ_n can be found at each iteration using the minimisation algorithm (3.19) $τ_{n} = {argmin}_{τ \in R} J (β^{(n)} (x) - τ \nabla^{L_{2}} J (β^{(n)} (x))) .$ (3.19)

The optimal bathymetry reconstruction $β^{(b)}$ is the fixed point of this iterative scheme. The steps for the process are outlined in Algorithm 1.

Algorithm 1.

Data Assimilation Algorithm for Bathymetry Estimation

1: Pick an initial estimate for $β^{(g)} .$

2: Solve the initial value problem for $(u, η)$ from t = 0 to t = T.

3: Solve the adjoint problem for $(u^{*}, η^{*})$ backwards in time from t = T to t = 0 to find $η^{*} (x, t) .$

4: Approximate $\int_{0}^{T} u \frac{\partial η^{*}}{\partial x} d t$ at every point in spatial domain Ω.

5: Define $\nabla^{L^{2}} J = \int_{0}^{T} u \frac{\partial η^{*}}{\partial x} d t .$

6: Compute the optimal time step τ_n through a line minimisation algorithm.

7: Use a gradient descent algorithm to compute $β^{(n + 1)} (x) = β^{(n)} (x) - τ_{n} \nabla^{L^{2}} J (β^{(n)} (x)) .$

8: Repeat until $∥ \nabla^{L^{2}} J ∥ < ε$ for some small ε ( $∥ \int_{0}^{T} u \frac{\partial η^{*}}{\partial x} d t ∥ \approx 0$ ).

9: Set $β^{(b)} (x) : = β^{(n)} (x) .$

4. Initial results using L² gradients

To verify the numerical implementation of Algorithm 1, we consider three cases, characterised by different true bathymetries $β^{(t)} (x)$ and initial conditions $ϕ (x) .$ These cases are summarized in , and shown in . They were chosen to analyse convergence in scenarios where the support of $ϕ (x)$ and the support of $β^{(t)} (x)$ overlap or are disjoint. Additionally, we want to evaluate the effect of a surface wave with compactly supported initial conditions (Cases I and II) or periodic initial conditions (Case III). We consider Gaussian and sandbar profiles for the bathymetry, similar to CitationNicholls and Taber (2009), as a 1-D approximation for peaks and ridges characterising ocean bathymetry. The main application of this study is tsunami modelling given some optimal reconstruction of missing bathymetry data, hence we are primarily interested in a non-periodic propagating surface wave, as in Case I and Case II. However, including the periodic initial conditions Case III in our analysis helps us understand the effects of the observation operator and model parameters on the optimal reconstruction.

Table 1. Notation used in the derivation of data assimilation scheme of the SWE to find the optimal bathymetry, using same format as given in Table 1 of Kevlahan et al. (Citation2019).

Display Table

Table 2. Cases considered for data assimilation Algorithm 1.

Download CSV Display Table

We implement these schemes using a standard second order finite difference-finite volume approximation on a staggered grid in space, and a four stage third order Runge–Kutta scheme (Spiteri and Ruuth, Citation2002) in time. The resolution of the spatial grid is N = 512, and the spatial domain is $Ω = {x \in R; - L \leq x \leq L} .$ Velocity u(x, t) is located at cell edges and the bathymetry $β (x)$ and free surface perturbation $η (x, t)$ are located at cell nodes. Periodic boundary conditions are imposed at x = L and $x = - L$ where L = 3. We assume we have no background information for bathymetry a priori, and set $β^{(g)} (x) = 0 .$

The system is integrated for $t \in [0, T] .$ In Kevlahan et al. (Citation2019) the variational data assimilation system for the initial conditions reconstruction is integrated for $t \in [0, T],$ where the final time T = 2 is chosen such that the free surface wave does not reach the boundary. This was appropriate when our objective was to reconstruct the initial conditions of a tsunami wave in real time. For bathymetry assimilation, as we have no constraints on the assimilation time, the final time is chosen to be $T = 2 L,$ where boundary effects were present. Periodic boundary conditions are artificial in the sense that they are not a substitute for realistic ocean conditions and are not intended to simulate coastlines. However they are appropriate in our idealised 1-D case as they simplify the dynamics, and do not violate underlying principles. In our case we restrict the choice of free surface wave to (i) a travelling Gaussian, and (ii) a sinusoidal wave. The latter is periodic and is not significantly altered by the boundary conditions. For the Gaussian initial conditions, the wave is effectively reflected at the boundary, and consequently is a wave travelling in the opposite direction. We observed that there were no errors generated at the boundaries that propagated inside the domain and impacted results. The main conclusions of this study were not altered by increasing the assimilation time from T = 2 to $T = 2 L,$ however convergence of the reconstructed bathymetry improved with the longer assimilation time.

The results in illustrate the convergence of the data assimilation scheme for each case outlined in , as well as the convergence of the kappa test (3.17). In all cases we consider a relative L² error of less than 10% as the threshold for ‘converged’ bathymetry. Let us first consider the convergence of the cost function Equation(3.2)(3.2) $J (β) = \frac{1}{2} \int_{0}^{T} \sum_{i = 1}^{M} [η^{(f)} (x_{j}, t; β) - y_{j}^{(o)} (t)]^{2} d t,$ (3.2) . Ultimately, the purpose of the optimization scheme is to minimise the error between the observations of the true height perturbation $y^{(o)} (t)$ and the approximate solution $η^{(f)}$ given the optimal bathymetry $β^{(b)} .$ Minimising the cost function is necessary for accurate reconstruction of the bathymetry. However, due to the ill-posed nature of the problem, convergence of the cost function is not sufficient to guarantee accurate bathymetry reconstruction. Indeed, the results highlight the difference between these two objectives. We see in , that the relative decrease in the cost function over 500 iterations of Algorithm 1 is greatest for Case I, at $O (10^{- 6}),$ and thus the assimilation algorithm converges successfully for this case. In contrast, the relative decrease in the cost function for Cases II and III is not less than $O (10^{- 2})$ and $O (10^{- 4}),$ respectively. However, the errors in the reconstructed bathymetry $β^{(b)}$ corresponding to these cost functions in have very different behaviour.

Fig. 2. Results for iterative data assimilation scheme outlined in Algorithm 1, with $\nabla J \in L^{2} (Ω) .$ Only in Case I do we consider that the assimilation has reconstructed the bathymetry with sufficient accuracy ( $< 10 %$ relative error). (a) Convergence of the kappa test for the three cases. (b) Relative reduction in the cost function after 500 iterations. (c) Relative error in the reconstructed bathymetry. (d–f) Optimal reconstructed bathymetry for each case. We observe noise in the reconstruction for each case, especially in Case II.

Fig. 2. Results for iterative data assimilation scheme outlined in Algorithm 1, with ∇J∈L2(Ω). Only in Case I do we consider that the assimilation has reconstructed the bathymetry with sufficient accuracy (<10% relative error). (a) Convergence of the kappa test for the three cases. (b) Relative reduction in the cost function after 500 iterations. (c) Relative error in the reconstructed bathymetry. (d–f) Optimal reconstructed bathymetry for each case. We observe noise in the reconstruction for each case, especially in Case II.

It is clear that for each case convergence of the cost function does not necessarily correspond to the true bathymetry $β^{(t)} .$ Only in Case I does the reconstructed bathymetry converge to the true bathymetry, although even in this case the error is relatively large, 0.04. f shows the reconstructed bathymetry for each case. It is immediately clear that the primary source of error is small-scale noise. For Case I we see that, although the Gaussian bathymetry is well-resolved, there is still small-scale noise present in the tails. However, for Case III, where we assume periodic initial conditions $ϕ (x),$ the error in the reconstructed bathymetry is large, and we conclude that the observability of the bathymetry by sensors measuring a sinusoidal propagating surface wave is significantly lower than that of a travelling Gaussian wavefront, as in Case I.

For Case II, it is interesting to note that the noise is larger scale on the front slope of the bathymetry (x < 0) than the back slope of the bathymetry (x < 0). Since the observation points are placed to the right hand side of the Gaussian initial conditions $ϕ$ (as shown in ), there are no observations of the left propagating wave. Additionally, unlike Cases I and III, the observations are not positioned before the support of the bathymetry. This suggests that observations for x > 0 may not be able to sufficiently capture the bathymetry effects on the left-propagating wave. We provide a more quantitative analysis of sensitivity to the placement of the observations and the resulting effect on reconstruction error in Sec. 7.

We expect that if the kappa test converges, the error of the associated variational algorithm should also converge (even though this is not necessarily true). However, we see in that the best convergence for the kappa test is for Case III, whereas shows that Case III actually has the least accurate bathymetry reconstruction.

In the following section we show that an optimisation scheme where the gradient of the cost function Equation(3.16)(3.16) $\nabla^{L^{2}} J = \int_{0}^{T} u \frac{\partial η^{*}}{\partial x} d t .$ (3.16) exists in $L^{2} (Ω)$ is not smooth enough to obtain classical solutions to Equation(3.1)(3.1a) $\frac{\partial η}{\partial t} + \frac{\partial}{\partial x} ((H + η - β) u) = 0,$ (3.1a) , and thus in Sec. 5 we derive analytically a low pass filter by smoothing our gradient such that $\nabla J \in H^{2} (Ω)$ and provide results of the numerical implementation. Using smoother H² gradients largely eliminates the small scale errors in the reconstructed bathymetry we have seen here using L² gradient.

5. Smoothing using Sobolev gradients

In system Equation(3.1)(3.2) $J (β) = \frac{1}{2} \int_{0}^{T} \sum_{i = 1}^{M} [η^{(f)} (x_{j}, t; β) - y_{j}^{(o)} (t)]^{2} d t,$ (3.2) , the bathymetry is incorporated via the ${(β u)}_{x}$ term. Thus, a classical solution to this system requires smoothness not just of β, but also of its first derivative. Because of this, we require the gradient Equation(3.16)(3.16) $\nabla^{L^{2}} J = \int_{0}^{T} u \frac{\partial η^{*}}{\partial x} d t .$ (3.16) to be in the Sobolev space $H^{2} (Ω),$ which imposes smoothness conditions on $β_{x}$ as well as β. The following derivation of the corresponding Sobolev gradient is adapted from Protas (Citation2008).

$H^{2} (Ω)$ is a Sobolev space equipped with the inner product (5.1) $\begin{matrix} {〈 v_{1}, v_{2} 〉}_{H^{2} (Ω)} = {〈 v_{1}, v_{2} 〉}_{L^{2} (Ω)} + l_{1}^{2} {〈 \frac{\partial v_{1}}{\partial s}, \frac{\partial v_{2}}{\partial s} 〉}_{L^{2} (Ω)} + l_{2}^{4} {〈 \frac{\partial^{2} v_{1}}{\partial s^{2}}, \frac{\partial^{2} v_{2}}{\partial s^{2}} 〉}_{L^{2} (Ω)} \\ = \int_{s = - L}^{L} [v_{1} v_{2} + l_{1}^{2} \frac{\partial v_{1}}{\partial s} \frac{\partial v_{2}}{\partial s} + l_{2}^{4} \frac{\partial^{2} v_{1}}{\partial s^{2}} \frac{\partial^{2} v_{2}}{\partial s^{2}}] d s, \end{matrix}$ (5.1) where $v_{1}, v_{2} \in H^{2} (Ω),$ and $l_{1}, l_{2} \in R$ are the length scale parameters used to adjust the regularity. As long as l₁, l₂ are finite, by the Riesz representation theorem (and equivalence of inner products), (5.2) $\begin{matrix} J' (β; β') = \int_{s = - L}^{L} \nabla^{L^{2}} J β' d s \\ = {〈 \nabla^{L^{2}} J, β' 〉}_{L^{2} (Ω)} \\ = {〈 \nabla^{H^{2}} J, β' 〉}_{H^{2} (Ω)} \\ = {〈 \nabla^{H^{2}} J, β' 〉}_{L^{2} (Ω)} + l_{1}^{2} {〈 \frac{\partial \nabla^{H^{2}} J}{\partial s}, \frac{\partial β'}{\partial s} 〉}_{L^{2} (Ω)} + l_{2}^{4} {〈 \frac{\partial^{2} \nabla^{H^{2}} J}{\partial s^{2}}, \frac{\partial^{2} β'}{\partial s^{2}} 〉}_{L^{2} (Ω)} . \end{matrix}$ (5.2)

In order to extract the gradient using equivalence of inner products (as we did in Equation3.15(3.15) $\int_{- L}^{L} \int_{0}^{T} u \frac{\partial η^{*}}{\partial x} β' d t d x = \int_{- L}^{L} \nabla^{L^{2}} J β' d x,$ (3.15) ), we need to isolate the $β'$ term in Equation(5.2)(5.2) $\begin{matrix} J' (β; β') = \int_{s = - L}^{L} \nabla^{L^{2}} J β' d s \\ = {〈 \nabla^{L^{2}} J, β' 〉}_{L^{2} (Ω)} \\ = {〈 \nabla^{H^{2}} J, β' 〉}_{H^{2} (Ω)} \\ = {〈 \nabla^{H^{2}} J, β' 〉}_{L^{2} (Ω)} + l_{1}^{2} {〈 \frac{\partial \nabla^{H^{2}} J}{\partial s}, \frac{\partial β'}{\partial s} 〉}_{L^{2} (Ω)} + l_{2}^{4} {〈 \frac{\partial^{2} \nabla^{H^{2}} J}{\partial s^{2}}, \frac{\partial^{2} β'}{\partial s^{2}} 〉}_{L^{2} (Ω)} . \end{matrix}$ (5.2) . We use integration by parts as before, on the second and third term in Equation(5.2)(5.2) $\begin{matrix} J' (β; β') = \int_{s = - L}^{L} \nabla^{L^{2}} J β' d s \\ = {〈 \nabla^{L^{2}} J, β' 〉}_{L^{2} (Ω)} \\ = {〈 \nabla^{H^{2}} J, β' 〉}_{H^{2} (Ω)} \\ = {〈 \nabla^{H^{2}} J, β' 〉}_{L^{2} (Ω)} + l_{1}^{2} {〈 \frac{\partial \nabla^{H^{2}} J}{\partial s}, \frac{\partial β'}{\partial s} 〉}_{L^{2} (Ω)} + l_{2}^{4} {〈 \frac{\partial^{2} \nabla^{H^{2}} J}{\partial s^{2}}, \frac{\partial^{2} β'}{\partial s^{2}} 〉}_{L^{2} (Ω)} . \end{matrix}$ (5.2) .

We impose periodic conditions in space to eliminate the resulting boundary terms, and subsequently we have (5.3) $\begin{matrix} {〈 \nabla^{L^{2}} J, β' 〉}_{L^{2} (Ω)} = {〈 \nabla^{H^{2}} J, β' 〉}_{H^{2} (Ω)}, \\ \Rightarrow \int_{s = - L}^{L} \nabla^{L^{2}} J β' d s = \int_{s = - L}^{L} [\nabla^{H^{2}} J - l_{1}^{2} \frac{\partial^{2} \nabla^{H^{2}} J}{\partial s^{2}} + l_{2}^{4} \frac{\partial^{4} \nabla^{H^{2}} J}{\partial s^{4}}] β' d s . \end{matrix}$ (5.3)

Since this holds for every arbitrary perturbation $β',$ the process of smoothing the gradient from $L^{2} (Ω)$ to $H^{2} (Ω)$ is equivalent to solving an inhomogeneous boundary value problem, which in Fourier space simplifies to (5.4) ${\hat{(\nabla^{H^{2}} J)}}_{k} = \frac{1}{1 + l_{1}^{2} k^{2} + l_{2}^{4} k^{4}} {\hat{(\nabla^{L^{2}} J)}}_{k} .$ (5.4)

This is effectively a low pass filter applied to the L² gradient. We can make this filter as aggressive as needed by ‘tuning’ l₁ and l₂, where the case $l_{1} = l_{2} = 0$ is equivalent to the L² gradient. For our smoothed data assimilation algorithm, we set $l_{1} = 0$ and calibrate l₂ as this gives us the desired regularity and reduces the number of degrees of freedom in the problem. We now consider the numerical implementation of this updated optimization scheme, summarised in Algorithm 2.

Algorithm 2.

Data Assimilation algorithm with low pass filter for bathymetry estimation.

1: Pick initial estimate for $β^{(g)} .$

2: Solve the initial value problem for $(u, η)$ from t = 0 to t = T.

3: Solve adjoint problem for $(u^{*}, η^{*})$ backwards in time from t = T to t = 0 to find $η^{*} (x, t) .$

4: Approximate $\int_{0}^{T} u \partial_{x} η^{*} d t$ at every point in spatial domain Ω.

5: Define $\nabla^{L^{2}} J = \int_{0}^{T} u \partial_{x} η^{*} d t .$

6: Apply low pass filter Equation(5.4)(5.4) ${\hat{(\nabla^{H^{2}} J)}}_{k} = \frac{1}{1 + l_{1}^{2} k^{2} + l_{2}^{4} k^{4}} {\hat{(\nabla^{L^{2}} J)}}_{k} .$ (5.4) to $\nabla^{L^{2}} J$ to get $\nabla^{H^{2}} J$

7: Compute the optimal time step τ_n through a line minimisation algorithm.

8: Use a gradient descent algorithm to compute $β^{(n + 1)} (x) = β^{(n)} (x) - τ_{n} \nabla^{H^{2}} J (β^{(n)} (x)) .$

9: Repeat until $∥ \nabla^{H^{2}} J ∥ < ε$ for some small ε ( $∥ \int_{0}^{T} u \partial_{x} η^{*} d t ∥ \approx 0$ ).

10: Set $β^{(b)} (x) : = β^{(n)} (x) .$

Before we present the results of the data assimilation scheme, we illustrate the efficacy of the filtering in . We compare the gradient in L² with the gradient in H¹ and H² obtained after smoothing for Case III. The purpose of this comparison is to illustrate that requiring $\nabla J \in H^{2}$ is more effective at reducing noise than H¹ smoothing, where the H¹ inner product is equivalent to Equation(5.1)(5.1) $\begin{matrix} {〈 v_{1}, v_{2} 〉}_{H^{2} (Ω)} = {〈 v_{1}, v_{2} 〉}_{L^{2} (Ω)} + l_{1}^{2} {〈 \frac{\partial v_{1}}{\partial s}, \frac{\partial v_{2}}{\partial s} 〉}_{L^{2} (Ω)} + l_{2}^{4} {〈 \frac{\partial^{2} v_{1}}{\partial s^{2}}, \frac{\partial^{2} v_{2}}{\partial s^{2}} 〉}_{L^{2} (Ω)} \\ = \int_{s = - L}^{L} [v_{1} v_{2} + l_{1}^{2} \frac{\partial v_{1}}{\partial s} \frac{\partial v_{2}}{\partial s} + l_{2}^{4} \frac{\partial^{2} v_{1}}{\partial s^{2}} \frac{\partial^{2} v_{2}}{\partial s^{2}}] d s, \end{matrix}$ (5.1) with $l_{2} = 0 .$ In both cases, we try to choose optimal values of l₁ (for H¹) and l₂ (for H²) to filter out higher frequencies that contribute to the noise in the bathymetry reconstruction, without also getting rid of necessary information for observing the bathymetry propagated by the lower wavenumbers. We observe in that the H¹ gradient filters out the noise, but also reduces the amplitude of the signal peaks, whereas the H² gradient filters the noise and is closer to the original signal shape. We therefore conclude that H² Sobolev smoothing is optimal.

Fig. 3. The gradient of the cost function $J (β),$ obtained after one iteration for Case III, for H¹ and H² Sobolev smoothing compared to the (unsmoothed) L² gradient.

Fig. 3. The gradient of the cost function J(β), obtained after one iteration for Case III, for H1 and H2 Sobolev smoothing compared to the (unsmoothed) L2 gradient.

The results of Algorithm 2 are given in . The plots of c show the results of the kappa test, the reduction in the cost function, and the relative error in the reconstruction for all cases, respectively. The first thing we note is that in each case, the error is lower for the results with H² smoothing compared to results in . Especially with Cases I and II, we observe the error decreases by at least an order of magnitude. The reconstructed bathymetry shown in (d) and (e) illustrates how the noise has been greatly reduced, and for Case I is almost negligible. For Case II we see some noise remaining on the plateau of the sandbar. However, this is nevertheless a drastic improvement compared with the unfiltered result. This increase is reflected in the kappa test results given in for Case I and II, although the increased convergence does not scale proportionally with the error, as observed previously in .

Fig. 4. Results for assimilation scheme with Sobolev H² smoothing applied to $\nabla^{L^{2}} J .$ (a) Convergence of the kappa test. (b) Convergence of the cost function. (c) Relative L² error $∥ β^{(t)} - β^{(n)} ∥_{L^{2}}^{2}$ / $∥ β^{(t)} ∥_{L^{2}}^{2}$ between the exact and reconstructed bathymetry at each iteration. (d–f) Reconstructed bathymetry with H² smoothing and the exact bathymetry for cases I, II and III, respectively. Convergence is improved compared to results without smoothing given in .

Fig. 4. Results for assimilation scheme with Sobolev H2 smoothing applied to ∇L2J. (a) Convergence of the kappa test. (b) Convergence of the cost function. (c) Relative L2 error ∥β(t)−β(n)∥L22/∥β(t)∥L22 between the exact and reconstructed bathymetry at each iteration. (d–f) Reconstructed bathymetry with H2 smoothing and the exact bathymetry for cases I, II and III, respectively. Convergence is improved compared to results without smoothing given in Fig. 2.

However, for Case III there is no increase in convergence for the kappa test in , and while the reconstruction of $β (x)$ has less small-scale noise, we observe that amplitude of the Gaussian in is significantly smaller than the exact bathymetry. These results did not improve even with a more restrictive choice of l₂, leading us to conclude that additional factors, such as the system parameters and placement of the sensors $y^{(o)} (t),$ may affect the observability of the non-localised bathymetry. In Sec. 6 we attempt to analyse some of these effects.

6. Necessary conditions on the model parameters and the observation operator

In the previous section we showed that using smoother H² gradients is necessary to avoid small scale errors in the reconstructed bathymetry. We now conduct a qualitative analysis to understand the effect of model parameters and the observation operator on the convergence of the data assimilation. The qualitative results observed here motivate the more rigorous analyses in Khan and Kevlahan (Citation2020) and Sec. 7, using second order analysis (SOA) and GSA, respectively. All subsequent results are for $J (β) \in H^{2} .$

6.1. Necessary conditions on parameters

For the purposes of this study, we restrict our parameter analysis to understanding the relationship between the amplitude of the initial conditions $\hat{η},$ the amplitude of the true bathymetry $\hat{β},$ and the average depth H, which we have normalised to 1. The number of observation points N_obs = 45, and they are positioned as in with $y_{1}^{(o)} = 0.1 L .$ As this research is focused on improvement in tsunami prediction, our objective is to understand how surface waves propagate over bathymetry, and these factors play an important part when considering the scales involved. Tsunamis are characterised by their long wavelengths, often reaching 100–150 km in the deep oceans, and their relatively small amplitude of 0.1–1 m, making them often imperceptible. When approaching coastlines the amplitude of the surface wave can be $20 - 50$ m whereas the wavelength may still be up to 2 km. As the energy flux of the wave speed is dependent on depth-dependent wave speed $c = \sqrt{g (H + η (x, t) - β (x))},$ the amplitude and smoothness of the bathymetry can have a big impact on tsunami propagation (Damen et al., Citation2005). Additionally, in deeper water when h is larger the effects of bathymetry are smaller. Consequently we also investigate the effectiveness of free surface observations in reconstructing bathymetries with amplitudes much smaller than the average depth H.

We therefore give special consideration to Case I, Gaussian initial conditions with Gaussian bathymetry, as shown in . We wish to highlight how convergence is affected when (i) the ratio $\hat{η} / \hat{β}$ is increased, and (ii) the normalised bathymetry amplitude $\hat{β} / H$ varies. As H = 1, for the latter we may consider values of $\hat{β} .$

We investigated convergence of the kappa test and relative error in the reconstructed bathymetry when $\hat{η} / \hat{β}$ is $O (10^{- 1})$ (‘large’), $O (10^{- 2})$ (‘small’), and $O (10^{- 3})$ (‘very small’), respectively. The experiments are conducted for $\hat{β} = O (10^{- 1})$ and for $\hat{β} = O (10^{- 3}) .$ The results of the six configurations are presented in . For each case we provide the error in the kappa test $| κ (ε) - 1 |$ and the relative L² error in the reconstruction. We set a tolerance for the kappa test, such that all values of $| κ (ε) - 1 | \geq 10^{- 2}$ are non-convergent. The entries highlighted in red indicate cases where the optimisation algorithm became unstable and the gradient $\nabla J$ failed to converge to zero or became unbounded.

Table 3. Analysis of six experiments for Case I where $\hat{η} / \hat{β}$ and $\hat{β}$ are varying orders of magnitude.

Display Table

We observe that there are two cases where convergence was achieved (the only cells not highlighted in red): (i) when $\hat{η} / \hat{β} = O (10^{- 1})$ (‘large’), $\hat{β} = O (10^{- 3})$ (‘very small’), and (ii) when $\hat{η} / \hat{β} = O (10^{- 3})$ (‘very small’), $\hat{β} = O (10^{- 1})$ (‘large’). We note that for accurate and stable results we require either $\hat{η} / \hat{β}$ to be small or $\hat{β}$ to be very small ( $O (10^{- 3})$ ). However, when both are very small ( $O (10^{- 3})$ ) the results are non-convergent. We also note that convergence fails when both are large ( $O (10^{- 1})$ ). Additionally, we see that when $\hat{η} / \hat{β} = O (10^{- 2}), \hat{β} = O (10^{- 1})$ the bathymetry reconstruction error is relatively low, despite the kappa test error being higher than the set tolerance, suggesting that the latter may be relaxed to permit $| κ (ε) - 1 | = O (10^{- 2}) .$

Consequently, the only admissible cases where the reconstruction error converges are when (i) the surface waves are small in amplitude compared to the amplitude of the bathymetry, $\hat{η} / \hat{β} \leq O (10^{- 2}),$ or (ii) when the bathymetry amplitude $\hat{β}$ is small enough that the ratio between $\hat{η}$ and H is $O (10^{- 3})$ or smaller as a consequence.

Having explored the relationship between $\hat{η} / \hat{β}$ and convergence, we now consider the effects of the amplitude of the bathymetry $\hat{β}$ when $\hat{η}$ is fixed. We analyse convergence of Algorithm 2 for relative bathymetry amplitudes $\hat{β} / H$ ranging from 1% of the average depth to $30 % .$ These results are summarised in , and we have included results for Cases II and III for more insight. In all cases we fix $\hat{η}$ to be 0.001 and N_obs = 45. We see that for Case I, indicates that the error is highest when $\hat{β}$ is small, however it shows a steady decrease even when $\hat{β}$ is 30% of H. Case III shows a similar trend, even though the error is two orders of magnitude larger than for Case I. For Case II, the error remains stable at approximately $O (10^{- 2})$ for all values of $\hat{β} .$

Fig. 5. The relative error in the bathymetry reconstruction $∥ β^{(t)} - β^{(b)} ∥_{L^{2} (Ω)}$ / $∥ β^{(t)} ∥_{L^{2} (Ω)}$ (where $Ω = [- L, L]$ ), shown for different amplitudes $\hat{β} / H,$ with amplitude of initial conditions $\hat{η} = 0.01 %$ of H. Note that Case III has barely converged for any value of bathymetry amplitude.

Fig. 5. The relative error in the bathymetry reconstruction ∥β(t)−β(b)∥L2(Ω)/∥β(t)∥L2(Ω) (where Ω=[−L,L]), shown for different amplitudes β̂/H, with amplitude of initial conditions η̂=0.01% of H. Note that Case III has barely converged for any value of bathymetry amplitude.

As bathymetry effects on the surface wave decrease when the fluid depth h is large, it is reasonable that higher amplitudes of bathymetry lead to a slight decrease in the reconstruction error, as observability of the bathymetry by the surface wave may improve. However, to consider the effects of higher amplitudes of the bathymetry in tsunami models, it is reasonable to analyse the error in the surface wave error given the reconstructed bathymetry as $\hat{β}$ increases. Results for this analysis are given in Sec. 6.3.

Based on the results shown in , we suggest that for situations like Case I (i.e. a localised surface wave propagating over a compact bathymetry feature of a similar size) a necessary condition for the data assimilation scheme to recover the true bathymetry is that $\hat{η} / \hat{β}$ be at most $O (10^{- 2})$ when bathymetry amplitudes are large ( $O (10^{- 1})$ ). We observed in that increasing the bathymetry amplitude (when the initial condition amplitude is sufficiently small) does not affect the error in the bathymetry, however further analysis is needed on the resulting error in surface wave, given the reconstructed bathymetry, in order to quantify the effects of bathymetry in tsunami propagation.

6.2. Observation operator

The observation operator (i.e. the number and location of observations of sea surface height) used in our optimisation scheme is significant, as we have direct control over the number and location of sea surface sensors in real world simulations. We are therefore interested in configuring observation points to optimise convergence to the exact bathymetry.

We consider the effect of the number of observation points on convergence. In we present the results of Algorithm 2 for $N_{obs} = 5, 10, 20, 25$ observation points. Each configuration is a set of equidistant points, with the first point placed at $x = L / 10 .$ For Cases I and II the first point is within the support of the Gaussian initial condition. We show the convergence of both the cost function and the relative L² error. In all three test cases, the general conclusion is that more observation points produces better convergence. For Cases I and II the error decreases by approximately two orders of magnitude as N_obs is increased from 5 to 45. However, none of the results for Case III (periodic surface wave over Gaussian bathymetry) converge, and the error remains greater than 10% for all values of N_obs.

Fig. 6. Relative cost function and relative L² error for different numbers of observation points.

Fig. 6. Relative cost function and relative L2 error for different numbers of observation points.

To better understand the effects of N_obs, in we present the reconstructed bathymetry for each case, with N_obs = 5 and N_obs = 45, respectively. We observe in that for Case I (Gaussian surface wave over Gaussian bathymetry) even with N_obs = 5, the bathymetry is well-resolved, and increasing the number of observations successfully eliminates some small-scale noise at the base of the Gaussian.

Fig. 7. Reconstructed bathymetry for Cases I, II and III with N_obs = 5 and N_obs = 45, respectively.

In it is clear that for Case II (Gaussian surface wave over a sandbar) N_obs = 5 is not enough to resolve the bathymetry shape, especially for the bathymetry features in the region x < 0. However, increasing N_obs to 45 leads to accurately reconstructed bathymetry. The only significant error is a small amount of noise at the plateau of the sandbar profile near x = 0. For Case III, we note that while neither of the reconstructed bathymetries in converge to the true bathymetry in , convergence is much better qualitatively for N_obs = 45. However, the peak of the Gaussian is not fully resolved and the amplitude of the reconstruction with N_obs = 45 is smaller than the amplitude of the true bathymetry.

Based on these results, we conclude that a sufficiently large number of observations N_obs is a necessary condition for optimal convergence. However, as indicated by the reconstructions in , it is not a sufficient condition for convergence in all configurations. Despite this, results in demonstrate that observations of a localised surface wave are able to reconstruct both a Gaussian bathymetry and a Sandbar bathymetry with relatively small error in convergence. The following question remains: how does the size of the reconstruction error of the bathymetry affect prediction of the free surface wave?

In the context of tsunami modelling, our priority is accurate prediction of the free surface wave given the reconstructed bathymetry. Therefore, motivated by results observed thus far, in Sec. 6.3 we analyse the L² error in the surface wave given the reconstructed bathymetry, as N_obs is increased. Additionally, a rigorous sensitivity analysis of the surface wave error to observations is presented in Khan and Kevlahan (Citation2020) using second order variational techniques, and in Sec. 7 using GSA. The objective of this analysis is to explore further, not only the effect of the number of observation, but also the position of the observations in the spatial domain.

6.3. Sensitivity of propagating surface wave to to bathymetry reconstruction error

Our final analysis concerns the sensitivity of the surface wave to the bathymetry, in particular, to errors in the bathymetry data. Given some optimal reconstruction $β^{(b)},$ we wish to gauge the sensitivity of the propagating surface wave $η (x, t)$ to the errors in the reconstruction. Our objective is to address the following question: how does the reconstruction error in the bathymetry impact the propagating surface wave? If our goal is to use reconstructions of the bathymetry to generate more accurate predictions of tsunami waves, then the main consideration is not the error between the optimal bathymetry reconstruction and the true bathymetry, but the resulting error in $η (x, t) .$ In other words, we only need the reconstructed bathymetry to be ‘good enough’ to produce sufficiently accurate surface waves.

Thus far we have qualitatively assessed the convergence to the exact bathymetry as we varied the observation operator, and the amplitudes of the initial conditions and bathymetry $\hat{η}$ and $\hat{β} .$ We now address the corresponding error in the surface wave given the reconstructed bathymetry $η (x, t; β^{(o)})$ as a function of $\hat{η}, \hat{β},$ and N_obs.

The results for each are summarised in , respectively. The L² error in the bathymetry is plotted alongside the resulting L² error (in space and time) in the surface wave. shows the respective errors as a function of bathymetry amplitude, where $\hat{η} = 0.001 .$ In each of the three cases, we observe the error in the surface wave for $\hat{β} ≲ 0.1$ is almost 2 orders of magnitude lower than the error in the bathymetry estimation. As the relative amplitude of the bathymetry $\hat{β}$ increases, so does the surface wave error, even though the bathymetry error remains relatively constant. This is intuitive, as we observe that since the effect of bathymetry decreases as the fluid depth increases (a consequence of smaller amplitudes $\hat{β}$ ), the height $η (x, t)$ is affected more by larger bathymetry amplitudes. As such, errors in the reconstruction of larger bathymetry perturbations are more likely to be observed in the resulting surface wave. Therefore, we consider $\hat{β} \leq 0.1$ a necessary condition for optimal convergence of the surface wave. Overall, the results suggest there is low sensitivity of the surface wave to bathymetry reconstruction errors as $\hat{β}$ varies.

Fig. 8. The relative L² error in the bathymetry reconstruction, shown for different amplitudes $\hat{β},$ and the corresponding relative L² error in the propagating surface wave $η (x, t) .$ The amplitude of the initial conditions $\hat{η}$ is 0.001, and N_obs = 45.

Fig. 8. The relative L2 error in the bathymetry reconstruction, shown for different amplitudes β̂, and the corresponding relative L2 error in the propagating surface wave η(x,t). The amplitude of the initial conditions η̂ is 0.001, and Nobs = 45.

Fig. 9. The relative L² error in the bathymetry reconstruction, shown for different values of N_obs, and the corresponding relative L² error in the propagating surface wave $η (x, t) .$ The amplitude of bathymetry $\hat{β}$ is 0.1. The amplitude of the initial conditions $\hat{η}$ is fixed to be 1% of $\hat{β} .$

Fig. 9. The relative L2 error in the bathymetry reconstruction, shown for different values of Nobs, and the corresponding relative L2 error in the propagating surface wave η(x,t). The amplitude of bathymetry β̂ is 0.1. The amplitude of the initial conditions η̂ is fixed to be 1% of β̂.

Fig. 10. The relative L² error in the bathymetry reconstruction and the corresponding relative L² error in the propagating surface wave $η (x, t),$ as a function of the initial conditions amplitude $\hat{η} .$ The amplitude of the bathymetry is fixed to be 0.2.

Fig. 10. The relative L2 error in the bathymetry reconstruction and the corresponding relative L2 error in the propagating surface wave η(x,t), as a function of the initial conditions amplitude η̂. The amplitude of the bathymetry is fixed to be 0.2.

presents the L² error in the bathymetry alongside the resulting L² error in the surface wave, as a function of the number of observation points. We consider $N_{obs} = 5, 10, 20, 45,$ and plot the resulting errors for each of the three cases. $\hat{β}$ is fixed at 0.1 and $\hat{η} = 0.01 \hat{β} .$ We observe that for Cases I and III, the error in the surface wave is almost two orders of magnitude smaller than the bathymetry error, and both decrease proportionally as the number of observation points increases. For Case II, the difference in the errors is $O (10^{- 1})$ for smaller values of N_obs, however this difference increases as we increase the number of the observation points, suggesting the sensitivity of the surface wave decreases with more observation points for a sandbar profile. A reason for this may be that for Case II there are no observation points located before the support of the bathymetry, and consequently a smaller value of N_obs may not be sufficient to create a an accurate profile, resulting in increased error in the reconstruction (as observed in ). Consequently, the error in the surface wave increases as well. However, as more observation points are added in the noise in the bathymetry is smoothed, and the smoother profile may explain the relative larger decrease in the surface wave error, up to two orders of magnitude smaller than the bathymetry error.

Finally, shows the bathymetry reconstruction and surface wave errors as a function of the initial conditions amplitude $\hat{η} .$ We have already observed a correlation in the amplitudes $\hat{η}$ and $\hat{β},$ and we concluded in Sec. 6.1 for optimal convergence we require $\hat{η} / \hat{β} = O (10^{- 2}) .$ Therefore, in we present results for small values of $\hat{η},$ and $\hat{β} = 0.2 .$ For each case we observe that the error in the surface wave is orders of magnitude lower than the bathymetry error for most values of $\hat{η} .$ Both errors are relatively high for smaller values of $\hat{η},$ but as $\hat{η} \to 10^{- 3},$ they both decrease. This suggests that the free surface wave is more sensitive to bathymetry reconstruction error when $\hat{η}$ is very small.

To summarise, we observed that across variations in (i) the number of observation points, and (ii) model parameters such as $\hat{η}$ and $\hat{η},$ the error in the surface wave was orders of magnitude lower than the bathymetry reconstruction error, suggesting low sensitivity of the surface wave to bathymetry reconstruction error. The consequences of this low sensitivity are significant. It is essential to understand the sensitivity of the surface wave to the bathymetry for two distinct but related reasons. First, the assimilation described in this study is contingent upon the assumption that the surface wave (and observations) are affected by the bathymetry. We have demonstrated this is true in . Secondly, the variation in the surface wave error in shows that changes in bathymetry amplitude and shape do significantly affect surface wave propagation. However, we observe that the sensitivity of the surface wave to the bathymetry reconstruction error is low. This means that the surface wave is not sensitive to the details of the bathymetry: a roughly accurate reconstruction of bathymetry should be sufficient in most cases.

We conclude that, although bathymetry does significantly modify surface wave propagation, for tsunami modelling we only need the bathymetry to be accurate enough that the surface wave is modelled correctly, and that we can derive criteria for when the bathymetry reconstruction is reconstructed sufficiently well. Additionally, we also do not require the optimal reconstruction $β^{(b)}$ to be unique. These implications motivate a more rigorous analysis of surface wave sensitivity, and we explore this further in Sec. 7.

7. Global sensitivity analysis (GSA)

Based on the observed qualitative results in Sec. 6.3, in this section, we use GSA to more rigorously quantify the influence of model parameters on the error in the bathymetry reconstruction, and the resulting error in the surface wave, respectively. We focus our analysis on a localised surface wave propagating over a compact bathymetry, as in Case I. Using GSA we aim to quantify, and subsequently rank the influence of these parameters on the bathymetry and surface wave errors, respectively.

7.1. Sensitivity analysis methods, derivation, and sampling considerations

Saltelli et al. (Citation2010) define GSA as the set of mathematical techniques used to assess the propagation of uncertainty in a numerical model. In practice, a set of synthetic indices are derived that quantify the relative contribution to the output variance from different input parameters. These are known as ‘sensitivity’ indices. Liu and Homma (Citation2009) propose that good sensitivity indices should exhibit the following properties:

They are global, and consider the influence of model inputs on the entire output range.
They are quantifiable, and thus can be computed and reproduced numerically.
They are not conditional on any assumed input values.
The value of the sensitivity index for some input gives an easy interpretation of the sensitivity of the model to the parameter.
They are consistent across multiple samples and simulations.
Ideally, they are moment-independent, and do not rely on a specific quantitative measure of the output distribution.

GSA methods can be categorised as one-at-a-time methods (OAT), where output variations are induced by varying one input at a time, or all-at-a-time methods (AAT), where all input factors are varied simultaneously. GSA methods are widely used in earth systems modelling (ESM). An example is the work conducted by the Modelling, Observations, Identification for Environmental Sciences (MOISE, Citation2014) project on variance-based sensitivity analysis on a marine ecosystem model of the Ligurian sea.

A concise review of the contributions of GSA on advancement of ESM is provided in Wagener and Pianosi (Citation2019). They highlight the surge in computational capacity for Earth and climate models, and address the resulting problem of increased interaction between model components and parameters, even when representing a relatively low number of physical processes. This issue is particularly problematic in ESM where incomplete knowledge and the lack of relevant exact solutions makes model validation very difficult. In addition, such models are forced with noisy and inaccurate observations. Typical uses of GSA include:

Ranking the influence of the parameters on model output from highest to lowest.
Creating a threshold criteria for sensitivity allowing us to determine parameters of negligible influence (screening).
Finding thresholds in the input parameter values that map into specific output regions (factor mapping).

Thus, the primary objective of this study is to use GSA to analyse the data assimilation results bathymetry detection presented in this paper, in order to rank, screen and factor map the influence of its parameters.

Our objective is to measure the relative L² reconstruction error in the bathymetry (7.1) $Y_{β} = \frac{‖ β^{(t)} - β^{(b)} ‖_{L^{2} (Ω_{X})}}{‖ β^{(t)} ‖_{L^{2} (Ω_{X})}},$ (7.1) where $β^{(t)}$ is the ‘true’ bathymetry and $β^{(b)}$ is the ‘best’ reconstruction obtained via the data assimilation scheme. Additionally, we define the relative L² error in the propagating surface wave $η (x, t)$ given the reconstructed bathymetry $β^{(b)}$ as (7.2) $Y_{η} = \frac{‖ η (β^{(t)}) - η (β^{(b)}) ‖_{L^{2} (Ω_{X \times T})}}{‖ η (β^{(t)}) ‖_{L^{2} (Ω_{X \times T})}} .$ (7.2)

In Sec. 6.3 we observed that the surface wave error was orders of magnitude lower than the bathymetry error (7.1), suggesting low sensitivity to the bathymetry error, as a function of the bathymetry amplitude $\hat{β} .$ Based on this initial observation, we wish to derive a more rigorous way to measure sensitivity of results to parameters $\hat{η}$ (amplitude of the surface wave), $\hat{β}$ and ψ (amplitude and position of the bathymetry, respectively) such that (7.3) $η^{(t)} (x, 0) = \hat{η} e^{- {(10 x)}^{2}}$ (7.3) (7.4) $β^{(t)} (x) = \hat{β} e^{- {(10 (x - ψ))}^{2}} .$ (7.4)

We focus on Case I, where we have compactly supported Gaussian initial conditions centred at zero, and a localised Gaussian bathymetry centred at $x = ψ .$

The key property of the parameter ψ is that it determines the location of the bathymetry relative to the observation points. When ψ is small, a smaller proportion of observation points observes the surface wave before it interacts with the bathymetry, and for ψ approximately less than 1, the observations do not span the entire bathymetry support. We wish to discover whether the influence of shifting the bathymetry position ψ relative to the fixed observation points is significant.

7.2. Density based sensitivity analysis (DBSA)

Since skewness in the output distribution is common in non-linear and environmental models(Pianosi and Wagener, Citation2015), we use a subset of sensitivity analyses that does not use moments of the model output, such as the expected value, variance or skewness, as a measure of uncertainty. Such methods are collectively known as density based sensitivity analysis’(DBSA). The idea is to measure sensitivity through variations of the probability density function that occur when the influence of a certain input factor is removed. This is done by computing the difference between unconditional PDFs generated by varying all parameters, and conditional PDFs obtained when fixing individual input parameters at a particular nominal value (Pianosi et al., Citation2016). The sensitivity index is then computed as a statistic based on this divergence.

For the present analysis, we follow the method outlined in Pianosi and Wagener (Citation2015), who improve on this idea by characterising output distributions by their cumulative density functions (CDFs) instead. Known as the PAWN method, density-based sensitivity indices are calculated using conditional and unconditional CDFs, using the fact that these are more efficiently derived than PDFs. We give a brief overview of the PAWN method, the divergence statistic used to compute the indices, and the sampling strategy used in the numerical implementation.

Let us define the unconditional cumulative density distribution of our output Y (where Y can be either $Y_{β}$ or $Y_{η}$ ) as $F_{Y} (Y),$ and the conditional cumulative density distribution when the input parameter x_i is fixed as $F_{Y | x_{i}} (Y) .$ As $F_{Y | x_{i}} (Y)$ represents the case where there is no variability resulting from x_i, the distance between the two functions $F_{Y | x_{i}} (Y)$ and $F_{Y} (Y)$ represents the variability in the output induced by x_i. This distance is proportional to the influence of x_i, i.e. if $F_{Y | x_{i}} (Y)$ = $F_{Y} (Y)$ then x_i has zero influence on the variability of output Y. In the PAWN method, this distance is computed using the Kolmogorov–Smirnov (KS) statistic (7.5) $KS (x_{i}) = \max_{Y} | F_{Y | x_{i}} (Y) - F_{Y} (Y) | .$ (7.5)

As KS depends on the value of x_i, the PAWN sensitivity index T_i is calculated using a statistic (such as the median or maximum) over all values of x_i, (7.6) $T_{i} = \underset{x_{i}}{Stat} [KS (x_{i})] .$ (7.6)

Pianosi and Wagener (Citation2015) observe that the sensitivity index T_i satisfies all of the properties of a ‘good’ sensitivity index, as it is global, quantitative and model-independent. It has the advantage that it is also moment-independent. By definition it is normalised to take values between 0 and 1.

In the numerical implementation, empirical CDFs are used to compute $F_{Y | x_{i}} (Y)$ and $F_{Y} (Y),$ using a sample of the parameter space. PAWN has a big advantage over the sample size requirements in variance-based sensitivity analysis (VBSA), as a smaller sample size can be used to effectively approximate the CDFs due to their regularity properties. $F_{Y} (Y)$ is approximated using N_u model evaluations obtained by sampling over the entire parameter space, while $F_{Y | x_{i}} (Y)$ is approximated using N_c model evaluations derived by sampling over the non-fixed parameters only, while keeping x_i fixed. Additionally, the conditioning value x_i used to compute T_i in (7.6) is replaced by $x_{i} = {\bar{x_{i}}}^{(1)}, {\bar{x_{i}}}^{(2)}, . ., {\bar{x_{i}}}^{(n)},$ n randomly-sampled values for the fixed input x_i. Thus, the total number of model evaluations is $N_{u} + n \times d \times N_{c} .$ Generally these values are chosen by trial and error. Pianosi and Wagener (Citation2015) suggest a reasonable choice for n is between 10 and 50.

For our analysis, we set N_u = 200, N_c = 150, and n = 15, giving total of 6950 model evaluations. The computation of the density-based indices is then a quick post-processing step. To verify whether the sample size is adequate, bootstrapping can be used to analyse robustness of sensitivity index estimates, as in the previous section. We present the results for the DBSA sensitivity indices approximated using the PAWN method in the following section.

7.3. Results of DBSA

The number of evaluations used in the analysis was adjusted to exclude sample points where the data assimilation did not converge (i.e. where the necessary ratio between the amplitudes of the initial conditions and the bathymetry did not hold). N_c was adjusted to 145, and 6725 evaluations were used to approximate T_i. For the initial analysis we chose the maximum as the statistic used in 7.6.

presents the initial results for the sensitivity indices T_i for each input parameter. gives the result for model output $Y_{β}$ and for $Y_{η} .$ $Y_{β}$ has the greatest sensitivity to position of the bathymetry ψ ( $86 %$ ). Interestingly, we can see in that the amplitude of the initial conditions $\hat{η}$ and position of the bathymetry ψ have relatively low influence on the error in the surface wave $Y_{η}$ compared to β, at 13% and 22%, respectively. On the other hand, the amplitude of the bathymetry $\hat{β}$ is highly influential for the surface wave error $Y_{η},$ with $T_{\hat{β}} = 0.895 .$

Fig. 11. Approximation of the density-based sensitivity indices for $\hat{β}$ (red), $\hat{η}$ (blue) and ψ (green). (a) Influence of each parameter on $Y_{β} .$ (b) Influence of each parameter on $Y_{η} .$

Fig. 11. Approximation of the density-based sensitivity indices for β̂ (red), η̂ (blue) and ψ (green). (a) Influence of each parameter on Yβ. (b) Influence of each parameter on Yη.

We need to check these results for robustness and convergence before we can accept them as conclusive. Bootstrapping is done over 700 re-samples, and confidence intervals are derived. These can be seen in . We present the distances between upper and lower bounds for each parameter in , and subsequently compute ${Stat}_{c f}$ for each.

Fig. 12. Approximation of the density-based sensitivity indices for $\hat{β} (red), \hat{η}$ (blue) and ψ (green), with confidence intervals derived using 700 re-samples. (a) Influence of each parameter on $Y_{β} .$ (b) Influence of each parameter on $Y_{η} .$

Fig. 12. Approximation of the density-based sensitivity indices for β̂(red), η̂ (blue) and ψ (green), with confidence intervals derived using 700 re-samples. (a) Influence of each parameter on Yβ. (b) Influence of each parameter on Yη.

Table 4. Width of the confidence interval $| T_{i}^{u b} - T_{i}^{l b} |,$ and mean index $T_{i}^{m}$ averaged over 700 bootstrap re-samples.

Display Table

We observe a great improvement in the ${Stat}_{c f}$ statistics relative to the VBSA results, where ${Stat}_{c f} (Y_{β}) = 0.1081$ and ${Stat}_{c f} (Y_{η}) = 0.0850 .$ The largest margin of error from the mean $T_{i}^{m}$ across all input factors and both outputs is $6.3 % .$ If we set a tolerance level of 0.05 for the error margin, then there is only a single instance ( $| T_{\hat{β}}^{u b} - T_{\hat{β}}^{m} | = 0.0630$ ) where the error margin is higher. We hypothesise that this could be further improved by taking a slightly larger sample size.

The convergence of the sensitivity indices is given in . Across both errors $Y_{β}$ and $Y_{η},$ the most influential parameters (position ψ and amplitude $\hat{β}$ of the bathymetry) appear to converge. For the other less influential parameters in each case, convergence is still relatively smooth. However, we see that the index values are still decreasing slightly even at the largest sample, suggesting that a larger number of samples might be needed for convergence to a fixed point. Nevertheless, given the results of the robustness analysis and convergence observed thus far, we conclude that we may still accurately gauge the influence of the parameters on our model outputs with the current sample size, especially in order to determine the most influential factors. Consequently, we proceed to consider ranking, screening, and factor mapping of the input parameters.

Fig. 13. Convergence analysis for DBSA indices for (a) $Y_{β},$ and (b) $Y_{η},$ for re-samples of size N = 215 to N = 2375.

The PAWN method can be used effectively to screen for non-influential input factors. presents an overview of the different values of KS over the 15 randomly chosen sample points x_i for each input parameter. To determine which input parameters are most influential, the two-sample KS test (Pianosi and Wagener, Citation2015) has been implemented, which allows us to reject the hypothesis that $F_{Y} (Y)$ and $F_{Y | x_{i}} (Y)$ are the same if (7.7) $K S > c (α) \sqrt{\frac{N_{c} + N_{u}}{N_{c} N_{u}}},$ (7.7) where α is the confidence level, and $c (α)$ is a critical value determined in the literature (Pianosi and Wagener, Citation2015). presents results for $Y_{β}$ and for $Y_{η} .$ The red dotted line represents the threshold value at the confidence level $α = 0.05 .$ KS statistics for input x_i that fall below this critical threshold indicate that x_i is non-influential.

Fig. 14. KS statistic with significance level 0.05. Values below the dotted red line are non-influential.

We observe that for both outputs $Y_{β}$ and $Y_{η}, \hat{η}$ is non-influential across all 15 choices for the fixed value. KS statistics for $\hat{β}$ fall below the threshold at fixed values approximately between 0.1 and 0.25 for the model $Y_{β} .$ This corroborates results from Sec. 2, where we observed that the bathymetry reconstruction error $Y_{β}$ was higher when the amplitude of the bathymetry $\hat{β}$ was either too big or too small relative to the average depth H. In the present analysis the maximum KS statistic is used to compute T_i for each input parameter, and so $\hat{β}$ is still considered influential on $Y_{β} .$ It is interesting to contrast this result with the KS statistics for $\hat{β}$ and output $Y_{η},$ as shown in . While we see a similar trend, where the KS statistic is higher if the bathymetry amplitude $\hat{β}$ is at the lower or upper end of the sample interval, the resulting variation is much higher, and all KS statistics are above the influence threshold value.

In a similar manner, while the position of the bathymetry ψ has relatively low KS statistics across all fixed values of the surface wave error $Y_{η}$ (approximately half of the values are above the cutoff with no clear trend), it has relatively much higher KS values, and subsequently higher influence on the bathymetry reconstruction error $Y_{β},$ as observed in the third panel in . Once again, KS statistics are much higher at values closer to the endpoints of the parameter space for ψ, $[0.5 2] .$ We suggest this indicates that the placement of the bathymetry relative to the observation points has a significant influence on the error in the bathymetry reconstruction. This may be because as the bathymetry Gaussian placement varies over the interval, so does the time and position where the bathymetry is first observed by the measurement points.

Thus, based on the threshold level for a confidence interval $α = 0.05,$ and using the maximum KS statistic to compute T_i, we observe that for both bathymetry reconstruction error $Y_{β}$ and surface wave error $Y_{η},$ the amplitude of the initial conditions $\hat{η}$ can be considered non-influential. We note that the choice of a different statistic in (7.6), such as the mean or median, may change the value of T_i. We provide a comparison of the different indices in . Given the threshold value of 0.147 ( $c (α = 0.05)$ ), we can see that by the choice of a mean or median statistic, $\hat{β}$ does not affect whether or not a parameter is non-influential for the bathymetry reconstruction error $Y_{β} .$

Table 5. Model output $Y_{β} :$ indices T_i using different statistics in the definition of the sensitivity index (7.6).

Display Table

As we can see from , the overall ranking of the parameters is not affected by the choice of statistic used for $Y_{β} .$ The only exception is the parameter $\hat{β},$ which is influential only with the max statistic. The position of the bathymetry, ψ, is the most influential parameter for $Y_{β},$ and the amplitude of the initial conditions $\hat{η}$ is the least influential. Similarly, for model output $Y_{η},$ results in indicate that across all choice of statistics, $\hat{β}$ is the most influential parameter and $\hat{η}$ is non-influential.

Table 6. Model output $Y_{η} :$ indices T_i using different statistics in the definition of the sensitivity index (7.6).

Display Table

So far we have used the results of the PAWN sensitivity analysis to rank and screen our input factors. A third analysis that can prove insightful is factor mapping: we can compute sensitivity indices based on a sub-interval of our output Y given the current sample. For example, over all model evaluations, the maximum surface wave error $Y_{η}$ is $2.63 \times 10^{- 2},$ and the minimum is $2.23 \times 10^{- 4} .$ Similarly the maximum bathymetry reconstruction error $Y_{β}$ is 0.265 and the minimum is $3.75 \times 10^{- 2} .$ To determine the influence of input parameters when the values of $Y_{η}$ and $Y_{β}$ are relatively high, we define a parameter M such that we can compute sensitivity indexes for all input-output samples for the cases Y > M, and $Y \leq M$ separately. We choose M = 0.015 for output $Y_{β}$ and M = 0.001 for $Y_{η} .$ The results are given in .

Fig. 15. Indices T_i for influence of $\hat{β}$ (red), $\hat{η}$ (blue) and ψ (green) on model outputs Y > M (left panel), and $Y \leq M$ (right panel) for (a) $Y_{β},$ and (b) $Y_{η} .$ Confidence intervals were calculated using 700 bootstrap samples.

Fig. 15. Indices Ti for influence of β̂ (red), η̂ (blue) and ψ (green) on model outputs Y > M (left panel), and Y≤M (right panel) for (a) Yβ, and (b) Yη. Confidence intervals were calculated using 700 bootstrap samples.

While there is not much difference in terms of ranking between the left (Y > M) and right ( $Y \leq M$ ) panes for either $Y_{β}$ or $Y_{η},$ it is noteworthy that for the surface wave error in the influence of the bathymetry amplitude $\hat{β}$ decreases when $Y_{η} \leq 0.001$ (right panel). This is insightful, as it suggests that the bathymetry is more influential when the errors in the surface wave are large. However, in both intervals of $Y_{η}$ the influence of $\hat{β}$ is high (above 70%). Additionally, the effect of the bathymetry position ψ when $Y_{η}$ greater than 0.001 is 29%, and subsequently is greater than the influence threshold value. On the other hand, the influence of the initial conditions amplitude $\hat{η}$ and bathymetry position ψ drops below the threshold value when $Y_{η} \leq 0.001 .$

For the bathymetry reconstruction error $Y_{β},$ we see in that the influence of the bathymetry position ψ is approximately doubled when the bathymetry reconstruction error is less than 0.015. The lower sensitivity of each of the input parameters when the bathymetry reconstruction error is higher could be suggestive of other significant influences on the model error, that cannot be attributed to these parameters alone.

In summary, for $Y_{η}$ the most influential parameter is $\hat{β},$ the amplitude of the bathymetry, whereas the amplitude of the surface wave initial conditions $\hat{η}$ is categorised as non-influential. For the bathymetry reconstruction error $Y_{β},$ bathymetry position ψ is the most influential parameter, while the initial conditions amplitude $\hat{η}$ was determined to be non-influential. Additionally, $\hat{β}$ is influential on bathymetry reconstruction error only when the error is larger than 0.15, and ψ is only influential on the surface wave error when the error is larger than 0.001.

8. Conclusion and future directions

This study is a first step in understanding better the role of observations and model parameters in variational bathymetry assimilation. We are limited somewhat by the lack of an analytical solution for the shallow water system with non-zero bathymetry, but these computational results provide some key insights to guide further analyses. The 1-D geometry considered in this analysis is intended to be a foundation for extensions to more realistic 2-D cases.

We derived a variational data assimilation algorithm to reconstruct the bathymetry from a set of free surface wave observations by minimising a functional $J (β)$ representing the least squares error between observations and forecast solutions. We observed that $\nabla J \in L^{2} (Ω)$ resulted in small-scale noise in the bathymetry reconstruction and impacted convergence. Consequently, we showed that smoother H² gradients are necessary to avoid small-scale errors in the reconstructed bathymetry. Using the H² gradients, we accurately reconstructed the bathymetry for test cases with a Gaussian initial conditions, and (i) a Gaussian bathymetry, and (ii) a sandbar profile bathymetry.

We then analysed the relationship between the normalised bathymetry and initial conditions amplitudes $\hat{β}$ and $\hat{η}$ to understand how they influence convergence. Based on a qualitative analysis of a localised surface wave propagating over a compact bathymetry feature of a similar size, we suggest that a necessary condition for convergence of the assimilation to the true bathymetry is that the ratio of the surface wave amplitude to the bathymetry amplitude $\hat{η} / \hat{β}$ be at most $O (10^{- 2})$ when bathymetry amplitudes are $O (10^{- 1}) .$ Additionally, we observe that a relatively large number of observations N_obs is necessary for convergence. We found N_obs = 45 was the optimal number of observations for the cases we considered.

Perhaps the most significant conclusion of this study was that the surface wave $η (β^{(b)})$ has relatively low sensitivity to errors in the reconstructed bathymetry. We showed that the free surface error was orders of magnitude smaller than the bathymetry reconstruction error as a function of N_obs, $\hat{β}$ and $\hat{η},$ respectively. Reconstructing the bathymetry with a relative error of about 10% is sufficiently accurate for surface wave modelling in most cases.

If this conclusion can be verified in higher dimensions and in more realistic configurations (e.g. including turbulence and the Coriolis effect), this result will enhance tsunami forecast models by the quantifying exact tolerance levels for the error in the bathymetry necessary for accurate surface wave modelling. Additionally, tolerances for smaller scales in bathymetry reconstruction may be quantified.

To further investigate the sensitivity of the reconstruction error and resulting surface wave error to model parameters, we used a density-based method (DBSA) to empirically compute indices that rely on the variation between unconditional and conditional cumulative density functions when particular inputs are fixed, as a measure of sensitivity. We used the PAWN algorithm Pianosi and Wagener (Citation2015) to derive sensitivity indices for the three input parameters, $\hat{β}, \hat{η}$ and ψ (bathymetry amplitude, surface wave amplitude and position of the bathymetry relative to the observations, respectively), and the model outputs $Y_{β}$ and $Y_{η} .$ Our objective was to rank the the influence of the inputs from highest to lowest, screen for non-influential parameters, and finding thresholds in the input parameter values that map into specific output regions (factor mapping). We summarised the conclusions in .

Table 7. Classification of each input parameter as influential or non-influential for the sub-regions of $Y_{β}$ and $Y_{η}$ in .

Display Table

Our results showed that for the model output $Y_{η}$ (the error in the surface wave given the reconstructed bathymetry), the most influential parameter was $\hat{β},$ the amplitude of the bathymetry, whereas the amplitude of the surface wave initial conditions $\hat{η}$ was categorised as non-influential. This confirms conclusions from Sec. 6.3, where we observed that the surface wave error increased as the amplitude of the relative bathymetry $\hat{β} / H$ became larger. We considered specific output regions for both the bathymetry reconstruction error and the surface wave error. For the latter we consider relative L² errors $Y_{η} > 0.001$ and $Y_{η} \leq 0.001,$ and found that the influence of all three parameters (the bathymetry and initial conditions amplitudes $\hat{β}$ and $\hat{η},$ and the bathymetry position ψ), decreases when $Y_{η} \leq 0.001 .$

The bathymetry position ψ was the most influential parameter on the bathymetry reconstruction error $Y_{β},$ while the initial conditions amplitude $\hat{η}$ was determined to be non-influential. The influence of ψ for small errors $Y_{β} \leq 0.015$ was approximately 85%. However, this dropped by almost half for large errors $Y_{β} > 0.015 .$ Overall, the bathymetry reconstruction error was less sensitive to each of the input parameters when the bathymetry reconstruction error was large. This suggests that other parameters that are not considered in this analysis may have a significant influence for larger values of the bathymetry reconstruction error.

While the locations of the observation points were kept fixed relative to the computational domain, the position, ψ, of the bathymetry relative to the observation locations was varied. The parameter ψ therefore determines the point where the bathymetry is observed by the measurement points, as well as how many observation points observe the surface wave before it interacts with the bathymetry. We found that ψ was influential on the surface wave error for all statistics, except when the error in the surface wave $Y_{η}$ was less than 0.001. As our objective is to determine which parameters are responsible for large errors in the surface wave, we conclude that the placement of the observation points relative to the reconstructed bathymetry is influential on the surface wave accuracy. Further analysis on the best location for the observations relative to the bathymetry is conducted using second order adjoint sensitivity analysis in Part II (Khan and Kevlahan, Citation2020).

A key observation is that for both bathymetry reconstruction error and the surface wave error, the amplitude of the initial conditions $\hat{η}$ was non-influential for each sub-region. However, the lack of sensitivity to $\hat{η}$ can be explained by the fact that all values where the amplitude ratio condition was violated (and subsequently the assimilation was non-convergent) were removed from the sample used to derive sensitivity indices. Therefore, we cannot conclude that $\hat{η}$ is non-influential in the bathymetry reconstruction and the resulting free surface wave, on the basis of these results alone.

The inclusion of more parameters in the sensitivity analysis, like the final control time t = T, or the resolution of the numerical approximation (both of which would impact the accuracy of the reconstructed bathymetry) should be addressed in future work. Additionally, quantifying the sensitivity of the bathymetry reconstruction to interaction between various inputs would also be valuable, especially considering the correlation between $\hat{η}$ and $\hat{β}$ observed in Sec. 6. Sensitivity analyses of the accuracy of the observations themselves would also provide insight. In atmosphere and ocean models, observation measurements are usually noisy, and therefore a measure of their sensitivity to the observation operator would be illuminating for future work (Wagener and Pianosi, Citation2019).

The present analysis proved insightful for an idealised model of data assimilation for the 1-D shallow water equations with Gaussian initial conditions and bathymetry. In order to use these results for more realistic forecasts, the inevitable next step is to extend this approach to a full 2-D model, and include multiple forms of bathymetry, such as sandbar or ridge formations. The data assimilation algorithm for reconstructing bathymetry from surface wave observations presented here in 1-D has demonstrated the potential of this approach for more realistic 2-D applications.

Our focus on how the bathymetry and the initial conditions shapes affect reconstruction of the bathymetry from free surface observations is a novel contribution to the literature reviewed in Sec. 2. Additionally, we hope the sensitivity analyses and results observed here can pave the way for more refined approaches to incorporate bathymetry estimates in tsunami models. Extending this approach to 2-D, and verifying the main conclusions observed here, would contribute significantly improve the representation of bathymetry in tsunami models.

Acknowledgements

We would like to thank Dr. Bartosz Protas for his ongoing contribution and feedback on this research. Additionally, we would like to thank Dr. Laurent Debreau and Dr. Arthur Vidard at the Laboratoire Jean Kuntzmann for their suggestions, espcially on Global Sensitivity Analysis, which proved to be a valuable component of this work.

Additional information

Funding

This project was funded by the NSERC Discovery Grant of NKRK, and the research exchange at the Laboratoire Jean Kuntzmann was facilitated by a McMaster-CNRS Fund travel grant.

References

Craig, W. and Sulem, C. 2009. Asymptotics of surface waves over random bathymetry. Quart. Appl. Math. 68, 91–112. doi:https://doi.org/10.1090/S0033-569X-09-01177-9
Web of Science ®Google Scholar
Damen, M., van Dijk, P., Duim, J., van der Werff, H., Krol, B., Masselink, B. and van Ruitenbeek, F. 2005. Characteristics of Tsunamis. Accessed 2020 January 24. Online at: https://webapps.itc.utwente.nl/librarywww/papers_2005/tsunami/Tsunami.pdf
Google Scholar
Dugan, J. P. 1997. Bathymetry measurements from long range airborne imaging systems, In: Proceedings of the 4th International Conference on Remote Sensing for Marine and Coastal Environments. Orlando, FL, ERIM 1, 451–457.
Google Scholar
Ge, H., Liu, H. and Zhang, L. 2020. Accurate depth inversion method for coastal bathymetry: introduction of water wave high-order dispersion relation. J. Mar. Sci. Eng. 8, 153. doi:https://doi.org/10.3390/jmse8030153
Web of Science ®Google Scholar
Gessese, A. and Sellier, M. 2012. A direct solution approach to the inverse shallow-water problem. Math. Prob. Eng. 2012, 417950.
Web of Science ®Google Scholar
Grilli, S. and Subramanya, R. 1996. Numerical modeling of wave breaking induced by fixed or moving boundaries. Comput. Mech. 17, 374–391. doi:https://doi.org/10.1007/BF00363981
Web of Science ®Google Scholar
Grilli, S. T. 1998. Depth inversion in shallow water based on nonlinear properties of shoaling periodic waves. Coast. Eng., 35, 185–209. doi:https://doi.org/10.1016/S0378-3839(98)00035-0
Web of Science ®Google Scholar
Jang, T., Han, S. and Kinoshita, T. 2010. An inverse measurement of the sudden underwater movement of the sea-floor by using the time-history record of the water-wave elevation. Wave Motion 47, 146–155. doi:https://doi.org/10.1016/j.wavemoti.2009.10.002
Web of Science ®Google Scholar
Kevlahan, N., Khan, R. and Protas, B. 2019. On the convergence of data assimilation for the one-dimensional shallow water equations with sparse observations. Adv. Comput. Math. 45, 3195–3216. doi:https://doi.org/10.1007/s10444-019-09733-6
Web of Science ®Google Scholar
Khan, R. A. and Kevlahan, N. K.-R. 2020. Variational assimilation of surface wave data for bathymetry reconstruction. Part II: second order adjoint sensitivity analysis (preprint).
Google Scholar
Liu, Q. and Homma, T. 2009. A new computational method of a moment-independent uncertainty importance measure. Reliab. Eng. Syst. Safety 94, 1205–1211. doi:https://doi.org/10.1016/j.ress.2008.10.005
Web of Science ®Google Scholar
Losch, M. and Wunsch, C. 2003. Bottom topography as a control variable in an ocean model. J. Atmos. Oceanic Technol. 20, 1685–1696. doi:https://doi.org/10.1175/1520-0426(2003)020<1685:BTAACV>2.0.CO;2
Web of Science ®Google Scholar
Lubard, S. C., Krimmel, J. E., Thebaud, L., Evans, D. D. and Shemdin O. H. 1980. Optical image and laser slope meter intercomparison of high-frequency waves. J. Geophys. Res. 85, 4996–5002. doi:https://doi.org/10.1029/JC085iC09p04996
Web of Science ®Google Scholar
MOISE. 2014. Activity Report Project-Team MOISE: Modelling, Observations, Identification for Environmental Sciences. Université Joseph Fourier, Grenoble.
Google Scholar
Nicholls, D. and Taber, M. 2009. Detection of ocean bathymetry from surface wave measurements. Eur. J. Mech. B/Fluids 28, 224–233. doi:https://doi.org/10.1016/j.euromechflu.2008.06.001
Web of Science ®Google Scholar
Ozisik, M. and Orlande, H. R. B. 2000. Inverse Heat Transfer: Fundamentals and Applications. Taylor & Francis, New York.
Google Scholar
Pianosi, F., Beven, K., Freer, J., Hall, J. W., Rougier, J., Stephenson, D. B. and Wagener, T. 2016. Sensitivity analysis of environmental models: a systematic review with practical workflow. Environ. Model. Softw. 79, 214–232. doi:https://doi.org/10.1016/j.envsoft.2016.02.008
Web of Science ®Google Scholar
Pianosi, F. and Wagener, T. 2015. A simple and efficient method for global sensitivity analysis based on cumulative distribution functions. Environ. Model. Softw. 67, 1–11. doi:https://doi.org/10.1016/j.envsoft.2015.01.004
Web of Science ®Google Scholar
Piotrowski, C. C. and Dugan, J. P. 2002. Accuracy of bathymetry and current retrievals from airborne optical time-series imaging of shoaling waves. IEEE Trans. Geosci. Remote Sens. 40 (12), 2606–2618. doi:https://doi.org/10.1109/TGRS.2002.807578
Web of Science ®Google Scholar
Protas, B. 2008. Adjoint-based optimization of PDE systems with alternative gradients. J. Comput. Phys. 227, 6490–6510. doi:https://doi.org/10.1016/j.jcp.2008.03.013
Web of Science ®Google Scholar
Saltelli, A., Annoni, P., Azzini, I., Campolongo, F., Ratto, M. and Tarantola, S. 2010. Variance based sensitivity analysis of model output. Design and estimator for the total sensitivity index. Comput. Phys. Commun. 181, 259–270. doi:https://doi.org/10.1016/j.cpc.2009.09.018
Web of Science ®Google Scholar
Spiteri, R. J. and Ruuth, S. J. 2002. A new class of optimal high-order strong-stability-preserving time discretization methods. SIAM J. Numer. Anal. 40, 469–491. doi:https://doi.org/10.1137/S0036142901389025
Web of Science ®Google Scholar
Tsai, W. and Yue, D. K. P. 1996. Computation of nonlinear free-surface flows. Annu. Rev. Fluid Mech. 28, 249–278. doi:https://doi.org/10.1146/annurev.fl.28.010196.001341
Web of Science ®Google Scholar
Wagener, T. and Pianosi, F. 2019. What has global sensitivity analysis ever done for us? A systematic review to support scientific advancement and to inform policy-making in earth system modelling. Earth-Sci. Rev. 194, 1–18. doi:https://doi.org/10.1016/j.earscirev.2019.04.006
Web of Science ®Google Scholar
Wunsch, C. 1996. The Ocean Circulation Inverse Problem. Cambridge University Press, Cambridge.
Google Scholar

Variational assimilation of surface wave data for bathymetry reconstruction. Part I: algorithm and test cases

Abstract

1. Introduction

2. Review of bathymetry effects and previous work

3. Derivation of the adjoint based data assimilation scheme

4. Initial results using L² gradients

Table 1. Notation used in the derivation of data assimilation scheme of the SWE to find the optimal bathymetry, using same format as given in Table 1 of Kevlahan et al. (Citation2019).

Table 2. Cases considered for data assimilation Algorithm 1.

5. Smoothing using Sobolev gradients