Full article: A reduced basis approach for calculation of the Bethe–Salpeter excitation energies by using low-rank tensor factorisations*

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

ABSTRACT

The Bethe–Salpeter equation (BSE) is a reliable model for estimating the absorption spectra in molecules and solids on the basis of accurate calculation of the excited states from first principles. Direct diagonalisation of the BSE matrix is practically intractable due to O(N⁶) complexity scaling in the size of the atomic orbital basis set, N. In this paper, we introduce and analyse a reduced basis approach to computation of the Bethe–Salpeter excitation energies which can lead to a relaxation of the numerical costs down to O(N³). The BSE operator is specified in terms of the two-electron integrals in the Hartree–Fock molecular orbital basis and the respective energies, calculated by the tensor-based solver described in previous works. The reduced basis method includes two steps. First, the diagonal plus low-rank approximation to fully populated blocks in the BSE matrix is calculated, enabling an easier partial eigenvalue solver for a large auxiliary system relying only on matrix–vector multiplications with rank-structured matrices. Second, a small subset of eigenvectors from the auxiliary eigenvalue problem is selected to build a projection of the exact BSE system onto this reduced basis set. Numerical tests demonstrate the ϵ-rank bounds for the blocks of the BSE matrix on examples of some compact molecules. The accuracy of the reduced basis approach vs. the effective matrix rank is illustrated.

GRAPHICAL ABSTRACT

KEYWORDS:

AMS Subject Classification:

1. Introduction

In modern material science, there is a growing interest to ab initio computation of the absorption spectra for molecules or surfaces of solids. This computational problem can be treated either by using the time-dependent density functional theory (TD-DFT) [Citation1–6] or by solving the Bethe–Salpeter equation (BSE) [Citation7,Citation8] based on the Green's function formalism and many-body perturbation theory [Citation9–13]. A specific choice of the approximate computational model may depend on many physical and implementation aspects, see [Citation9] for the detailed discussion. In particular, the BSE approach leads to the challenging numerical task concerning the solution of large eigenvalue problem for a dense matrix that, in general, is non-symmetric.

In the present paper, we consider the computational aspects of the large-scale algebraic BSE spectral problem when using the data-sparse matrix structures. We follow the particular formulation of the BSE problem based on the non-interacting Green's function via representation in terms of the Hartree–Fock (HF) molecular orbitals (MOs) [Citation14,Citation15], where it was applied to H₂ molecule in the minimal basis of two Slater functions.Footnote¹In the framework of this specific BSE formulation, we focus on the algebraic aspects of solving the computationally extensive spectral problems arising in the case of larger molecular systems. It is demonstrated that this scheme becomes practically applicable to moderate-size molecules when using tensor-structured HF calculations [Citation16–19], accomplished by efficient representation of the two-electron integrals (TEI) in the MO basis in the form of a low-rank Cholesky factorisation [Citation17,Citation20]. In this way, the low-rank representation of the TEI tensor stipulates the beneficial structure of the BSE matrix blocks, thus enabling numerical algorithms of reduced complexity.

It is worth to note that the size of the BSE matrix scales quadratically with the size of the atomic orbital (AO) basis set, O(N²_b), used in ab initio HF calculations. The direct diagonalisation is limited by the O(N⁶_b) complexity, making the problem computationally expensive already for moderately sized molecules with a basis size N_b ≈ 100. Hence, a procedure that relies entirely on multiplication of the governing BSE matrix, or its approximation, with vectors (in the framework of some iterative procedure) is the only viable approach. In turn, fast matrix computations can be based on the use of low-rank representations since such data structures allow efficient storage and fast algebraic operations with linear complexity scaling in the matrix size.

Methods for solving partial eigenvalue problems for matrices with the special structure as in the BSE eigenvalue problem have been intensively studied in the literature. These structures are related to the so-called Hamiltonian matrices, exposing a particular block structure. Papers and books treating Hamiltonian eigenvalue problems include [Citation21–25]; see also the recent survey [Citation26] and the references therein. Special cases of the BSE and other eigenvalue problems related to HF approximations lead to anti-block-diagonal Hamiltonian eigenproblems that can be solved by special techniques based on minimisation principles [Citation27,Citation28]. The algebraic structure of the BSE matrix is not that of a Hamiltonian matrix in the general case, but yields a so-called complex J-symmetric matrix. Theory and numerical solution of such eigenvalue problems are discussed in [Citation29–33], where the particular instance of the BSE matrix is considered in [Citation33]. Other partial eigensolvers tailored for electronic structure calculations are discussed in [Citation34,Citation35]. The reduced basis method for large-scale systems is described in [Citation36].

In this paper, we study a reduced basis approach to the approximate numerical solution of the BSE eigenvalue problem based on model reduction via projection onto a reduced basis, which is constructed by using the eigenvectors of a simplified system matrix obeying a simpler data-sparse structure. The reduced basis method includes two steps. First, the diagonal plus low-rank approximation to the fully populated blocks in the BSE matrix is calculated, enabling an easier partial eigenvalue solver for a large auxiliary system relying only on matrix–vector multiplications with rank-structured matrices. Second, a small subset of eigenvectors from the auxiliary eigenvalue problem is selected to build the projection of the exact BSE system onto this reduced basis set.

The approximation error incurred by the reduced basis approach depending on the rank truncation parameters is investigated. Theoretical and numerical analysis on the existence of the low-rank approximation and the respective rank bounds for matrix blocks in the BSE system matrix are presented. One of the favourable features of the approach is the quadratic convergence rate in approximate excitation energies compared with the accuracy of the reduced basis set thresholded by a rank truncation parameter ϵ > 0 (see Remark 3.3).

The reduced basis approach applies to the BSE system with matrix blocks of size N_oN_v × N_oN_v, where N_o and N_v denote the number of occupied and virtual HF orbitals, respectively, such that N_b = N_o + N_v. Since in general, N_oN_v = O(N²_b), the direct numerical calculation of the matrix elements, based on the precomputed TEI tensor in the HF MO basis, has a storage and numerical cost of the order of O(N⁴_b).

The construction of the reduced basis and the preceding low-rank decomposition of matrix blocks in the Bethe–Salpeter kernel are motivated by the use of truncated Cholesky factorisation of the TEI matrix [Citation17,Citation20]. To that end, the BSE matrix blocks are represented in terms of the precomputed Cholesky factors in the HF MO basis. Along with the diagonal energy matrix, this constitutes the structured representations of the dielectric and response functions, as well as the static screened interaction matrix. Taking into account the rank decomposition of TEI, the above quantities tolerate the low-rank approximation up to a chosen threshold.Footnote²This yields the construction of a so-called simplified BSE matrix with a diagonal plus low-rank structure in the matrix blocks, thus admitting efficient storage and matrix–vector products. The reduced basis is obtained by calculating several of the lowest eigenvectors of the auxiliary eigenvalue problem for the simplified matrix. A projection of the exact BSE matrix onto the reduced basis set and diagonalisation of the arising small-size matrix completes the reduced basis scheme.

Numerical tests for single molecules and finite chains of hydrogen atoms indicate the convergence in the senior excitation energies by the increase of the separation ranks.

The rest of the paper is organised as follows. Section 2 recalls the truncated Cholesky decomposition scheme for low-rank factorisation of the TEI tensor in the HF MO basis, that is the building block in the construction of the BSE matrix. Section 3 describes the algebraic computational scheme for evaluation of the entries in the BSE matrix, analyses the low-rank structure in different matrix blocks and describes the reduced basis approach. Section 4 presents numerical tests for several compact and extended molecules demonstrating the computational features of the reduced basis method applied to the full BSE system as well as to simplified model by the so-called Tamm–Dancoff approximation (TDA).

2. Low-rank approximation of the two-electron integrals in Hartree–Fock calculus

2.1. Cholesky decomposition of the TEI matrix

The numerical treatment of the TEI tensor (also known as the electron repulsion integrals) is one of the bottlenecks in the numerical solution of the HF equation and in density function theory (DFT) calculations for large molecules [Citation37].

Given the AO basis set ${g_{μ}}_{1 \leq μ \leq N_{b}}$ , $g_{μ} \in H^{1} (R^{3})$ , the associated TEI tensor B = [b_μνλσ] [Citation38] is defined entrywise by (2.1) $\begin{matrix} b_{μ ν λ σ} & = & \int_{R^{3}} \int_{R^{3}} \frac{g_{μ} (x) g_{ν} (x) g_{λ} (y) g_{σ} (y)}{∥ x - y ∥} d x d y, \\ μ, ν, λ, σ \in {1, ..., N_{b}} = : I_{b} . \end{matrix}$ (2.1) The corresponding N²_b × N_b² TEI matrix over the large index set $I \times J$ , $I = J = I_{b} \otimes I_{b}$ , with $I_{b} : = {1, ..., N_{b}}$ , $B = mat (B) = [b_{μ ν; λ σ}] \in R^{N_{b}^{2} \times N_{b}^{2}},$ is obtained by matrix unfolding of the tensor B = [b_μνλσ]. The matrix B is proven to be symmetric and positive definite ensuring application of the incomplete Cholesky decomposition [Citation37,Citation39–41]. The tensor-based HF solver [Citation16,Citation19] employs the efficient calculation of the Cholesky factors [Citation17,Citation20] in (2.2) $B \approx L L^{T}, L \in R^{N_{b}^{2} \times R_{B}}, R_{B} = O (N_{b}),$ (2.2) where the adaptively chosen column vectors in B are calculated in an efficient way. This allows the partial decoupling of the index sets {μν} and {λσ}.

Notice that the Cholesky factorisation (Equation2.2(2.2) $B \approx L L^{T}, L \in R^{N_{b}^{2} \times R_{B}}, R_{B} = O (N_{b}),$ (2.2) ) can be written in the following index form: (2.3) $b_{μ ν; λ σ} \approx \sum_{k = 1}^{R_{B}} L_{k} (μ; ν) L_{k} (σ; λ),$ (2.3) where the second factor corresponds to the transposed matrix L^T_k. Here L_k = L_k(μ; ν), k = 1, …, R_B, denotes the N_b × N_b matrix unfolding of the column vector L(:, k) from the Cholesky factor $L \in R^{N_{b}^{2} \times R_{B}}$ .

Numerical experiments indicate that the truncated Cholesky decomposition with the separation rank O(N_b) ensures a satisfactory numerical precision ϵ > 0 of order 10⁻⁵ to 10⁻⁶. The refined rank estimate O(N_b|log ϵ|) was observed for every molecular system considered so far [Citation17,Citation20].

In the standard quantum chemical implementations in the Gaussian-type AO basis, the numerically confirmed rank bound, $rank (B) \leq C_{B} N_{b}$ (C_B is of order 1–10), allows to reduce the complexity of building up the Fock matrix F to O(N³_b), which is by far dominated by the computational cost for the exchange term K(D) (see Section 2.2 ).

2.2. Rank bounds for the TEI matrix V

The 2N_o-electron HF equation for N_o pairwise L²-orthogonal occupied electronic orbitals, $ψ_{i} : R^{3} \to R$ , $ψ_{i} \in H^{1} (R^{3})$ , reads as [Citation38] (2.4) $\begin{matrix} F ψ_{i} (x) & : = & [- \frac{1}{2} Δ + V_{c} (x) + V_{H} (x) + K] ψ_{i} (x) \\ = & ϵ_{i} ψ_{i} (x), \int_{R^{3}} ψ_{i} ψ_{j} d x = δ_{i j}, \end{matrix}$ (2.4) i, j = 1, …, N_o, where the nonlinear Fock operator $F$ on the left-hand side includes the nuclear and Hartree potentials V_c(x) and V_H(x) as well as the integral exchange operator $K$ .

In HF calculations, the full HF operator $F$ is represented in the basis set ${g_{μ}}_{1 \leq μ \leq N_{b}}$ , $g_{μ} \in H^{1} (R^{3})$ , of Gaussian-type AOs. We consider the complete set of HF MOs ${C_{p} \in R^{N_{b}}}$ , i.e. the pth column vectors in the coefficients matrix $C \in R^{N_{b} \times N_{b}}$ , and the corresponding energies {ϵ_p}, p = 1, 2, …, N_b. The occupied MOs ψ_i are represented (approximately) by the coefficient matrix $C_{o} = [C_{μ i}] \in R^{N_{b} \times N_{o}}$ (submatrix of C) as ψ_i = ∑^N_b_{μ = 1}C_μig_μ, i = 1, …, N_o.

The coefficients matrix $C \in R^{N_{b} \times N_{b}}$ solves the nonlinear eigenvalue problem (2.5) $\begin{matrix} [H + J (D) + K (D)] C & = & S C E, E = diag (ϵ_{1}, ..., ϵ_{N_{b}}), \\ C^{T} S C & = & I_{N_{b}}, \end{matrix}$ (2.5) where H is the core Hamiltonian matrix, S is the mass matrix and (2.6) $\begin{matrix} J {(D)}_{μ ν} & = & \sum_{κ, λ = 1}^{N_{b}} b_{μ ν, κ λ} D_{κ λ}, \\ K {(D)}_{μ ν} & = & - \frac{1}{2} \sum_{κ, λ = 1}^{N_{b}} b_{μ λ, ν κ} D_{κ λ}, \end{matrix}$ (2.6) with $D = 2 C_{o} C_{o}^{T} \in R^{N_{b} \times N_{b}}$ denoting the rank-N_o symmetric density matrix.

In BSE calculations, the TEI tensor B = [b_μνλσ], corresponding to the AO basis set, is represented in the MO basis: (2.7) $\begin{matrix} B \mapsto V = [v_{i a j b}] : & v_{i a j b} = \sum_{μ, ν, λ, σ = 1}^{N_{b}} C_{μ i} C_{ν a} C_{λ j} C_{σ b} b_{μ ν, λ σ}, \\ a, b, i, j \in {1, ..., N_{b}} . \end{matrix}$ (2.7) The BSE calculations utilise the two subtensors of V specified by the index sets $I_{o} : = {1, ..., N_{o}}$ and $I_{v} : = {N_{o} + 1, ..., N_{b}}$ , with N_o denoting the number of occupied orbitals. The first subtensor is defined as in the MP2 calculations [Citation17]: (2.8) $V = [v_{i a j b}] : a, b \in I_{v}, i, j \in I_{o},$ (2.8) while the second one lives on the extended index set: (2.9) $\hat{V} = [{\hat{v}}_{t u r s}] : r, s \in I_{v}, t, u \in I_{o} .$ (2.9)

In the following, {C_i} and {C_a} denote the sets of occupied and virtual orbitals, respectively. We shall also use the notation N_v = N_b − N_o, N_ov = N_oN_v.

Denote the associated matrix by $V = [v_{i a, j b}] \in R^{N_{o v} \times N_{o v}}$ in case (Equation2.8(2.8) $V = [v_{i a j b}] : a, b \in I_{v}, i, j \in I_{o},$ (2.8) ), and similar by $\hat{V} = [{\hat{v}}_{t u, r s}] \in R^{N_{o}^{2} \times N_{v}^{2}}$ in case (Equation2.9(2.9) $\hat{V} = [{\hat{v}}_{t u r s}] : r, s \in I_{v}, t, u \in I_{o} .$ (2.9) ). The straightforward computation of the matrix V by the above representations accounts for the dominating impact on the overall numerical cost of order O(N⁵_b) in the evaluation of the block entries in the BSE matrix. A method of complexity O(N⁴_b) based on the low-rank tensor decomposition of the matrix V on the full index set was described in [Citation17].

It can be shown that the rank R_B = O(N_b) approximation to matrix B ≈ LL^T with the N × R_B Cholesky factor, L, allows to introduce the low-rank representation of the tensor V, and then to reduce the asymptotic complexity of calculations to O(N⁴_b) (see [Citation17]). Indeed, let C_m be the mth column of the coefficient matrix $C = {C_{μ i}} \in R^{N_{b} \times N_{b}}$ . Then, inserting (Equation2.3(2.3) $b_{μ ν; λ σ} \approx \sum_{k = 1}^{R_{B}} L_{k} (μ; ν) L_{k} (σ; λ),$ (2.3) ) in (Equation2.7(2.7) $\begin{matrix} B \mapsto V = [v_{i a j b}] : & v_{i a j b} = \sum_{μ, ν, λ, σ = 1}^{N_{b}} C_{μ i} C_{ν a} C_{λ j} C_{σ b} b_{μ ν, λ σ}, \\ a, b, i, j \in {1, ..., N_{b}} . \end{matrix}$ (2.7) ) in the case of (Equation2.8(2.8) $V = [v_{i a j b}] : a, b \in I_{v}, i, j \in I_{o},$ (2.8) ) leads to (2.10) $\begin{matrix} v_{i a j b} & = & \sum_{k = 1}^{R_{B}} \sum_{μ, ν, λ, σ = 1}^{N_{b}} C_{μ i} C_{ν a} C_{λ j} C_{σ b} L_{k} (μ; ν) L_{k} (σ; λ) \\ = & \sum_{k = 1}^{R_{B}} (\sum_{μ, ν = 1}^{N_{b}} C_{μ i} C_{ν a} L_{k} (μ; ν)) (\sum_{λ, σ = 1}^{N_{b}} C_{λ j} C_{σ b} L_{k} (σ; λ)) \\ = & \sum_{k = 1}^{R_{B}} (C_{i}^{T} L_{k} C_{a}) (C_{b}^{T} L_{k}^{T} C_{j}) = \sum_{k = 1}^{R_{B}} (C_{i}^{T} L_{k} C_{a}) {(C_{j}^{T} L_{k} C_{b})}^{T} . \end{matrix}$ (2.10) A similar factorisation can be derived in the case of (Equation2.9(2.9) $\hat{V} = [{\hat{v}}_{t u r s}] : r, s \in I_{v}, t, u \in I_{o} .$ (2.9) ). The precise formulation is given by the following lemma [Citation17], which will be used in further considerations.

Lemma 2.1:

Let the rank-R_B Cholesky decomposition of the matrix B be given by (Equation2.2(2.2) $B \approx L L^{T}, L \in R^{N_{b}^{2} \times R_{B}}, R_{B} = O (N_{b}),$ (2.2) ), then the matrix unfolding V = [v_{ia; jb}] corresponding to (Equation2.8(2.8) $V = [v_{i a j b}] : a, b \in I_{v}, i, j \in I_{o},$ (2.8) ) allows a rank decomposition with $rank (V) \leq R_{B}$ . The R_B-term representation of the matrix V = [v_{ia; jb}] takes the following form: $V = L_{V} L_{V}^{T}, L_{V} \in R^{N_{o v} \times R_{B}},$ where the columns of L_V are given by $\begin{matrix} L_{V} ((i - 1) N_{v i r} + a - N_{o r b}; k) = C_{i}^{T} L_{k} C_{a}, \\ k = 1, ..., R_{B}, a \in I_{v}, i \in I_{o} . \end{matrix}$ On the index set (Equation2.9(2.9) $\hat{V} = [{\hat{v}}_{t u r s}] : r, s \in I_{v}, t, u \in I_{o} .$ (2.9) ), we have $\hat{V} = U_{\hat{V}} W_{\hat{V}}^{T} \in R^{N_{o}^{2} \times N_{v}^{2}}$ with $U_{\hat{V}} \in R^{N_{o}^{2} \times R_{B}}$ , $W_{\hat{V}} \in R^{N_{v}^{2} \times R_{B}}$ .

The numerical cost is determined by the computation complexity and storage size for the factors L_V, $U_{\hat{V}}$ and $W_{\hat{V}}$ in the above rank-structured factorisations.

Lemma 2.1 provides the upper bounds on $rank (V)$ in the representation (Equation2.10(2.10) $\begin{matrix} v_{i a j b} & = & \sum_{k = 1}^{R_{B}} \sum_{μ, ν, λ, σ = 1}^{N_{b}} C_{μ i} C_{ν a} C_{λ j} C_{σ b} L_{k} (μ; ν) L_{k} (σ; λ) \\ = & \sum_{k = 1}^{R_{B}} (\sum_{μ, ν = 1}^{N_{b}} C_{μ i} C_{ν a} L_{k} (μ; ν)) (\sum_{λ, σ = 1}^{N_{b}} C_{λ j} C_{σ b} L_{k} (σ; λ)) \\ = & \sum_{k = 1}^{R_{B}} (C_{i}^{T} L_{k} C_{a}) (C_{b}^{T} L_{k}^{T} C_{j}) = \sum_{k = 1}^{R_{B}} (C_{i}^{T} L_{k} C_{a}) {(C_{j}^{T} L_{k} C_{b})}^{T} . \end{matrix}$ (2.10) ) which might be reduced by the ϵ-rank truncation. It can be shown that the ϵ-rank of the matrix V remains of the same magnitude as that for the TEI matrix B obtained by its ϵ-rank truncated Cholesky factorisation (see the numerical illustration in Section 3.2).

Numerical tests in [Citation17] indicate that the singular values of the TEI matrix B decay exponentially as (2.11) $σ_{k} \leq C e^{- \frac{z}{N_{b}} k},$ (2.11) where the constant z > 0 depends weakly on the molecule configuration. If we define R_B(ϵ) as the minimal number satisfying the condition (2.12) $\sum_{k = R_{B} (ϵ) + 1}^{R_{B}} σ_{k}^{2} \leq ϵ^{2},$ (2.12) then the estimate (Equation2.11(2.11) $σ_{k} \leq C e^{- \frac{z}{N_{b}} k},$ (2.11) ) leads to the ϵ-rank bound R_B(ϵ) ≤ CN_b|log ϵ|, which will be postulated in the following discussion.

Our goal is to justify that R_V(ϵ) increases only logarithmically in ϵ, similar to the bound for R_B(ϵ). To that end, we introduce the singular value decomposition (SVD) decomposition of the matrix B, $B = U D_{B} U^{T}, U \in R^{N_{b}^{2} \times R_{B}}, D_{B} \in R^{R_{B} \times R_{B}},$ which can be written in the following index form: (2.13) $b_{μ ν; λ σ} = \sum_{k = 1}^{R_{B}} σ_{k} U_{k} (μ; ν) U_{k} (σ; λ),$ (2.13) with $U_{k} = [U_{k} (μ; ν)] \in R^{N_{b} \times N_{b}}$ and ‖U_k‖_F = 1, k = 1, …, R_B.

Lemma 2.2:

For given ϵ > 0, there exists a rank-r approximation V_r to the matrix V, and a constant C > 0 not depending on ϵ, such that r ≤ R_B(ϵ) and $∥ V_{r} - V ∥ \leq C N_{b} ϵ | log ϵ | .$

Proof.

We estimate the R_B(ϵ)-term truncation error by using the representation (Equation2.13(2.13) $b_{μ ν; λ σ} = \sum_{k = 1}^{R_{B}} σ_{k} U_{k} (μ; ν) U_{k} (σ; λ),$ (2.13) ), (2.14) $\begin{matrix} v_{i a j b} & = & \sum_{k = 1}^{R_{B}} σ_{k} \sum_{μ, ν, λ, σ = 1}^{N_{b}} C_{μ i} C_{ν a} C_{λ j} C_{σ b} U_{k} (μ; ν) U_{k} (σ; λ) \\ = & \sum_{k = 1}^{R_{B}} σ_{k} (\sum_{μ, ν = 1}^{N_{b}} C_{μ i} C_{ν a} U_{k} (μ; ν)) \\ \times (\sum_{λ, σ = 1}^{N_{b}} C_{λ j} C_{σ b} U_{k} (σ; λ)) \\ = & \sum_{k = 1}^{R_{B}} σ_{k} (C_{i}^{T} U_{k} C_{a}) (C_{b}^{T} U_{k}^{T} C_{j}) \\ = & \sum_{k = 1}^{R_{B}} σ_{k} (C_{i}^{T} U_{k} C_{a}) {(C_{j}^{T} U_{k} C_{b})}^{T}, \end{matrix}$ (2.14) which can be presented in the matrix form V = ∑^R_B_{k = 1}σ_kV_kV^T_k, where V_k(i; a) = C^T_iU_kC_a. By definition of R_B(ϵ), we have (Equation2.12(2.12) $\sum_{k = R_{B} (ϵ) + 1}^{R_{B}} σ_{k}^{2} \leq ϵ^{2},$ (2.12) ). Hence, the error of the rank-R_B(ϵ) approximation defined by V_r = ∑^R_B(ϵ)_{k = 1}σ_kV_kV^T_k, can be bounded by (2.15) $\begin{matrix} ∥\sum_{k = R_{B} (ϵ) + 1}^{R_{B}} σ_{k} V_{k} V_{k}^{T}∥ & \leq & {(\sum_{k = R_{B} (ϵ) + 1}^{R_{B}} σ_{k}^{2})}^{1 / 2} {(\sum_{k = R_{B} (ϵ) + 1}^{R_{B}} {∥ V_{k} ∥}^{4})}^{1 / 2} \\ \leq & ϵ {(\sum_{k = R_{B} (ϵ) + 1}^{R_{B}} {∥ V_{k} ∥}^{4})}^{1 / 2} \\ \leq & ϵ (R_{B} - R_{B} (ϵ)) {∥ C ∥}_{I_{o}}^{2} {∥ C ∥}_{I_{v}}^{2}, \end{matrix}$ (2.15) taking into account that ‖U_k‖ = 1, k = 1, …, R_B, and the Frobenius norm estimate $\begin{matrix} ∥ V_{k} ∥^{2} & = & ∥ V_{k} {(i; a) ∥}_{F}^{2} = ∥ C_{i}^{T} U_{k} C_{a} ∥_{F}^{2} \leq ∥ U_{k} ∥^{2} \sum_{i, a} ∥ C_{i} ∥^{2} {∥ C_{a} ∥}^{2} \\ \leq & \sum_{i \in I_{o}} ∥ C_{i} ∥^{2} \sum_{a \in I_{v}} {∥ C_{a} ∥}^{2} \end{matrix}$ holds. We suppose that R_B = O(N_b|log ϵ|), then the multiple of ϵ|log ϵ| in (Equation2.15(2.15) $\begin{matrix} ∥\sum_{k = R_{B} (ϵ) + 1}^{R_{B}} σ_{k} V_{k} V_{k}^{T}∥ & \leq & {(\sum_{k = R_{B} (ϵ) + 1}^{R_{B}} σ_{k}^{2})}^{1 / 2} {(\sum_{k = R_{B} (ϵ) + 1}^{R_{B}} {∥ V_{k} ∥}^{4})}^{1 / 2} \\ \leq & ϵ {(\sum_{k = R_{B} (ϵ) + 1}^{R_{B}} {∥ V_{k} ∥}^{4})}^{1 / 2} \\ \leq & ϵ (R_{B} - R_{B} (ϵ)) {∥ C ∥}_{I_{o}}^{2} {∥ C ∥}_{I_{v}}^{2}, \end{matrix}$ (2.15) ) does not depend on ϵ, which proves our lemma.

The storage cost of these decompositions restricted to the active index set $I_{v} \times I_{o}$ amounts to R_V(ϵ)N_vN_o.

represents the singular values of the matrix V for H₃₂ chain, N₂H₄ and C₂H₅NO₂ (Glycine amino acid) molecules with the size of the basis set (N_b, N_o) equal to (128, 16), (82, 9) and (170, 20), respectively. They indicate that R_V(ϵ) is linearly proportional to |log ϵ|. Moreover, it is of the same order of magnitude as R_B(ϵ) (see [Citation20]).

Figure 1. Decay of singular values of the matrix V for H₃₂-chain (1792 × 1792), N₂H₄ (657 × 657), and C₂H₅NO₂ (3000 × 3000) molecules. Size of V is given in brackets.

Figure 1. Decay of singular values of the matrix V for H32-chain (1792 × 1792), N2H4 (657 × 657), and C2H5NO2 (3000 × 3000) molecules. Size of V is given in brackets.

The calculation of V_r is based on the reduced truncated SVD algorithm applied to the initial rank-R_B Cholesky decomposition of the matrix V inherited from that for the TEI matrix B (see Lemma 2.1). Complexity of this straightforward computation on the active index set can be estimated by O(R²_BN_ov) = O(N²_bN_ov).

3. Tensor factorisation of the BSE matrix blocks

Here we discuss the main ingredients for calculation of blocks in the BSE matrix and their reduced rank approximate representation. We compose the 2N_ov × 2N_ov BSE matrix by Equations (46a) and (46b) in [Citation14], though the construction of static screened interaction matrix w(ij, ab) in Equation (Equation3.4(3.4) $a_{i a, j b} : = Δ ϵ_{i a, j b} + v_{i a, j b} - {\hat{w}}_{i j, a b},$ (3.4) ) may slightly differ.

3.1. Tensor representations using TEI matrix in MTO basis

Construction of the BSE matrix includes computation of several auxiliary quantities. First, introduce a fourth-order diagonal ‘energy’ matrix by $Δ ϵ = [Δ ϵ_{i a, j b}] \in R^{N_{o v} \times N_{o v}} : Δ ϵ_{i a, j b} = (ϵ_{a} - ϵ_{i}) δ_{i j} δ_{a b},$ that can be represented in the Kronecker product form $Δ ϵ = I_{o} \otimes diag {ϵ_{a} : a \in I_{v}} - diag {ϵ_{i} : i \in I_{o}} \otimes I_{v},$ where I_o and I_v are the identity matrices on respective index sets. It is worth noting that if the so-called homo–lumo gap of the system is positive, i.e. $ϵ_{a} - ϵ_{i} > δ > 0, a \in I_{v}, i \in I_{o},$ then the matrix $Δ ϵ$ is invertible.

Using the matrix $Δ ϵ$ and the N_ov × N_ov TEI matrix V = [v_{ia, jb}] represented in the MO basis as in (Equation2.7(2.7) $\begin{matrix} B \mapsto V = [v_{i a j b}] : & v_{i a j b} = \sum_{μ, ν, λ, σ = 1}^{N_{b}} C_{μ i} C_{ν a} C_{λ j} C_{σ b} b_{μ ν, λ σ}, \\ a, b, i, j \in {1, ..., N_{b}} . \end{matrix}$ (2.7) ), the dielectric function (N_ov × N_ov matrix) Z = [z_{pq, rs}] is defined by $z_{p q, r s} : = δ_{p r} δ_{q s} - v_{p q, r s} {[χ_{0} (ω = 0)]}_{r s, r s},$ with $χ_{0} (ω)$ being the matrix form of the so-called Lehmann representation to the response function. In turn, the matrix representation of the inverse of $χ_{0} (ω)$ is known to have the following form: $χ_{0}^{- 1} (ω) = - (\begin{matrix} Δ ϵ & 0 \\ 0 & Δ ϵ \end{matrix}) + ω (\begin{matrix} 1 & 0 \\ 0 & - 1 \end{matrix}),$ implying $χ_{0} (0) = - (\begin{matrix} Δ ϵ^{- 1} & 0 \\ 0 & Δ ϵ^{- 1} \end{matrix}) .$

Let $1 \in R^{N_{o v}}$ and $d_{ϵ} = diag {Δ ϵ^{- 1}} \in R^{N_{o v}}$ be the all-ones and diagonal vectors of $Δ ϵ^{- 1}$ , respectively, specifying the rank-1 matrix 1⊗d_ϵ. In this notation, the matrix Z = [z_{pq, rs}] takes a compact form (3.1) $Z = I_{o} \otimes I_{v} + V ⊙ (1 \cdot d_{ϵ}^{T}),$ (3.1) where ⊙ denotes the Hadamard product of matrices. Introducing the inverse matrix Z⁻¹, we finally define the so-called static screened interaction matrix (tensor) by (3.2) $W = [w_{p q, r s}] : w_{p q, r s} : = \sum_{t \in I_{v}, u \in I_{o}} z_{p q, t u}^{- 1} v_{t u, r s} .$ (3.2) In the forthcoming calculations, this equation should be considered on the conventional and extended index sets ${p, s \in I_{o}} \times {q, r \in I_{v}}$ and ${p, q \in I_{o}} \times {r, s \in I_{v}}$ , respectively, such that v_{tu, rs} corresponds either to subtensor in (Equation2.8(2.8) $V = [v_{i a j b}] : a, b \in I_{v}, i, j \in I_{o},$ (2.8) ) or in (Equation2.9(2.9) $\hat{V} = [{\hat{v}}_{t u r s}] : r, s \in I_{v}, t, u \in I_{o} .$ (2.9) ).

Hence, on the conventional index set, we obtain the following matrix factorisation of W := [w_{ia, jb}]: $W = Z^{- 1} V provided that a, b \in I_{v}, i, j \in I_{o},$ where V is calculated by (Equation2.8(2.8) $V = [v_{i a j b}] : a, b \in I_{v}, i, j \in I_{o},$ (2.8) ). Lemma 2.1 suggests the existence of a low-rank factorisation for the matrix W defined above.

Lemma 3.1:

Let the matrix Z defined by (Equation3.1(3.1) $Z = I_{o} \otimes I_{v} + V ⊙ (1 \cdot d_{ϵ}^{T}),$ (3.1) ) over the index set $a, b \in I_{v}, i, j \in I_{o}$ be invertible. Then the rank of the respective matrix W = Z⁻¹V is bounded by $r a n k (W) \leq r a n k (V) \leq R_{B} .$

Proof.

Lemma 2.1 proves the representation $V = L_{V} L_{V}^{T}, L_{V} \in R^{N_{o v} \times R_{B}}$ , which ensures the rank-R_B factorisation $W = Z^{- 1} V = (Z^{- 1} L_{V}) L_{V}^{T},$ which can be calculated by solving linear system with structured data (see Section 3.2).

Furthermore, Equation (46a) in [Citation14] includes matrix entries w_{ij, ab} for $a, b \in I_{v}, i, j \in I_{o}$ . To this end, the modified matrix $\hat{W} = [{\hat{w}}_{p q, r s}]$ is computed by (Equation3.2(3.2) $W = [w_{p q, r s}] : w_{p q, r s} : = \sum_{t \in I_{v}, u \in I_{o}} z_{p q, t u}^{- 1} v_{t u, r s} .$ (3.2) ) on the index set ${p, q \in I_{o}} \times {r, s \in I_{v}}$ by using entries ${\hat{v}}_{i j, a b}$ in the matrix unfolding of the tensor $\hat{V}$ in (Equation2.9(2.9) $\hat{V} = [{\hat{v}}_{t u r s}] : r, s \in I_{v}, t, u \in I_{o} .$ (2.9) ) multiplied from the left with the N²_o × N_o² submatrix of Z⁻¹.

Now the matrix representation of the BSE in the (ov, vo) subspace reads as the following eigenvalue problem determining the excitation energies ω_n: (3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) where the matrix blocks are defined in the index notation by (see (46a) and (46b) in [Citation14] for more details): (3.4) $a_{i a, j b} : = Δ ϵ_{i a, j b} + v_{i a, j b} - {\hat{w}}_{i j, a b},$ (3.4) (3.5) $b_{i a, j b} : = v_{i a, b j} - w_{i b, a j}, a, b \in I_{v}, i, j \in I_{o} .$ (3.5) In the matrix form, we obtain $A = Δ ϵ + V - \overline{W},$ where the matrix elements in $\overline{W} = [{\overline{w}}_{i a, j b}]$ are defined by ${\overline{w}}_{i a, j b} = {\hat{w}}_{i j, a b}$ , computed by (Equation3.2(3.2) $W = [w_{p q, r s}] : w_{p q, r s} : = \sum_{t \in I_{v}, u \in I_{o}} z_{p q, t u}^{- 1} v_{t u, r s} .$ (3.2) ) and (Equation2.9(2.9) $\hat{V} = [{\hat{v}}_{t u r s}] : r, s \in I_{v}, t, u \in I_{o} .$ (2.9) ) as described above. Here the diagonal plus low-rank sparsity structure in $Δ ϵ + V$ can be recognised in view of Lemma 2.1. For the matrix block B, we have $B = \tilde{V} - \tilde{W} = V - \tilde{W},$ where the matrix $\tilde{V}$ , corresponding to the partly transposed tensor, is defined entrywise by $\tilde{V} = [{\tilde{v}}_{i a j b}] : = [v_{i a b j}] = [v_{i a j b}],$ thus coinciding with V in (Equation2.8(2.8) $V = [v_{i a j b}] : a, b \in I_{v}, i, j \in I_{o},$ (2.8) ) due to the symmetry properties. Here $\tilde{W}$ is defined by permutation, $\tilde{W} = [{\tilde{w}}_{i a, j b}] = [w_{i b, a j}]$ . In the following, we investigate the ϵ-rank structure in the matrix blocks A and B resulting from the corresponding factorisations of V.

Solutions of Equation (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ) come in pairs: excitation energies ω_n with eigenvectors (x_n, y_n), and de-excitation energies −ω_n with eigenvectors (x*_n, y_n*).

The block structure in the matrices A and B is inherited from the symmetry of the TEI matrix V, v_{ia, jb} = v*_{ai, bj} and the matrix W, w_{ia, jb} = w*_{bj, ai}. In particular, it is known from the literature that the matrix A is Hermitian and the matrix B is (complex) symmetric (since v_{ia, bj} = v_{jb, ai} and w_{ib, aj} = w_{ja, bi}), which we control in the matrix construction (see also [Citation33] for implications on the algebraic properties of the BSE matrix).

In the following, we confine ourselves to the case of real spin orbitals, i.e. the matrices A and B remain real. It is known that for the real spin orbitals and if A + B and A − B are positive definite, the problem can be transformed into a half-size symmetric eigenvalue equation [Citation3]. Indeed, in this case for every eigenpair, we have $A x + B y = ω x, B x + A y = - ω y,$ implying $\begin{matrix} (A + B) (x + y) & = & ω (x - y), \\ (A - B) (x - y) & = & ω (x + y) . \end{matrix}$ Now, if A + B and A − B are both positive definite, then the previous equations transform to (3.6) $\begin{matrix} M z = ω^{2} z with M = {(A - B)}^{1 / 2} (A + B) {(A - B)}^{1 / 2}, \end{matrix}$ (3.6) with respect to the normalised eigenvectors $z = \sqrt{ω} {(A - B)}^{1 / 2} (x + y)$ . However, in this case the computation of the large fully populated matrix (A − B)^1/2 may become the bottleneck.

The dimension of the matrix in (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ) is 2N_oN_v × 2N_oN_v, where N_o and N_v denote the number of occupied and virtual orbitals, respectively. In general, N_oN_v is asymptotically of the size O(N²_b), i.e. the spectral problem (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ) may be computationally extensive. Indeed, the direct eigenvalue solver for (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ) (diagonalisation) becomes infeasible due to O(N⁶_b) complexity scaling. Furthermore, the numerical cost for calculation of the matrix elements based on the precomputed TEI integrals from the HF equation scales as O(N³_b)–O(N⁴_b), depending on how to compute the matrix W. Here, the low-rank structure in the matrix V can be adapted.

The challenging computational tasks arise in the case of lattice-structured compounds, where the number of basis functions increases proportionally to the lattice size L × L × L, i.e. N_b ∼ n₀L³, that quickly leads to intractable problems even for small lattices.

3.2. The reduced basis approach using low-rank approximations

The large matrix size in Equation (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ) makes the solution of the full eigenvalue problem computationally intractable even for moderate-size molecules, not to speak of lattice-structured compounds. Hence, in realistic quantum chemical simulations of excitation energies, the calculation of several (tens of) eigenpairs may be sufficient.

In the following, we show that the part $Δ ϵ + V$ in the matrix block A has diagonal plus low-rank (DPLR) structure, while the submatrix $\tilde{V}$ in the block B exhibits low-rank approximability. Taking into account these structures, we propose a special partial eigenvalue problem solver based on the use of a reduced basis obtained from the eigenvectors of the reduced matrix that picks up only the essential part of the initial BSE matrix with the DPLR structure. The iterative solver is based on fast matrix–vector multiplication and efficient storage of all data involved in the computational scheme. Using the reduced basis approach, we then approximate the initial problem by its projection onto a reduced basis of moderate size.

We begin with the low-rank decomposition of the matrix V, $V \approx L_{V} L_{V}^{T}, L_{V} \in R^{N_{o v} \times R_{V}}, R_{V} \leq R_{B},$ where the rank parameter R_V = R_V(ϵ) = O(N_b|log ϵ|) can be optimised depending on the truncation error ϵ > 0 (see [Citation17] and Section 2.2).

First, we represent all matrix blocks and intermediate matrices included in the representation of the BSE matrix by using the above decomposition and diagonal matrices as follows. The properties of the Hadamard product imply that the matrix Z exhibits the representation $Z = I_{o} \otimes I_{v} + L_{V} L_{V}^{T} ⊙ (1 \cdot d_{ϵ}^{T}) = I_{N_{o v}} + L_{V} {(L_{V} ⊙ d_{ϵ})}^{T},$ where the rank of the second summand does not exceed R_V. Hence, the linear system solver W = Z⁻¹V can be implemented by algorithms tailored to the DPLR structure by adapting the Sherman–Morrison formula.

The computational cost for setting up the full BSE matrix F in (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ) can be estimated by O(N²_ov), which includes the cost O(N_ovR_B) for generation the matrix V and the dominating cost O(N²_ov) for setting up $\overline{W}$ .

In the following, we rewrite the spectral problem (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ) in the following equivalent form: (3.7) $F_{1} (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ - B^{*} & - A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) .$ (3.7)

The main idea of the reduced basis approach proposed in this paper is as follows. Instead of solving the partial eigenvalue problem for finding of, say, m₀ eigenpairs satisfying Equation (Equation3.7(3.7) $F_{1} (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ - B^{*} & - A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) .$ (3.7) ), we first solve the slightly simplified auxiliary spectral problem with a modified matrix F₀. The approximation F₀ is obtained from F₁ by using low-rank approximations of the parts $\overline{W}$ and $\tilde{W}$ of the matrix blocks A and B, respectively, i.e. A and B are replaced by (3.8) $A_{0} : = Δ ϵ + V - {\overline{W}}_{r} and B_{0} : = V - {\tilde{W}}_{r},$ (3.8) respectively. Here we assume that the matrix V is already represented in the low-rank format inherited from the Cholesky decomposition of the TEI matrix B.

The modified auxiliary problem reads (3.9) $F_{0} (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) \equiv (\begin{matrix} A_{0} & B_{0} \\ - B_{0}^{*} & - A_{0}^{*} \end{matrix}) (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) = λ_{n} (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) .$ (3.9) This eigenvalue problem is much simpler than that in (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ), since now the matrix blocks A₀ and B₀, defined in (Equation3.8(3.8) $A_{0} : = Δ ϵ + V - {\overline{W}}_{r} and B_{0} : = V - {\tilde{W}}_{r},$ (3.8) ), are composed of diagonal and low-rank matrices.

Having at hand the set of m₀ eigenpairs computed for the modified (reduced model) problem (Equation3.9(3.9) $F_{0} (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) \equiv (\begin{matrix} A_{0} & B_{0} \\ - B_{0}^{*} & - A_{0}^{*} \end{matrix}) (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) = λ_{n} (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) .$ (3.9) ), {(λ_n, ψ_n) = (λ_n, (u_n, v_n)^T)}, we solve the full eigenvalue problem for the reduced matrix obtained by projection of the initial equation onto the problem adapted small basis set {ψ_n} of size m₀.

Define a matrix $G_{1} = ψ_{n} (:, 1 : m_{0}) \in R^{2 N_{o v} \times m_{0}}$ whose columns present the spanning vectors of the reduced basis, compute the stiffness and mass matrices by projection onto the reduced basis specified by the columns in G₁, $M_{1} = G_{1}^{T} F_{1} G_{1}, S_{1} = G_{1}^{T} G_{1} \in R^{m_{0} \times m_{0}},$ and then solve the projected generalised eigenvalue problem of small size m₀ × m₀, (3.10) $M_{1} Y = γ_{n} S_{1} Y, Y \in R^{m_{0}} .$ (3.10) The portion of small eigenvalues γ_n, n = 1, …, m₀, is thought to be very close to the corresponding excitation energies ω_n, (n = 1, …, m₀) in the initial spectral problem (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ). illustrates that the larger the size m₀ of the reduced basis is, the better is the accuracy of the lowest excitation energy γ₁, as to be expected.

Table 1. The error |γ₁ − ω₁| vs. the size of reduced basis, m₀.

Download CSV Display Table

Remark 3.2:

Notice that the matrix $\overline{W}$ might have rather large ϵ-rank for small values of ϵ, which increases the cost of high-accuracy solutions. Numerical tests show (see ) that the ϵ-rank approximation to the matrix $\overline{W}$ with a moderate rank parameter allows for a numerical error in the excitation energies of the order of few percents. For this reason, we study another approximation strategy in which the rank approximation of the matrix $\overline{W}$ remains fixed, while the matrices V and $\tilde{W}$ are substituted by their adaptive ϵ-rank approximations (see ).

Matrix blocks in the auxiliary equation (Equation3.9(3.9) $F_{0} (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) \equiv (\begin{matrix} A_{0} & B_{0} \\ - B_{0}^{*} & - A_{0}^{*} \end{matrix}) (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) = λ_{n} (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) .$ (3.9) ) are obtained by rather rough ϵ-rank approximation to the initial system matrix. However, we observe much smaller approximations error γ_n − ω_n for the solution of the projected reduced basis system (Equation3.10(3.10) $M_{1} Y = γ_{n} S_{1} Y, Y \in R^{m_{0}} .$ (3.10) ) compared with that for auxiliary equation (Equation3.9(3.9) $F_{0} (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) \equiv (\begin{matrix} A_{0} & B_{0} \\ - B_{0}^{*} & - A_{0}^{*} \end{matrix}) (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) = λ_{n} (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) .$ (3.9) ). Numerical tests indicate that the difference γ_n−ω_n behaves merely quadratically in the rank truncation parameter ϵ.

Remark 3.3:

In the case of a symmetric matrix, the above-mentioned effect of ‘quadratic’ convergence rate can be justified by a well-known property of the quadratic error behaviour in the approximate eigenvalue, computed by the Rayleigh quotient with respect to the perturbed eigenvector (vectors of the reduced basis ψ_n in our construction), compared with the perturbation error in the eigenvector, which is of order O(ϵ). This beneficial property may explain the efficiency of the proposed reduced basis approach.

In the particular BSE formulation based on the HF MO basis, we may have a slight perturbation of the symmetry in the matrix block $\overline{W}$ , i.e. the above argument does not apply directly. However, we observe the same quadratic error decay in all numerical experiments implemented so far.

It is also worth to note that due to the symmetry features of the eigenproblem, the approximation computed by the reduced basis approach is always an upper bound of the true excitation energies obtained from the full BSE model. Again, this is a simple consequence of the variational properties of the Ritz values being upper bounds on the smaller eigenvalues for symmetric matrices. The ‘upper bound’ character is also clearly visible in the figures in Section 4.

4. Numerical tests for the reduced basis method

In this section, we present numerical illustrations of the reduced basis approach applied to the BSE problem for single molecules and finite chains of hydrogen atoms. The TEI tensor and MOs are obtained from ab initio HF calculations using tensor-structured solver [Citation16–19] implemented in MATLAB®.

Both the core Hamiltonian and two-electron (repulsion) integrals are computed by rank-structured algorithms using the discrete representation of the AO basis functions on fine n × n × n three-dimensional (3D) Cartesian grids [Citation17,Citation20]. In TEI calculations for molecules, the basis functions and convolution kernels involved are represented on fine 3D grids of size up to 131, 726, ³ which guaranties the sufficient accuracy of numerical quadratures. The TEI matrix is precomputed in the form of low-rank Cholesky factorisation by tensor-based algorithm incorporating 1D density fitting [Citation20].

4.1. Reduced basis method for the BSE system

The numerical examples below demonstrate that a small reduced basis set, obtained by separable approximation of the BSE matrix blocks with rank parameters of about several tens, allows to reveal several of the lowest excitation energies. Accuracy is controlled by the rank truncation threshold. Examples below utilise the grid representation of the Gaussian basis sets of type cc-pDVZ (see e.g. [Citation42,Citation43]).

presents the size of GTO basis set, N_b, and the number of MOs, N_o, in the numerical examples considered.

Table 2. The number of Gaussian type orbital (GTO) basis functions, N_b, and molecular orbitals, N_o.

Download CSV Display Table

shows numerics for H₂O (360 × 360), N₂H₄ (1314 × 1314) and C₂H₅OH (2860 × 2860), where the numbers in brackets specify the BSE matrix size. It demonstrates the quadratic decay of the error |γ₁ − ω₁| in the lowest excitation energy with respect to the approximation error |λ₁ − ω₁| for the modified auxiliary BSE problem (Equation3.9(3.9) $F_{0} (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) \equiv (\begin{matrix} A_{0} & B_{0} \\ - B_{0}^{*} & - A_{0}^{*} \end{matrix}) (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) = λ_{n} (\begin{matrix} u_{n} \\ v_{n} \end{matrix}) .$ (3.9) ). Errors for eigenvalues are given in eV. The numerical error is controlled by a tolerance ϵ in the rank truncation procedure applied to the BSE submatrices V, $\overline{W}$ and $\tilde{W}$ . The resulting ϵ-ranks for the corresponding matrices are presented.

Table 3. Accuracy (in eV) for the first eigenvalue, |γ₁ − ω₁|, and norms of the differences between the exact and reduced-rank matrices, ‖F₁ − F₀‖, vs. ϵ-rank for V, $\overline{W}$ and $\tilde{W}$ .

Display Table

Notice that the rank decomposition of the matrix V can be derived from the respective Cholesky factorisation of the TEI matrix B accomplished by the simple rank reduction. The rank approximation for the symmetric matrices $\overline{W}$ and $\tilde{W}$ can be calculated by pivoted Cholesky factorisation. demonstrates that the approximation error in the reduced basis, |γ₁ − ω₁|, is at least one order of magnitude smaller than that for auxiliary problem, |λ₁ − ω₁|, i.e. $| γ_{n} - ω_{n} | ≪ | λ_{n} - ω_{n} |,$ which justifies the use of the reduced basis equation (Equation3.10(3.10) $M_{1} Y = γ_{n} S_{1} Y, Y \in R^{m_{0}} .$ (3.10) ).

This effect can be also seen in for N₂H₄ molecule demonstrating the convergence γ_n → ω_n and λ_n → ω_n with respect to the increasing rank parameter determining the auxiliary problem (the size of the reduced basis set is m₀ = 30). The left and right figures correspond to the rank truncation thresholds, ϵ = 0.6 and ϵ = 0.1, respectively. The quantities λ_n, γ_n and ω_n are marked by black, blue and red colours, respectively.

Figure 2. Comparison of m₀ = 30 lower eigenvalues for the reduced and exact BSE systems vs. ϵ in the case of N₂H₄ molecule. The number in a text box indicates the error in the first eigenvalue |γ₁ − ω₁|.

Figure 2. Comparison of m0 = 30 lower eigenvalues for the reduced and exact BSE systems vs. ϵ in the case of N2H4 molecule. The number in a text box indicates the error in the first eigenvalue |γ1 − ω1|.

represents similar results for amino acid glycine, C₂H₅NO₂, with the BSE matrix size 6000 × 6000. In this case, the truncation thresholds ϵ = 0.2 leads to the rank parameters R_V = 54 , $R_{\overline{W}} = 50$ , $R_{\tilde{W}} = 50$ and the error for the minimal eigenvalue, ω₁ = 0.72 eV. For ϵ = 0.08, we have the rank parameters R_V = 100 , $R_{\overline{W}} = 215$ , $R_{\tilde{W}} = 129$ and the error for the minimal eigenvalue equals to 0.38 eV.

Figure 3. Comparison of m₀ = 30 lower eigenvalues for the reduced and exact BSE systems vs. ϵ in the case of Glycine amino acid The number in a text box indicates the error in the first eigenvalue |γ₁ − ω₁|.

Figure 3. Comparison of m0 = 30 lower eigenvalues for the reduced and exact BSE systems vs. ϵ in the case of Glycine amino acid The number in a text box indicates the error in the first eigenvalue |γ1 − ω1|.

The lowest values of the BSE excitation energy for H₂O molecule computed by solving our exact system is 8.72 eV which agrees with the value 8.7 eV for ice water presented in [Citation44]. The reduced basis method using the rank truncation threshold ϵ = 10⁻¹ provides the value 8.95 eV.

, left and right, illustrates the BSE energy spectrum of the H₂O molecule for the lowest N_red = 30 eigenvalues vs. the rank truncation parameters ϵ = 0.6 and 0.1, where the ranks of V and the BSE matrix block $\tilde{W}$ are equal to 4, 5 and 28, 30, respectively, while the block $\overline{W}$ remains unchanged. For the choice ϵ = 0.6 and ϵ = 0.1, the error in the first (lowest) eigenvalue for the solution of the problem using the reduced basis is about 0.11 and 0.025 eV, respectively.

Figure 4. Comparison of m₀ = 30 lower eigenvalues for the reduced and exact BSE systems for H₂O molecule: ϵ = 0.6, left; ϵ = 0.1, right. The error in the first eigenvalue |γ₁ − ω₁| is shown in a text box.

Figure 4. Comparison of m0 = 30 lower eigenvalues for the reduced and exact BSE systems for H2O molecule: ϵ = 0.6, left; ϵ = 0.1, right. The error in the first eigenvalue |γ1 − ω1| is shown in a text box.

Next, we present BSE calculations for chains of 16 and 32 hydrogen atoms placed in a 3D bounding box with the size 64³ bohr.³ The interatomic interval equals to 1.39 bohr. The HF calculations were performed with 64 and 128 Gaussian-type basis functions using grids of size 32, 768³ and 16, 384³, for computation of the core Hamiltonian and TEI, respectively. Cholesky factors of TEI matrix are of size 4096 × 175 and 16, 384 × 348 for the chains with 16 and 32 hydrogen atoms.

demonstrates the decay of the error in the lowest eigenvalues |γ₁ − ω₁| with respect to the tolerance ϵ in the rank approximation of the BSE matrix for the chains of hydrogen atoms, H₁₆ (896 × 896) and H₃₂ (3584 × 3584), where the numbers in brackets specify the BSE matrix size. We observe linear scaling of the corresponding ranks of V, $\overline{W}$ and $\tilde{W}$ with respect to the size of the system as expected. Since the rank of $\overline{W}$ decays slowly, we studied the case, with fixed rank( $\overline{W}$ ) = max {rank(V), rank $\tilde{W}}$ (usually it coincides with rank(V)). The improved accuracy of the resulting spectrum is achieved even for rather large ϵ = 2 × 10⁻¹.

Table 4. Accuracy (in eV) for the first eigenvalue, |γ₁ − ω₁| vs. ϵ-rank for V, $\overline{W}$ and $\tilde{W}$ for chains of 16 and 32 hydrogen atoms.

Display Table

Table 5. The model error |μ₁ − ω₁| in TDA approximation for different molecules.

Download CSV Display Table

4.2. Reduced basis approach to the Tamm–Dancoff model

It is interesting to apply the reduced basis approach described above to the so-called TDA [Citation3], which corresponds to setting the matrix B = 0 in Equation (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ). It also allows to estimate the difference between the excitation energies from the full BSE model and those obtained by the TDA, which introduces an additional small model error.

The TDA model simplifies Equation (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ) to a standard Hermitian eigenvalue problem (4.1) $A x_{n} = μ_{n} x_{n}, x_{n} \in R^{N_{o v}} A \in R^{N_{o v} \times N_{o v}}$ (4.1) with the factor-two smaller matrix size N_ov. The reduced basis approach via low-rank approximation described in Section 3.2 can be applied directly to the TDA equation.

Below we present numerical tests indicating that the approximation error introduced by the TDA compared with the initial BSE system (Equation3.3(3.3) $F (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) \equiv (\begin{matrix} A & B \\ B^{*} & A^{*} \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}) = ω_{n} (\begin{matrix} I & 0 \\ 0 & - I \end{matrix}) (\begin{matrix} x_{n} \\ y_{n} \end{matrix}),$ (3.3) ) remains on the level of 0.003 Hartree for several compact molecules (see ). This table indicates a tendency to decrease the TDA model error for larger molecules, say 0.0017 Hartree (0.045 eV) for glycine amino acid.

displays the error of TDA approximation in comparison with the full BSE system for the first m₀ = 15 lower eigenvalues on the examples of H₂O and C₂H₅OH molecules.

Figure 5. Comparison between m₀ = 15 lower eigenvalues μ_n and ω_n for the TDA and full BSE models, respectively, on the examples of H₂O and C₂H₅OH molecules.

Figure 5. Comparison between m0 = 15 lower eigenvalues μn and ωn for the TDA and full BSE models, respectively, on the examples of H2O and C2H5OH molecules.

5. Conclusions

This paper introduces and analyses the reduced basis method for computation of several lowest eigenvalues in the BSE, based on the solution of an auxiliary, simplified eigenvalue problem via diagonal plus low-rank approximations to the BSE matrix blocks. The reduced spectral problem of small size is derived via projection of the full BSE matrix onto the reduced basis set, composed of several dominant eigenvectors of the simplified problem. The ϵ-rank bounds for the requested subtensors of the TEI tensor represented in the basis set of HF MOs are proved in Lemmas 2.1 and 2.2. Asymptotic estimates on storage demands are provided. Numerical tests confirm merely quadratic error behaviour in the excitation energies with respect to the accuracy of the rank approximation. The particular construction of the BSE matrix is based on the BSE-GW approximation with non-interacting HF Green's function (we follow the construction considered in [Citation14]).

The main goal of the present paper is the development and numerical verification of the reduced basis method applied to large-scale BSE eigenvalue problems. Potential efficiency of the approach is illustrated numerically on the examples of single molecules and finite hydrogen chains. The numerical studies demonstrate that eigenvalues of the reduced spectral problem provide a sufficient approximation to the lowest excitation energies of the exact BSE system. For all the examples considered so far, the accuracy of the order of 0.15 − 0.3 eV was achievedFootnote³by the reduced basis method with rather moderate ranks as indicated in and . The behaviour of the approximation error vs. rank parameters remains similar for compact molecules and for finite chains of atoms.

Future work will be focused on application and development of efficient linear algebra algorithms for fast and accurate solution of the arising large eigenvalue problems taking into account data-sparse block structures. In particular, the complementary structural representations to the matrix $\overline{W}$ may enhance the accuracy of the reduced basis method. These issues will be considered in a forthcoming paper.

Another possible research direction is concerned with the quantised tensor train (QTT) approximation [Citation45] of the long vectors and large matrices involved, in order to perform the fast matrix–vector calculations in the QTT tensor arithmetics with ϵ-rank truncation, see [Citation46].

Acknowledgments

The authors would like to thank A. Savin (UPMC, Paris) and J. Toulouse (UPMC, Paris) for valuable comments on the problem setting for the BSE model and for providing useful references. Furthermore, we acknowledge helpful remarks by C. Yang (LBL, Berkeley) on the initial draft of the manuscript.

Disclosure statement

No potential conflict of interest was reported by the authors.

Notes

* Dedicated to Prof. Andreas Savin on occasion of his 65th birthday.

1. In [Citation14], it was demonstrated that in the case of small interatomic distance this model remains in a good agreement with the reference data for full configuration–interaction calculations though it does not describe the dissociation case. The BSE model was shown to provide satisfactory results in the latter case when using the exact Green's function.

2. Though one can notice that the static-screened interaction matrix, responsible for the exchange interaction of electrons, does not allow the high-accuracy low-rank approximation, as it is already known about the HF exchange.

3. For example, paper [Citation11] assumes the BSE model errors in the range of 0.1–0.3 eV as the acceptable precision.

References

E. Runge and E. Gross, Phys. Rev. Lett. 52 (12), 997–1000 (1984).
Web of Science ®Google Scholar
E. Gross and W. Kohn, Adv. Quantum Chem. 21 (255), 287–323 (1990).
Google Scholar
M.E. Casida, in Recent Advances in Density Functional Methods, Part I, edited by D.P. Chong (World Scientific, Singapore, 1995), Vol. 155, pp. 1207–1216.
Google Scholar
R.E. Stratmann, G.E. Scuseria, and M.J. Frisch, J. Chem. Phys. 109, 8218 (1998).
Web of Science ®Google Scholar
C. Cramer and D. Truhlar, Phys. Chem. Chem. Phys. 11 (46), 10757–10816 (2009).
PubMed Web of Science ®Google Scholar
L. Reining, V. Olevano, A. Rubio, and G. Onida, Phys. Rev. Lett. 88, 066404 (2002).
PubMed Web of Science ®Google Scholar
E.E. Salpeter and H.A. Bethe, Phys. Rev. 82 (2), 309–310 (1951).
Google Scholar
L. Hedin, Phys. Rev. 139, A796 (1965).
Web of Science ®Google Scholar
G. Onida, L. Reining, and A. Rubio, Rev. Mod. Phys. 74, 601 (2002).
Web of Science ®Google Scholar
W.G. Schmidt, S. Glutsch, P.H. Hahn, and F. Bechstedt, Phys. Rev. B 67, 085307 (2003).
Web of Science ®Google Scholar
S. Körbel, P. Boulanger, I. Duchemin, X. Blase, M. AL Marques, and S. Botti, J. Chem. Theory Comput. 10 (9), 3934–3943 (2014).
PubMed Web of Science ®Google Scholar
E. Rebolini, J. Toulouse, A.M. Teale, T. Helgaker, and A. Savin, Phys. Rev. A 91, 032519 (2015).
Web of Science ®Google Scholar
J. Deslippe, G. Samsonidze, D.A. Strubbe, M. Jain, M.L. Cohen, and S. Louie, Comp. Phys. Commun. 183, 1269 (2012).
Web of Science ®Google Scholar
E. Rebolini, J. Toulouse, and A. Savin, in Concepts and Methods in Modern Theoretical Chemistry, edited by S. Ghosh and P. Chattaraj, Vol 1, Electronic Structure and Reactivity, (CPC Press, Boca Raton, 2013), p. 367.
Google Scholar
E. Rebolini, J. Toulouse, and A. Savin, Mol. Phys. 111, 1219 (2013).
Web of Science ®Google Scholar
V. Khoromskaia, Comp. Methods Appl. Math. 14, 89 (2014).
Web of Science ®Google Scholar
V. Khoromskaia and B.N. Khoromskij, Comp. Phys. Commun. 185 (1), 2–10 (2014).
Web of Science ®Google Scholar
B. N. Khoromskij, Chemometr. Intell. Lab. Syst. 110 (1), 1–19 (2012).
Web of Science ®Google Scholar
V. Khoromskaia and B.N. Khoromskij, Phys. Chem. Chem. Phys. 17, 31491 (2015).
PubMed Web of Science ®Google Scholar
V. Khoromskaia, B.N. Khoromskij, and R. Schneider, SIAM J. Sci. Comput. 35 (2), A987–A1010 (2013).
Web of Science ®Google Scholar
P. Benner and H. Faßbender, Linear Algebra Appl. 263, 75 (1997).
Web of Science ®Google Scholar
P. Benner, V. Mehrmann, and H. Xu, Numer. Math. 78 (3), 329–358 (1998).
Web of Science ®Google Scholar
D. Kressner, Numerical Methods for General and Structured Eigenvalue Problems, Vol. 46, Lecture Notes in Computational Science and Engineering (Springer, Berlin/Heidelberg, 2005).
Google Scholar
H. Faßbender and D. Kressner, GAMM Mitteilungen 29 (2), 297–318 (2006).
Google Scholar
M. Shao, F.H. da Jornada, C. Yang, J. Deslippe, and S. Louie, Linear Algebra Appl. 488, 148 (2016).
Web of Science ®Google Scholar
A. Bunse-Gerstner and H. Faßbender, in Numerical Algebra, Matrix Theory, Differential-Algebraic Equations and Control Theory, edited by P. Benner, M. Bollhöfer, D. Kressner, C. Mehl, and T. Stykel (Springer International Publishing, Heidelberg, 2015), pp. 3–23.
Google Scholar
Z. Bai and R.-C. Li, SIAM J. Matrix Anal. Appl. 33 (4), 10751100 (2012).
Web of Science ®Google Scholar
Z. Bai and R.-C. Li, SIAM J. Matrix Anal. Appl. 34 (2), 392–416 (2013).
Web of Science ®Google Scholar
A. Bunse-Gerstner, R. Byers, and V. Mehrmann, SIAM J. Matrix Anal. Appl. 13 (2), 419–453 (1992).
Web of Science ®Google Scholar
D. Mackey, N. Mackey, and F. Tisseur, Electron. J. Linear Algebra 10, 106 (2003).
Google Scholar
D. Mackey, N. Mackey, C. Mehl, and V. Mehrmann, SIAM J. Matrix Anal. Appl. 28 (4), 1029–1051 (2006).
Web of Science ®Google Scholar
C. Mehl, SIAM J. Matrix Anal. Appl. 30, 291 (2008).
Web of Science ®Google Scholar
P. Benner, H. Faßbender, and C. Yang, Some remarks on the complex J-symmetric eigenproblem. Max Planck Institute Magdeburg, Preprint MPIMD/15-12 (July 2015).
Google Scholar
E. Napoli, E. Polizzi, and Y.Y. Saad, Efficient estimation of eigenvalue counts in an interval. arXiv:1308.4275v2, 2014.
Google Scholar
L. Lin, Y. Saad, and C. Yang, Approximating spectral densities of large matrices. arXiv:1308.5467.v2., 2015.
Google Scholar
P. Benner, V. Mehrmann, and D. Sorensen, editors, Dimension Reduction of Large-Scale Systems, Vol. 45, Lecture Notes in Computational Science and Engineering (Springer, Berlin, 2005).
Google Scholar
S. Reine, T. Helgaker, and R. Lindh, WIREs Comput. Mol. Sci. 2 (2), 290–303 (2012).
Web of Science ®Google Scholar
T. Helgaker, P. Jørgensen, and J. Olsen, Molecular Electronic-Structure Theory (Wiley, Chichester, 1999).
Google Scholar
N. Beebe and J. Linderberg, Int. J. Quantum Chem. 12 (4), 683–705 (1977).
Web of Science ®Google Scholar
S. Wilson, Comput. Phys. Commun. 58, 71 (1990).
Web of Science ®Google Scholar
N. Higham, in Reliable Numerical Computations, edited by M.G. Cox and S.J. Hammarling (Oxford University Press, Oxford, 1990), pp. 161–185.
Google Scholar
T.H.J. Dunning, J. Chem. Phys. 90, 1007 (1989).
Web of Science ®Google Scholar
H.-J. Werner, P.J. Knowles, G. Knizia, F.R. Manby, and M. Schütz, Wiley Interdiscip. Rev. Comput. Mol. Sci. 2 (2), 242–253 (2011).
Web of Science ®Google Scholar
A. Hermann, W.G. Schmidt, and P. Schwerdfeger, Phys. Rev. Lett. 100, 207403 (2008).
PubMed Web of Science ®Google Scholar
B.N. Khoromskij, J. Constr. Approx. 34 (2), 257–289 (2011).
Web of Science ®Google Scholar
S. Dolgov, B. Khoromskij, D. Savostyanov, and I. Oseledets, Comp. Phys. Commun. 185 (4), 1207–1216 (2014).
Web of Science ®Google Scholar

A reduced basis approach for calculation of the Bethe–Salpeter excitation energies by using low-rank tensor factorisationsFootnote^*
* Dedicated to Prof. Andreas Savin on occasion of his 65th birthday.

ABSTRACT

GRAPHICAL ABSTRACT

1. Introduction

2. Low-rank approximation of the two-electron integrals in Hartree–Fock calculus

2.1. Cholesky decomposition of the TEI matrix

2.2. Rank bounds for the TEI matrix V

3. Tensor factorisation of the BSE matrix blocks

3.1. Tensor representations using TEI matrix in MTO basis

3.2. The reduced basis approach using low-rank approximations

Table 1. The error |γ₁ − ω₁| vs. the size of reduced basis, m₀.

4. Numerical tests for the reduced basis method

4.1. Reduced basis method for the BSE system

Table 2. The number of Gaussian type orbital (GTO) basis functions, N_b, and molecular orbitals, N_o.

Table 3. Accuracy (in eV) for the first eigenvalue, |γ₁ − ω₁|, and norms of the differences between the exact and reduced-rank matrices, ‖F₁ − F₀‖, vs. ϵ-rank for V, $\overline{W}$ and $\tilde{W}$ .

Table 4. Accuracy (in eV) for the first eigenvalue, |γ₁ − ω₁| vs. ϵ-rank for V, $\overline{W}$ and $\tilde{W}$ for chains of 16 and 32 hydrogen atoms.

Table 5. The model error |μ₁ − ω₁| in TDA approximation for different molecules.

4.2. Reduced basis approach to the Tamm–Dancoff model

5. Conclusions

Acknowledgments

Disclosure statement

References

Information for

Open access

Opportunities

Help and information

A reduced basis approach for calculation of the Bethe–Salpeter excitation energies by using low-rank tensor factorisationsFootnote** Dedicated to Prof. Andreas Savin on occasion of his 65th birthday.

ABSTRACT

GRAPHICAL ABSTRACT

1. Introduction

2. Low-rank approximation of the two-electron integrals in Hartree–Fock calculus

2.1. Cholesky decomposition of the TEI matrix

2.2. Rank bounds for the TEI matrix V

3. Tensor factorisation of the BSE matrix blocks

3.1. Tensor representations using TEI matrix in MTO basis

3.2. The reduced basis approach using low-rank approximations

Table 1. The error |γ1 − ω1| vs. the size of reduced basis, m0.

4. Numerical tests for the reduced basis method

4.1. Reduced basis method for the BSE system

Table 2. The number of Gaussian type orbital (GTO) basis functions, Nb, and molecular orbitals, No.

Table 3. Accuracy (in eV) for the first eigenvalue, |γ1 − ω1|, and norms of the differences between the exact and reduced-rank matrices, ‖F1 − F0‖, vs. ϵ-rank for V, W‾ and W˜.

Table 4. Accuracy (in eV) for the first eigenvalue, |γ1 − ω1| vs. ϵ-rank for V, W‾ and W˜ for chains of 16 and 32 hydrogen atoms.

Table 5. The model error |μ1 − ω1| in TDA approximation for different molecules.

4.2. Reduced basis approach to the Tamm–Dancoff model

5. Conclusions

Acknowledgments

Disclosure statement

Notes

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information

Keep up to date

A reduced basis approach for calculation of the Bethe–Salpeter excitation energies by using low-rank tensor factorisationsFootnote^*
* Dedicated to Prof. Andreas Savin on occasion of his 65th birthday.

Table 1. The error |γ₁ − ω₁| vs. the size of reduced basis, m₀.

Table 2. The number of Gaussian type orbital (GTO) basis functions, N_b, and molecular orbitals, N_o.

Table 3. Accuracy (in eV) for the first eigenvalue, |γ₁ − ω₁|, and norms of the differences between the exact and reduced-rank matrices, ‖F₁ − F₀‖, vs. ϵ-rank for V, $\overline{W}$ and $\tilde{W}$ .

Table 4. Accuracy (in eV) for the first eigenvalue, |γ₁ − ω₁| vs. ϵ-rank for V, $\overline{W}$ and $\tilde{W}$ for chains of 16 and 32 hydrogen atoms.

Table 5. The model error |μ₁ − ω₁| in TDA approximation for different molecules.