Publication Cover
Molecular Physics
An International Journal at the Interface Between Chemistry and Physics
Volume 121, 2023 - Issue 7-8: Special Issue of Molecular Physics in Memory of Nick Besley
846
Views
2
CrossRef citations to date
0
Altmetric
Memorial Issue for Nick Besley

Electronic circular dichroism of proteins computed using a diabatisation scheme

ORCID Icon, ORCID Icon & ORCID Icon
Article: e2133748 | Received 29 Jul 2022, Accepted 30 Sep 2022, Published online: 27 Oct 2022

ABSTRACT

Circular dichroism (CD) spectroscopy is a powerful technique employed to study the structure of biomolecules. More accurate calculation of CD from first principles will aid both computational and experimental studies of protein structure and dynamics. We apply a diabatisation scheme to improve the description of nearest neighbour interactions between two electronic transitions (nπ* and πnbπ*) localised on each individual peptide bond (amide group) in a protein. These interactions are incorporated into DichroCalc, an exciton-based computational method to calculate CD, and yield improvements over the standard DichroCalc parameter set, particularly for calculation of CD for important secondary structural elements such as an α helix.

GRAPHICAL ABSTRACT

Introduction

Calculations from first principles of the electronic circular dichroism (CD) spectroscopy of proteins are challenging. It is a topic that has attracted the interest of many researchers, including the late Nick Besley [Citation1], to whose memory this Special Issue is dedicated. In this paper, we present a new theoretical development, but we begin by surveying some of the contributions of Besley to the field, which underpin the current study. The CD spectra of proteins in the far-ultraviolet (far-UV) have distinct signatures, reflecting the protein secondary structure, which arise from the regular, repeating arrays of backbone amide chromophores. Thus, CD is a widely used biophysical technique for characterising proteins in solution [Citation2]. Often, statistical techniques are used to estimate the secondary structure content from the experimentally determined CD spectrum [Citation3,Citation4]. However, our focus is on the reverse, namely, computing the CD spectrum from the atomic coordinates of the protein.

CD spectra can be computed from protein structures in various ways, e.g. see [Citation5–10], some of which have been reviewed elsewhere [Citation11]. We focus here on the exciton framework, whereby an effective Hamiltonian is constructed through a consideration of localised chromophores within the protein [Citation12,Citation13]. The peptide bond is the most important contributor to the far-UV CD spectra of proteins. A simple effective Hamiltonian would consider the nπ* and πnbπ* electronic transitions of the amide chromophore, which occur at about 222 and 190 nm, respectively. The πnb, n and π* molecular orbitals for an example diamide are depicted in Figure .

Figure 1. Molecular orbitals for diamide 2a [φ = 180°, ψ = 180°] with an isosurface cut-off of 0.05. Orbitals from a state-averaged CASSCF calculation as described in reference 32. (a) π* for amide 1, (b) n for amide 1, (c) πnb for amide 1, (d) π* for amide 2, (e) n for amide 2 and (f) πnb for amide 2.

Figure 1. Molecular orbitals for diamide 2a [φ = 180°, ψ = 180°] with an isosurface cut-off of 0.05. Orbitals from a state-averaged CASSCF calculation as described in reference 32. (a) π* for amide 1, (b) n for amide 1, (c) πnb for amide 1, (d) π* for amide 2, (e) n for amide 2 and (f) πnb for amide 2.

A desire to improve the quality of protein CD calculations that use the exciton framework motivated several studies of the electronic structure of small amides using ab initio multi-configurational methods, including complete active space self-consistent field (CASSCF) calculations on formamide [Citation14] and N-methylacetamide [Citation15] in the gas phase. Analogous calculations with various models of solvent showed subtle, but important, changes to the electronic transitions, including, for example, differences in the orientation of the πnbπ* electric transition dipole moment compared to the gas phase [Citation16,Citation17]. This factor was influential in leading to an improvement in the accuracy of calculations of protein CD spectra based on parameterisation of the ab initio descriptions of the amide chromophore [Citation18–20] compared to spectra calculated using parameters from semi-empirical calculations [Citation21]. The computational methodology for protein CD calculations and the chromophore parameters are available via a web-interface [Citation22], which allows the non-expert simply to upload a Protein Data Bank file with the coordinates of a protein structure and to compute the CD spectrum. The theoretical background and related developments have been summarised elsewhere [Citation11,Citation23] and the approach has been used, sometimes in conjunction with molecular dynamics simulations, to study a diverse range of systems, e.g. glycosylated antimicrobial peptides [Citation24], SARS-CoV-2 proteins [Citation25] and chymotrypsin adsorbed on silica [Citation26].

The above discussion has focused on two localised electronic transitions, the nπ* and πnbπ* excitations, on an individual amide chromophore. Whilst the computational approach based on this approximation has been successful, additional insights have been gained through the study of cyclic [Citation27–29] and linear diamides [Citation30–32]. Electronic excitations with higher transition energies may be important [Citation33] and in this region charge transfer transitions between orbitals on neighbouring peptide groups are also observed. Quantum chemical calculations at the CASSCF level on a linear diamide enabled the generation of parameters describing the charge transfer transitions [Citation31]. Protein CD calculations incorporating these transitions gave an encouraging level of agreement with synchrotron radiation CD spectra [Citation34] measured down to 170 nm. The CASSCF calculations on the diamide provide an additional opportunity, which is the subject of our study, namely, to consider the interaction between two covalently bound amide chromophores at the quantum chemical level using a procedure known as diabatisation.

The effective Hamiltonian matrix in the exciton framework comprises diagonal elements corresponding to the energies of monomer (diabatic) states and off-diagonal elements. The latter are inter-state interactions or couplings. Whilst the isolated monomer, i.e. N-methylacetamide, appears to be a good choice of diabatic state with the couplings computed using a Coulombic model, an alternative approach would be to apply a diabatisation scheme to a full ab initio calculation on a diamide model, whereby the eigenstates of the diamide are transformed to a diabatic basis. This method is capable of treating multiple excited states and also taking into account both short- and long-range effects of the excitonic couplings. Thus, an orthogonal transformation is used to connect the adiabatic excited states with a set of diabatic excited states. There is no single ‘ideal’ set of diabatic states [Citation35], but several strategies can be adopted to define the adiabatic-to-diabatic transformation. The fragment excitation difference scheme uses the so-called excitation density [Citation36]. A variant of this and some extensions are available in the EXAT software [Citation37]. An alternative strategy derives the couplings from the energy splitting between the excited states and the extent of exciton delocalisation between the fragments [Citation38]. Other approaches have maximised the overlap with a reference wavefunction [Citation39] or the similarity with the dipole moment [Citation40]. Diabatisation approaches have been applied to aggregates of anthracene and tetracene [Citation41], to aggregates of propeller-shaped emitters [Citation42] using the fromage software [Citation43], dimers of ethylene and derivatives [Citation44] and to various nucleic acid systems [Citation45,Citation46]. Herein, we apply a diabatisation scheme to compute the CD of some polypeptide structures corresponding to conformations of particular significance, including α-helices, β-strands, polyproline II helices and 310-helices.

Proteins rich in β-strands can be separated into two distinct classes: β-I and β-II [Citation47]. This classification can be made based on their CD spectra [Citation47,Citation48]. β-I proteins, e.g. concanavalin A, contain regular β strands and give rise to CD spectra with a positive band at 195 nm and a negative band at 215–220 nm. β-II proteins, e.g. elastase, yield CD spectra reminiscent of unordered polypeptides with a negative band around 198 nm [Citation49,Citation50]. It is thought that β-II proteins may adopt more polyproline II conformation [Citation50], in addition to the possibility that they are more conformationally labile. Accurate computation of the CD spectrum of the polyproline II conformation would, thus, help provide useful insight into the spectra of β-II proteins. Another conformation of interest is the 310 helix, which, despite some structural similarity to the α helix, has a distinct CD spectrum [Citation51].

Methods

Diabatisation

A diabatisation procedure was performed to generate more accurate inter-amide couplings between two electronic transitions (nπ* and πnbπ*) localised on each of the two amide monomers when in a dimeric system (a diamide). The procedure follows that described by Aragó and Troisi [Citation41], which is based on orthogonal transformation of the adiabatic states (eigenstates of the total Hamiltonian) to diabatic (noninteracting) states for a two-level dimer system and generalisable to systems with an arbitrary number of interacting states (four in this study). For two monomers (amides) labelled 1 and 2 that combine, via a peptide bond, as a dimer (a diamide), consider four singlet excited states on the dimer and two singlet excited states on each of the monomers. The four adiabatic excited states on the dimer are the n1π1* and πnb1π1* localised on amide 1 in the dimer and the n2π2* and πnb2π2* localised on amide 2 in the dimer. The two excited states on each of the two isolated monomers are nπ* and πnbπ*. The diabatic Hamiltonian HD is found by a transformation of the adiabatic Hamiltonian HA, thus: (1) HD=CHAC(1) The unitary transformation matrix C relates the diabatic states (on the dimer) to the adiabatic states (on the dimer). It is desirable that the diabatic wave functions be as close as possible to the initial noninteracting wave functions. The unitary matrix C is computed by minimising the difference between the adiabatic transition dipole moments (TDMs) on the dimer and the isolated monomer TDMs in the coordinate frame of the dimer [Citation41]. Computation of matrix C proceeds as follows [Citation41].

The matrix M is formed from a product of matrices representing the adiabatic TDMs on the dimer, µA, and the isolated TDMs on each of the monomers, µISO, (2) M=(μA)μISO(2) Singular valued decomposition is carried out on matrix M to find the matrices U and V*. These are used to obtain C*,best, (3) C,best = UV(3) where C*,best is the transpose of Cbest which is defined as the best unitary transformation satisfying the following equation, (4) C,best=argminR||μARμISO||(4) where |||| denotes the Frobenius norm.

The adiabatic Hamiltonian is diagonal, where the diagonal elements are the transition energies of the excited states in the dimeric system. The output of the diabatisation is the diabatic Hamiltonian with transition energies on the diagonal and the coupling between excited states in the diabatic representation in the off-diagonal matrix elements. The amide is the monomer and the diamide is the dimer. The monomer and the dimer state properties (TDMs) and adiabatic Hamiltonian are listed as matrices in Table S1. The input properties are from DichroCalc parameter sets for the amide N-methylacetamide (set NMA4FIT2) [Citation17] and the diamide N-acetylglycine N-methylamide (13 sets for various φ and ψ dihedral angles of the diamide) [Citation31,Citation32]. The TDMs for the two monomers (from the above parameter set) are transformed onto the coordinate frame of the dimer prior to diabatisation. The 13 pairs of φ and ψ angles and the corresponding parameter sets for the diamide are listed in Table . The geometries were chosen to span the Ramanchandran plot. The diabatic Hamiltonian output from the diabatisation contains off-diagonal matrix elements that are the couplings between the states on each monomer in the system of the dimer, and it is these quantities that are used to modify the computation of CD (discussed below). The diabatic Hamiltonian matrices are listed in Table S1.

Table 1. The 13 diamides and their secondary structure and associated DichroCalc parameter set.

Model peptides

13 model idealised peptide structures, each comprising 20 L-alanine residues, with different repeating φ and ψ dihedral angles to describe differing proteinaceous secondary structure elements were constructed using the molecular editor Avogadro [Citation52,Citation53]. The 13 model peptides were considered based on the availability of DichroCalc parameter sets that describe intra-amide and inter-amide (charge-transfer) electronic transitions in a diamide (N-acetylglycine N-methylamide) of varying φ and ψ angles [Citation31,Citation32], for which the diabatisation procedure was carried out. The diamides are listed in Table . The naming of diamides 2l and 2m follows the labelling sequence of Oakley et al. [Citation32], who studied diamides 2a through to 2k (Table ). The three α-helical diamide structures 2j, 2k and 2l are, respectively, the Corey-Pauling-Branson helix, an idealised helix and the Barlow-Thornton helix [Citation54]. Figure shows a Ramachandran plot of the φ and ψ dihedral angles for the 13 structures. For clarity, we adopt a notation for each diamide and peptide with the φ and ψ dihedral angles in square brackets, e.g. for 2d the notation is 2d [−135°, 135°].

Figure 2. Ramachandran plot of the φ and ψ dihedral angles for the 13 diamide and 13 20 L-Ala peptide structures. The scatter points are labelled by the alphabetical (second) character naming each structure (see Table ). Inset is the structure of peptide 2k [φ = −60°, ψ = −60°].

Figure 2. Ramachandran plot of the φ and ψ dihedral angles for the 13 diamide and 13 20 L-Ala peptide structures. The scatter points are labelled by the alphabetical (second) character naming each structure (see Table 1). Inset is the structure of peptide 2k [φ = −60°, ψ = −60°].

CD calculations

DichroCalc [Citation22] was used to compute the CD spectra for the 13 model peptides. A modified version of the DichroCalc code was used to read a Hamiltonian matrix where, for each peptide, the nearest neighbour inter-amide interactions from the diabatisation procedure replaced the default off-diagonal matrix elements generated by DichroCalc. The theoretical approach utilised by DichroCalc, as well as alternative approaches to the computation of CD for biomolecules, has been described in detail in a recent review [Citation11]. Briefly, the DichroCalc software implements the exciton framework, with the transition electron densities associated with excitations of biological chromophores represented by a set of point charges (monopoles) that reproduce the electrostatic potential arising from the electron density. The parameters (transition energies, electric and magnetic transition dipole moments, electric permanent dipole moments and monopole charge sets) for the amide backbone are derived from ab initio calculations (parameter set NMA4FIT2) [Citation17] and describe the intra-amide nπ* transition (at 220 nm) and the πnbπ* transition (at 193 nm). To obtain a CD spectrum, a rotational strength line spectrum, computed by DichroCalc, is convoluted with Gaussian functions of full-width half-maximum (FWHM) of typically, e.g. 9.0 or 12.5 nm. Throughout the rest of the paper, results using the un-modified DichroCalc Hamiltonian are termed ‘ab initio’ and the results obtained from modifying the off-diagonal elements of the DichroCalc Hamiltonian are termed ‘diabatic’.

Results

Figure displays computed CD spectra (using Gaussian functions of FWHM of 9.0 nm) for the six most physically relevant of the 13 model peptide geometries 2d [−135°, 135°], 2i [−74°, −4°], 2j [−48°, −57°], 2k [−60°, −60°], 2l [−62°, −41°] and 2m [−75°, 145°]. Figure S1 displays rotational strength line spectra and CD spectra for all 13 peptides. For seven of the 13 model peptides, there is a marked difference for the rotational strength line spectra and for the CD spectra when the diabatic inter-amide interactions are incorporated in the Hamiltonian. The seven peptides are: 2a [180°, 180°], 2b [−120°, 180°], 2c [−60°, 180°], 2d [−135°, 135°], 2f [−120°, 60°], 2i [−74°, −4°] and 2m [−75°, 145°]. For the other six peptides, more subtle differences are found between the spectra. For the three α-helical peptides (2j [−48°, −57°], 2k [−60°, −60°] and 2l [−62°, −41°]) the diabatic CD spectra feature the double minimum (characteristic of an α-helical spectrum) in the 200–230 nm region; and the band maxima and minima are in similar regions of the spectrum for peptides 2e [−120°, 120°], 2g [−120°, 0°] and 2h [−60°, 0°] but with slightly differing band shape characteristics. Reproduction of the double minimum between 200 and 230 nm represents a qualitative improvement over the ab initio calculated spectra.

Figure 3. CD spectra for the model peptides comprising 20 L-Ala residues and named as the diamides. CD spectra with band shapes from convolution with Gaussian functions of FWHM of 9.0 nm. Spectra from DichroCalc ab initio Hamiltonian (black line) and diabatic Hamiltonian (blue line). (a) 2d [−135°, 135°], (b) 2i [−74°, −4°], (c) 2j [−48°, −57°], (d) 2k [−60°, −60°], (e) 2l [−62°, −41°] and (f) 2m [−75°, 145°].

Figure 3. CD spectra for the model peptides comprising 20 L-Ala residues and named as the diamides. CD spectra with band shapes from convolution with Gaussian functions of FWHM of 9.0 nm. Spectra from DichroCalc ab initio Hamiltonian (black line) and diabatic Hamiltonian (blue line). (a) 2d [−135°, 135°], (b) 2i [−74°, −4°], (c) 2j [−48°, −57°], (d) 2k [−60°, −60°], (e) 2l [−62°, −41°] and (f) 2m [−75°, 145°].

Table S1 displays, for each of the 13 diamides, the three input matrices for the diabatisation procedure: the monomer TDMs, the dimer TDMs and the adiabatic Hamiltonian of the dimer. Table S1 also shows the outputted diabatic Hamiltonian, where the off-diagonal matrix elements 1–2 and 3–4 are intra-amide coupling interactions (for n1π1* with πnb1π1* and n2π2* with πnb2π2* transitions) and where matrix elements 1–3, 1–4, 2–3 and 2–4 are inter-amide coupling interactions (for n1π1* with n2π2*, n1π1* with πnb2π2*, πnb1π1* with n2π2* and πnb1π1* with πnb2π2* transitions, respectively).

The ab initio and the diabatic inter-amide interactions (off-diagonal elements of the Hamiltonian matrix) are shown in Table  for six of the 13 diamide geometries (2d [−135°, 135°], 2i [−74°, −4°], 2j [−48°, −57°], 2k [−60°, −60°], 2l [−62°, −41°] and 2m [−75°, 145°]). Table S2 lists the couplings for all 13 diamides. For the diamide in the β strand arrangement (2d [−135°, 135°]), there are significant differences in the three matrix elements for the 1–4, 2–3 and 2–4 interactions (H14, H23 and H24). The sign is opposite for 2–3 (positive for the ab initio and negative for the diabatic) and the order of magnitude is different for 1–4 (one order larger for the diabatic) and 2–4 (one order larger for the ab initio). The diamide 2i [−74°, −4°] (310 helix) has differences in all the four off-diagonal matrix elements, with H24nb1π1* with πnb2π2*) having the largest difference between the ab initio and diabatic Hamiltonians.

Table 2. Inter-amide interactions. Off-diagonal elements of the Hamiltonian (cm−1). DichroCalc ab initio matrix elements in the first row and diabatic matrix elements in the second row for each diamide geometry. The ab initio parameters have nπ* and πnbπ*transition energies at 220 nm (45455 cm−1) and 193 nm (51813 cm−1), respectively.

For the three α-helical diamides (2j [−48°, −57°], 2k [−60°, −60°] and 2l [−62°, −41°]), the largest difference between the ab initio and the diabatic matrix elements are for the n1π1* with πnb2π2* interaction (H14), where the diabatic interactions are one order of magnitude greater than the ab initio for diamides 2j [−48°, −57°] and 2l [−62°, −41°] and two orders of magnitude for diamide 2k [−60°, −60°], with all these matrix elements having the same sign (negative). This is clearly an important factor leading to the distinctive double minimum in the α-helical spectra. The diamide representing a polyproline II helical conformation (diamide 2m [−75°, 145°]), has large differences between ab initio and diabatic Hamiltonians for matrix elements H14, H23 and H24, with the largest differences between H14 and H24.

The Coulombic coupling of two electronic transitions is related to the dot product of the transition dipole moments located on monomers A and B [Citation55], (5) Vdd=μAμBRAB33(μARAB)(μBRAB)RAB5(5) Vdd is the coupling between the two dipoles µA and µB, and RAB is the distance between the two monomers A and B. If the diabatic coupling is different from the ab initio coupling and the dot product of the transition dipole moments is similar in magnitude, then the coupling may have a short-range non-Coulombic interaction that is not described by the interaction of two dipole moments (or two sets of monopole charges) from a monomeric calculation. The diabatic coupling is from diabatisation of the adiabatic states on a complete dimer, including all the electrons in the system, using a basis set (for the diabatic states) of the monomer transition dipole moments. For cases where the ab initio and diabatic coupling differs and the magnitude of the dot product of the transition dipole moments differ, the diabatic states may contain a more accurate description of Coulombic interactions between the two transitions, due to the more accurate wave function of the adiabatic states, i.e. for the complete dimer.

Table  displays the dot products of the un-normalised transition dipole moments for the monomer (in the arrangement of the dimer) and the dimer for the subset of six diamides (2d [−135°, 135°], 2i [−74°, −4°], 2j [−48°, −57°], 2k [−60°, −60°], 2l [−62°, −41°] and 2m [−75°, 145°]). Data for all 13 diamides are in Table S5.

Table 3. Dot products of the un-normalised transition dipole moments (D2) for six of the 13 diamide structures (Table S1). Upper triangle of Hamiltonian Hij. Intra-amide interactions are 1–2 and 3–4. Inter-amide interactions are 1–3, 1–4, 2–3 and 2–4. Transition dipole moments are not normalised.

Discussion

α helices

The experimental spectrum for an α helix possesses an intense positive band peak at 190 nm and a double minimum band with the negative band peaks at 208 and 220 nm. The CD spectra computed for the α-helical peptides (2j [−48°, −57°], 2k [−60°, −60°] and 2l [−62°, −41°]) using the ab initio Hamiltonians and the diabatic Hamiltonians all feature a positive band peak around 190 nm (Figure ), with very little difference between the ab initio and diabatic spectral bands in this region. Differences in the computed spectra are apparent at wavelengths greater than 200 nm where the double minimum is expected. For peptide 2j [−48°, −57°] the negative peak at 205 nm in the ab initio spectrum is slightly less intense and red shifted to 206 nm in the diabatic spectrum, and the negative band shoulder around 220 nm in the ab initio spectrum is both more pronounced and intense in the diabatic spectrum. At wavelengths greater than 200 nm for the peptide 2k [−60°, −60°], the ab initio spectrum has a negative band peak at 208 nm and a shoulder around 220 nm. The diabatic spectrum for this peptide displays the characteristic double minimum with negative peaks at 207 and 222 nm, with the latter peak being slightly more intense than the former. For peptide 2l [−62°, −41°] and in the ab initio spectrum, the negative band is at 205 nm and the band shoulder is around 220 nm, whereas the diabatic spectrum features a double minimum with negative peaks at 205 and 224 nm, with the former peak being more intense than the latter peak (vice versa to peptide 2k [−60°, −60°]).

For the discussion we focus on the third α-helical dimer 2l [−62°, −41°], the Barlow-Thornton helix whose geometry is representative of structures in the Protein Data Bank. For dimer 2l [−62°, −41°] the H13, H14, H23 and H24 couplings are –17, −37, −52 and –292 cm−1 for the ab initio and −359, −660, −131 and –363 cm−1 for the diabatic calculation. The dot product of the un-normalised TDMs for these couplings are, respectively, −0.003, −0.547, 0.629 and 0.984 D2 for the monomer and −0.035, −0.116, 0.618 and −3.997 D2 for the dimer. Using the rationale outlined in the Methods section, H13, H14, H23 and H24 may have non-Coulombic interactions in the diabatic representation, with H14 having the largest non-Coulombic interaction, and a significant Coulombic interaction, in the diabatic representation. Coupling 2–4 may also have a significant Coulombic interaction in the diabatic representation. These coupling and dot product values are close to those for dimer 2k [−60°, −60°], reflecting the closeness of the secondary structure of dimers 2k [−60°, −60°] and 2l [−62°, −41°] (both have a φ angle of –60°), except for couplings 1–3 and 1–4. For dimer 2l [−62°, −41°] the extent of non-Coulombic interaction in the H14 coupling may be less than for dimer 2k [−60°, −60°], with the H13 coupling in dimer 2l [−62°, −41°] having more non-Coulombic interaction than in dimer 2k [−60°, −60°].

β strand

The experimental CD spectrum for a β strand features a positive band at around 195 nm and a negative band at around 215–220 nm. The 2d [−135°, 135°] peptide has a computed CD spectrum using the diabatic inter-amide interactions that does not possess this feature in the 215–220 nm region, whereas the CD spectrum using the ab initio inter-amide interactions displays a degree of the correct character in this region (Figure ) as well as a positive band at 186 nm.

Due to the arbitrariness of the phase of the wave functions, the signs of the excitonic coupling from the diabatisation are not well defined. A bimodal distribution (plus or minus) for the excitonic coupling is often observed (see, e.g. Figure 14 from Jiang et al. [Citation56]). Thus, we usually select the signs of the diabatic coupling such that the computed diabatic CD spectra best agree with experimental and/or CD spectra computed using DichroCalc. To examine the differences between the two predicted spectra, the sign and/or magnitude of the three coupling interactions discussed above (1–4, 2–3 and 2–4; respectively, n1π1* with πnb2π2*, πnb1π1* with n2π2* and πnb1π1* with πnb2π2*) in the diabatic Hamiltonian were varied so that the negative band at 215–220 nm could be observed in the computed CD spectra (Figure ). Table  lists the ab initio, diabatic and the matrix elements of the diabatic Hamiltonian that were modified to compute the spectra for peptide 2d [−135°, 135°] shown in Figure . Modifying H14 only gives a less intense CD spectrum (compared to the diabatic), centred around 206 nm and where the negative band at 193 nm and the positive band at 221 nm are present (Figure , red line). Modifying H23 yields a spectrum similar to that computed from the modification of H14 but less intense with the negative band slightly red shifted to 194 nm (Figure , green line). Modifying H24 only gives a spectrum that is like that of the diabatic spectrum but where the negative band is blue shifted (from 192 to 187 nm), and the positive band (at 221 nm) is less intense (Figure , magenta line). A positive band at 191 nm and a negative band at 221 nm are computed when both H14 and H23 are modified (Figure , yellow line). When, in addition, H24 is modified the computed spectrum resembles that of the spectrum computed using the ab initio Hamiltonian (Figure , cyan and black lines).

Figure 4. Rotational strength line spectra (upper panel) and CD spectra with band shapes from convolution with Gaussian functions of FWHM of 9.0 nm (lower panel) for peptide 2d [φ = −135°, ψ = 135°]. Spectra computed using the ab initio (black line), diabatic (blue), diabatic with H14 modified (red), diabatic with H23 modified (green), diabatic with H24 modified (magenta), diabatic with H14 and H23 modified (yellow) and diabatic with H14, H23 and H24 modified (cyan) Hamiltonians.

Figure 4. Rotational strength line spectra (upper panel) and CD spectra with band shapes from convolution with Gaussian functions of FWHM of 9.0 nm (lower panel) for peptide 2d [φ = −135°, ψ = 135°]. Spectra computed using the ab initio (black line), diabatic (blue), diabatic with H14 modified (red), diabatic with H23 modified (green), diabatic with H24 modified (magenta), diabatic with H14 and H23 modified (yellow) and diabatic with H14, H23 and H24 modified (cyan) Hamiltonians.

Table 4. Inter-amide interactions (cm−1 rounded to integer values) for diamide 2d [φ = −135°, ψ = 135°] and used to compute the spectra in Figure 2.

H13, H14, H23 and H24 have ab initio couplings of 72, –20, 292 and –907 cm−1, and diabatic couplings of 109, –279, −244 and –89 cm−1 (Table ). The dot product of the un-normalised transition dipole moments for these couplings are, respectively, −0.032, −0.350, −0.342 and –2.037 D2 for the monomer and –0.036, 0.407, −0.399 and 2.842 D2 for the dimer (Table ). Using the rationale outlined in the Methods section, H14, H23 and H24 may have non-Coulombic interactions in the diabatic representation, with H24 having significant non-Coulombic interactions and additional Coulombic interactions in the diabatic representation. As there are little differences in the coupling values and dot products for the H13 coupling, this is likely a Coulombic interaction.

310 helix

The two computed spectra for diamide 2i [−74°, −4°] both feature an intense couplet centred around 197 nm for the ab initio spectrum and centred around 190 nm for the diabatic spectrum (Figure ), which has the relatively more intense couplet of the two spectra. For the ab initio spectrum, the couplet has a positive peak at 189 nm and a negative peak at 203 nm, which has a shoulder region around 220 nm. The couplet in the diabatic spectrum consists of a negative peak at 184 nm and a positive peak at 197 nm. This spectrum also features a relatively shallow and broad negative band around 218 nm (where the shoulder in the ab initio spectrum is situated).

The couplings listed in Table  show a difference for all four between the ab initio and the diabatic, where H13, H14, H23 and H24 are 15, −36, 161 and −662 cm−1 for the former and 289, −148, 230 and 284 cm−1 for the latter. The dot product of the un-normalised transition dipole moments for these couplings are, respectively, −0.013, −0.454, 0.615 and –2.985 D2 for the monomer and 0.011, −0.289, −0.730 and 3.175 D2 for the dimer. Thus, H14, H24, H23 and H24 may have non-Coulombic interactions in the diabatic representation.

The experimental CD spectrum of an octameric sequence of L amino acids in a right-handed 310-helical structure (with mean φ and ψ angles of −54° and −32°, respectively) displays a negative band at 207 nm with a shoulder centred near 222 nm [Citation51]. In addition to a slight positive band at 195 nm and negative band at 184 nm. The ab initio spectrum presented here is closest to the experimental spectrum. We note that the octamer has a structure close to diamide 2j [−48°, −57°] (discussed above) and not diamide 2i [−74°, −4°], and that the diabatic CD spectrum for 2j [−48°, −57°] is closer to the experimental CD spectrum of the 310-helical octamer.

Polyproline II

The ab initio spectrum for diamide 2m [−75°, 145°] features an intense couplet centred around 188 nm with a negative band at 181 nm and a positive band at 194 nm (Figure ). The spectrum also has a shallow negative band at 208 nm and a weak and broad positive band at 220 nm. In contrast, the diabatic spectrum possesses a positive band at 186 nm and two negative bands at 199 and 220 nm with the latter negative band being broader and shallower than the former.

The coupling values for H13, H14, H23 and H24 are, respectively, 31, −7, 16 and −971 cm−1 for the monomer and 81, 356, 145 and 29 cm−1 for the dimer (Table ). The dot product of the un-normalised TDMs for these couplings are, respectively, −0.002, 0.005, −0.619 and –2.865 D2 for the monomer and 0.048, 0.543, −0.610 and −4.660 D2 for the dimer (Table ). As noted for the β strand dimer (2d [−135°, 135°]), H14, H23 and H24 may have non-Coulombic interactions in the diabatic representation, with H24 having significant non-Coulombic interactions and additional Coulombic interactions in the diabatic representation. As there are little differences in the coupling values and dot products for the H13 coupling, this is likely a Coulombic interaction.

The experimental CD spectrum of a polyproline II structure (poly-L-proline) features a negative peak in the region around 200 nm and a positive peak at 220 nm [Citation57]. The computed diabatic CD spectrum features a negative band at 199 nm, in agreement with experiment, and a negative band at 220 nm, in disagreement with experiment.

Future work and outlook

Future work will involve derivation of an empirical correction term (for inter-amide couplings) to incorporate into the DichroCalc NMA4FIT2 parameter set, based on the diabatisation work presented here. The empirical correction will follow the work of Scholes and Ghiggino [Citation58], who defined a distance-based exponential term for short-range, non-Coulombic interactions between two chromophores, and Voityuk [Citation55], who derived expressions to compute the electronic coupling based on vertical transition energies, transition dipole moments (of the system and of individual chromophores) and the relative orientation of the chromophores or individual transition dipole moments. The correction term will account for the short-range, non-Coulombic interactions between excited states on neighbouring amides, an interaction currently neglected in the NMA4FIT2 parameter set and which has been shown, in this study, to be a significant contributor toward computing more accurate CD spectra, e.g. the double minimum for an α–helical polypeptide.

Supplemental material

Supplementary Material

Download MS Word (360.1 KB)

Acknowledgements

We are grateful for access to the University of Nottingham’s high-performance computer. J. H. is supported by the Royal Academy of Engineering under the Chairs in Emerging Technologies scheme. H.D. is supported by the NSFC Research Fund for International Young Scientists (grant number 21850410456).

Data availability statement

The DichroCalc code is available at https://github.com/dmrogers75/DichroCalc. Data, example files and other code used in this study are available in a folder named ‘diabatisation’.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by National Natural Science Foundation of China: [Grant Number 21850410456]; Royal Academy of Engineering: [Grant Number CiET2021_17].

References