1,136
Views
3
CrossRef citations to date
0
Altmetric
Computational Life Sciences, Bioinformatics and System Biology

Low resolution protein mapping and KB-R7943 drug-protein molecular interaction analysis of long-QT syndrome linked KCNH2 mutations

, , , , , , , ORCID Icon & ORCID Icon show all
Pages 183-193 | Received 09 Oct 2019, Accepted 25 Feb 2020, Published online: 12 Mar 2020

ABSTRACT

Cardiac Arrhythmias (CA) are responsible for syncope and sudden cardiac death (SCD) in several million cases worldwide. The molecular basis of CA pathogenesis is known, but the underlying mechanisms connecting genotype to phenotype changes in the proteins are not yet fully explored. Hence, in this study, three LQT2-linked missense mutations (Thr613Met, Ser641Phe and Gly648Ser) located in exon-7 region of KCNH2 gene identified in CA patients was assessed for their deleteriousness using diverse computational algorithms. The SIFT and PolyPhen-2 methods, which depends on sequence conservation across species, and protein structure-based information have attributed pathogenicity to these variants. Protein behavior studies at 3D level have identified that these three missense mutations induce biochemically severe amino acid changes that may alters the structure and function of KCNH2. Our molecular docking suggested that mutant KCNH2 protein shows a similar binding interactions to KB-R7943 as compared to the wild type model. These findings could be useful in resorting to alternate therapies for subset of CA patients who show limited therapeutic response to KB-R7943. This study supports that in silico approaches could act as a first-line filters to screen the potential impact of CA linked KCNH2 mutations depending upon the perspective of molecular evolution and phenotype of proteins.

1. Introduction

Cardiac arrhythmias or irregular heartbeats are prevalent across all age groups and are responsible for syncope (fainting) and sudden cardiac death (SCD) in several million cases worldwide (Afzal et al. Citation2019). Primary cardiac arrhythmias, which constitute a significant percentage of all forms of arrhythmias are caused by a heritable abnormality of ion channel functions in heart (Fu Citation2015). Hence, exploring the intricate and complex mechanisms underlying cardiac arrhythmias has become the main challenge for scientists and clinicians. Genetic pathology has been described in clinical subclassifications of hereditary arrhythmias e.g. Long QT syndrome, short QT syndrome, Brugada syndrome, catecholaminergic polymorphic ventricular tachycardia, sick sinus syndrome, cardiac conduction defect, atrial fibrillation, etc (Li et al. Citation2016). Long QT syndrome (LQTS), a prolonged ventricular repolarization condition is a genetically heritable cardiac arrhythmic disease which leads to sudden death. Genetic mutations in cardiac ion channel proteins are described to be the underlying causes for different types of LQTS phenotypes (Tester and Ackerman Citation2014). LQTS2 is a frequent form accounting for 35% to 40% of all the different types of LQTS phenotypes (Singh et al. Citation2019). The gene which is known to cause LQTS2 is called Potassium Voltage-Gated Channe; Subfamily H Member 2 (KCNH2), it codes for α-subunit of voltage-gated inwardly rectifying K+ channel (Saprungruang et al. Citation2018). The mutations in KCNH2 lead to a reduction in rectifer K+ current (IKr), which causes prolonged QT intervals and ventricular fibrillation based on the nature of mutation (De Zio et al. Citation2019).

With the advancement in sequence technologies, the number of KCNH2 mutations (missense mutations, indels, frameshift mutations, duplications) in CA patients are continuously increasing (Guillen Sacoto et al. Citation2015; Saprungruang et al. Citation2018; Fan et al. Citation2019). Of these, the mutations which result in substitution of amino acids are presumed to be deleterious as they could potentially alter both biophysical characteristics and functional aspects of the encoded protein. In KCNH2, the structural and functional aspects of most missense mutations are not fully characterized. The technical complexity surrounding the traditional cell line and animal model-based mutation characterization strategies makes them a relatively less attractive option. On the other hand, the ability to study these missense mutations can be improved based on the information, like conservation of sequences across species (Sim et al. Citation2012), structural attributes (Ittisoponpisan et al. Citation2019), stability (Witvliet et al. Citation2016) and physicochemical properties of polypeptides (Li et al. Citation2014). In this context, we studied LQT2- linked KCNH2 missense mutations (T613M, S641P and G648S) using different bioinformatic webservers and assessed their prediction potential in exploring the genetic sequence – protein structure relationships in terms structure deviation, secondary structure and stability parameters by using different computational methods. The three KCNH2 mutations i.e. T613 (pore helix), S641 (upper end of S6) and G648 (hinge region at which S6 kinks to open the channel) fall in potassium voltage gated channel regions, hence they are ideal candidates to study the mutation induced to perturbations in the structure and function of protein.

2. Methods

2.1. KCNH2 mutation data collection

The molecular genetic data on KCNH2 mutations were retrieved from databases (ex: pubmed, and HGMD) and research publications (Tester et al. Citation2005; Kapplinger et al. Citation2009; Saprungruang et al. Citation2018). The information about KCNH2 mutations including transcript ID, nucleotide and amino acid sequences of KCNH2 gene were collected from KEGG gene database (https://www.genome.jp/kegg/genes.html) and ENSEMBLE (https://www.ensembl.org/) (Table ).

Table 1. Basic characteristics of KCNH2 missense variants analyzed in this study.

2.2. Predicting the deleterious potential of KCNH2 missense mutations by computational methods

We measured the deleterious potential of missense mutations using different computational methods that were trained to discriminate the mutations into deleterious or non-deleterious based on the support vector algorithmic programs. In brief, we submitted the mutation details like chromosome number and base-pair position followed by allele change to an empirical rule-based tool viz. SIFT (Sorting Intolerant from Tolerant) BLink Beta version for homology searching (Ng and Henikoff Citation2003). SIFT provides prediction score for each mutation and classify them as borderline (0.101–0.20) or tolerant (0.201–1.00), potentially damaging (0.051–0.10) and damaging (0.00–0.05). The PolyPhen-2 (Polymorphism Phenotyping version 2) method predicts the probable deleterious effects of amino acid changes based on the structure and function of human proteins (phylogenetic and structure). Input options constitute protein sequence, database ID/accession number and position of the amino acid variants. The PolyPhen-2 score classify the mutations as probably damaging (1.00–0.80), possibly damaging (0.80–0.50) or benign (0.49–0.0) (Adzhubei et al. Citation2010).

2.3. 3-dimensional modeling, structural differences, stability, solvent accessibility and secondary structural analysis of KCNH2 protein models

The Protein Databank (PDB) has a 544 aa (from 333th to 877th aa) long 3-dimensional crystal structure of KCNH2 protein with 3.8 Å resolution (PDB; reference ID is 5VA2). The crystal structure KCNH2 model (of the original 1159 aa) was used as a coordinate to construct the remaining 615 aa long polypeptide chain following the I-Tasser ab-initio and Modeller 9v2, Swiss Model homology modeling approaches, which is basically used when the identical structure is not available, or the target sequence has less than 30% identity. The I-Tasser built protein models were energy minimized, by applying the steepest decent algorithm available in Gromacs software (Roy et al. Citation2010). The energy minimized protein models, structural quality assessment was carried out by PROCHECK (Laskowski et al. Citation1993). The superimposition, secondary structure and solvent accessibility tests were carried out using Pymol software (Rigsby and Parker Citation2016), PDBSUM (Laskowski et al. Citation2018) and DUET (Pires et al. Citation2014) webservers.

2.4. KCNH2 protein structure alignment analysis

The deleterious missense mutations not only change the sequence of amino acids, but they will also alter the physical configuration of proteins by inducing physical constraints both at residue and whole structure levels. In this study, we performed structural alignment of mutant and wild type KCNH2 proteins by superimposing their Cα traces and backbone atoms to calculate the root mean squared deviation (RMSD). A RMSD score of <2.0 Å for proteins and <0.2 Å for amino acids represents that the given amino acid substitution is deleterious to the structure of protein (Zemla Citation2003; Kufareva and Abagyan Citation2012).

2.5. Molecular docking for protein-inhibitor complex

AutoDock program was used to execute a docking simulation between the known sodium–calcium exchange inhibitor and KCNH2 protein. The binding conformation of inhibitor to the protein was analyzed by Lamarckian Genetic Algorithm (LGA). The protein structure was charged by adding charged ions and Histidine molecules to neutralize the whole protein. Finally, gigester charges were applied to neutralize the charges on the whole protein. Each atom type of the ligand was placed in a grid point, and interaction energy with all atoms of the protein was calculated. The default parameters used in constructing grid parameter file are 60 × 60 × 60 in X, Y, Z directions with an 0.367 Å spacing in the grid. Autodock MGLtools were used to prepare the docking parameter file. Lamarckian genetic algorithm with default parameters like the genetic algorithm settings set to 150 runs, 150 possible conformations; population size to 50 individuals; and the number of energy evaluations to 2,500,000 were used. In the RMSD based on positional all atoms, the results with a difference of less than 1 Å were gathered. At the end of docking, ligands with the maximum promising binding energy were selected. The resultant complex structures were explored using Pymol-0.98 program.

3. Results

3.1 KCNH2 mutation details and pathogenicity prediction

SIFT depends on nucleotide sequence homology across species and amino acid properties to classify any amino acid substitution as neutral or pathogenic to the protein functionality. In the present study, SIFT analysis predicted that T613M, S641P, and G648S variants are deleterious because all of them possessed ‘0’ prediction score. Any variant which is predicted to have >0.1 score by SIFT is deleterious. PolyPhen-2, which takes into account variant effects on nucleotide sequence and structural conservation changes, has also predicted T613M, S641P and G648S variants as damaging. PolyPhen-2 considers any variant as damaging if its prediction score is above 0.9 (Table ).

Table 2. Shows pathogenic predictions scores and amino acid changes of all three CA linked KCNH2 mutaions.

3.2 Stability analysis

The location and type of a substituted amino acid residue determine the stability of protein. In particular, amino acid substitutions that lead to changes in their orientation i.e. from exposed to buried state or vice versa may affect the secondary properties (ex. solvent accessibility) and also structural stability of proteins. In this context, we examined the protein stability (in terms of free energy values) and solvent accessibility of Thr613Met, Ser641Phe and Gly648Ser mutations using DUET web server. The negative free energy values suggest the query mutation could destabilize the protein, hence deleterious (Table ).

3.3 KCNH2 protein modeling

3.3.1 3-dimensional (3D) protein modeling.

The KCHN2 sequence (1159 aa) to protein structure alignment in Blastp program identified that experimentally solved structures available from Protein Data Bank (PDB) have a maximum sequence coverage of 72%. The structure of this covered protein structure spans over a NH2-terminal (1–140 aa), transmembrane segment (337–877 aa) and COOH terminal (1006–1159 aa) regions. The structure of missing regions lying in between 141 aa to 332 aa & 878 aa to 1005 aa is generated through I-Tasser Tool. From the top 5 predicted models, the best KCNH2 protein structure was selected based on its confidence score, template model and root mean square deviation score (Supplementary Table 1). High-quality polypeptides showed a C-score of −3.44 (Chain 1) and −3.77 (Chain 2). Using Modeller and Swiss Model tools we joined all the chains (experimentally solved structures and ab-initio models) and built 100 different confirmations of KCNH2 full-length model with open state pore. The built model was energy minimized and further used as a template to build KCNH2 mutated models (T613M, S641P and G648S). The stereochemical quality of the proteins are validated using the Ramachandran plot, which revealed that 97% amino acids are in core region and only 3% of amino acids were in disallowed regions (Figure ).

Figure 1. Molecular view of built KCNH2 model (a) cytosolic N-terminal PAS domain (b) voltage gated transmembrane domain (VAS) (c) cytosolic C-terminal domain.

Figure 1. Molecular view of built KCNH2 model (a) cytosolic N-terminal PAS domain (b) voltage gated transmembrane domain (VAS) (c) cytosolic C-terminal domain.

3.3.2 Structure analysis

The core Per-Arnt-Sim (PAS) domain  (cytosolic N-terminal region) of KCNH2 protein consists of five α-helices and four β-sheets. The transmembrane segment (VSD) consists of six α-helices, where the helices 1 to 4 make up voltage sensor domain and the helices 5,6 cytosolic pore domain. The nucleotide-binding domain (cytosolic C-terminal region) consists of five α-helices, and seven β-sheets. The mutated residue T613 is located in the short helix (connecting the S5 and S6 segments), and residues S641 and G648 are located in the S6 helix which lines the inner channel pore (Figure ). In structure-based superposition method, the differences of the Cα-atom coordinates, between the structures or subset of atoms are estimated in form of RMSD by rotating them in 3D space. All the mutant KCNH2 models (T613M, S641P and G648S) demonstrated 99.68% sequence identity in their amino acid sequence excluding the mutated residue. No significant difference in the main structure of KCNH2 protein in mutant conditions, as reflected by the RMSD scores of individual missense mutations (RMSD scores are <0.2 Å), is observed. However, all 3 KCNH2 variants portrayed structural differences at amino acid residue level, as reflected by their RMSD score (>2 Å). All the KCNH2 amino acid variants T613M (2.48 Å), S641P (2.33 Å) and G648S (1.86 Å) showed significant structural deviations compared to the wild type amino acid residues. The secondary structure of KCNH2 protein molecule is characterized by 4 β-sheets, 4 β-hairpins,12 β-strands, 38 α-helices, 44 helix-helix interactions, 117 β-turns and 29 γ-turns. The native threonine residue at 613 amino acid position is localized in helix 26 of KCNH2 molecule and the mutant Methionine residue does not induce any change in its helix position. Similarly, both Ser641Phe and Gly648Ser, localized in helix 27 position and does not induce any change in its orientation. None of the T613M, S641P and G648S variants have shown any changes in any of the secondary structure properties (α-sheets, β-hairpins, β-strands, helices, helix-helix interactions, β-turns and γ-turns) of KCNH2 protein (Figures and ).

Figure 2. Shows the superpose simulation of both wild type KCNH2 against its mutated protein models, with description of the amino acid changes and their RMSD values at the position of the all mutations.

Figure 2. Shows the superpose simulation of both wild type KCNH2 against its mutated protein models, with description of the amino acid changes and their RMSD values at the position of the all mutations.

Figure 3. KCNH2 mutations site secondary structural elements at KCNH2 Wild and mutant models.

Figure 3. KCNH2 mutations site secondary structural elements at KCNH2 Wild and mutant models.

3.4 Docking study of KCNH2 and ligand molecule

KB-R7943 is a well-known inhibitor of the reverse Na+/Ca2+ exchange and it plays a pivotal role in regulating contractility and electrical activity in the heart (Antoons et al. Citation2012). From docking analysis, it is revealed that the KB-R7943 inhibitor strongly binds at the transmembrane channel (S4 and S5) of wildtype KCNH2. The nitrobenzyl and methyl sulfanyl methane diamine groups of KB-R7943 binds to Ala558 and Thr623 residues (S4 and S5 pores) of wildtype KCNH2, with the lowest binding energy of 6.1 Kcal/Mol. The KCNH2 in two mutant forms (Ser641 and Gly648), binds to the inhibitor at S4 and S5 voltage transmembrane regions of the protein (Asp609, Thr623, Asn629 and Thr613) by forming more stable bonds with inhibitor showing highest binding energy of 7.8 and 7.7 Kcal/Mol (). However, the Thr613 mutant protein binds to the inhibitor in a different pose, involving Glu575 and Ser636 residues of S5 and S6 pores of KCNH2 voltage channels, with an binding energy of 7.1 K.Cal/Mol. Interestingly, in all the docking interactions nitrobenzyl group forms strong H-bond interactions with Thr623, which eventually alters the confirmations of pore or cytoplasmic tail of the KCNH2. So, from our analysis, we predict that KB-R7943, a Na+/Ca2+ exchange inhibitor can effectively bind to KCNH2 even under mutant conditions (Figure ).

Figure 4. Illustration of the docking simulation combining all KB-R7943 with KCNH2 with (a) wild type protein model (b) mutant Thr613Meth protein model (c) mutant Ser641Phe protein model (d) mutant Gly648Ser protein model.

Figure 4. Illustration of the docking simulation combining all KB-R7943 with KCNH2 with (a) wild type protein model (b) mutant Thr613Meth protein model (c) mutant Ser641Phe protein model (d) mutant Gly648Ser protein model.

Table 3. Auto dock binding energies of KB-R7943 with KCNH2 wild and mutant forms.

4. Discussion

The recent molecular genetic reports have implicated several nucleotide sequence alterations with many inherited arrhythmic diseases, more particularly, ‘ion channel diseases,’ where genetic mutations affect functional units of ion channels and/or their transporter-associated proteins in patients without primary cardiac structural abnormalities (Lieve Citation2015; Imberti et al. Citation2016). KCNH2 (previously known as hERG) gene encodes subunits of potassium channels, which transports potassium ions to the cells’ exterior and determinate the generation and transmission of electric signals. Potassium channels made KCNH2 actively recharge the cardiac muscle in order to maintain the regular rhythm after each heartbeat (Gianulis and Trudeau Citation2011). KCNH2 gene is localized at 7q35–7q36, and it consists of 15 exons, reads in reverse direction and encodes a protein of 1159 amino acids that forms a VGK (rectifier voltage gated K+) channel. Genetic mutations in KCNH2 can reduce IKr (a repolarizing current in the heart) slow action potential (AP) repolarization and can trigger cardiac arrhythmias. Mutations in KCNH2 cause different forms of CAs. For example, approximately 38% of patients with CA possess mutations in KCNH2. According to HGMD, Ensemble and NCBI data bases, a total of 364 genetic mutations in KCNH2 gene has been described in Cardiac Arrhythmias [356 genetic mutations in long QT syndrome (LQT) and 8 genetic mutations in Short QT syndrome (SQT)]. Interestingly, all the 364 variants are non-synonymous variants, which mean they can change the amino acid they encode in corresponding polypeptides.

Here in this study we assessed the structural impact of T613M, S641P and G648S mutations on biophysical properties of KCNH2 protein properties using different computational approaches (Tester et al. Citation2005; Kapplinger et al. Citation2009; Saprungruang et al. Citation2018). We first screened them with SIFT and PolyPhen-2, both of which involves genetic sequence conservation and protein structure properties to identify the deleterious potential of missense mutations. A study by Hicks et al. (Citation2011) has demonstrated that out of different test tools, SIFT and PolyPhen-2 tools showed better performance in identifying the damaging potential of missense mutations (Hicks et al. Citation2011). The accuracy of both tools was further tested by few other studies as well (Gnad et al. Citation2013; Shaik et al. Citation2018; Shaik et al. Citation2019). In case if the true positive mutation is predicted as benign by computational tools, then one should carefully interpret variant pathogenicity based on the clinical assessment. The variant prediction results, when combined with 3D structure information of protein molecule can help us to take a better insight into the variant induced effects. However, not all KCNH2 mutations are mapped in the 3D space of the KCNH2 protein, because it is technically infeasible. Therefore, we constructed the computational 3D model of KCNH2 protein and tried to explore the molecular connections between KCNH2 gene mutations and CA. The superposing of 3D structures of wild type and mutant residues allows us to examine them in different confirmations by estimating the RMSD differences in atomic coordinates. We observed that T613M, S641P and G648S variants bring minor structure related changes in the KCNH2 protein molecule at the site of amino acid residue they affected. The differences in the RMSDs could be attributed to chemical side chain, volume, molecular weight, orientation of amino acids. These RMSD differences disturb the hydrogen bonds, ionic bonds Van der Waals force, and electrostatic charges, existing between the amino acids which maintain the conformation of the polypeptide chains in KCNH2 molecule (Shaik et al. Citation2018).

The KCNH2 protein contains six transmembrane segments (S1-S6) where S1-S4 maintains voltage sensor domain (VSD) and S5-S6 in addition to intervening pore loop forms the Pore domain (Morais Cabral et al. Citation1998). The tetrameric functional channel containing pore domain responsible for ion conduction in the cell. The KCNH2 protein contains 3 functional domains. i.e. cytoplasmic NH2 terminal, COOH-terminal and transmembrane domains. The N-terminal contains a Per-Arnt-Sim (PAS) domain that defines a subfamily of VGK channel proteins. The C-terminal segment contains cyclic nucleotide binding domain (cNBD) (Warmke and Ganetzky Citation1994). The KCNH2 gene sequence in vertebrates showed a significant correlation between exon boundaries and protein domains. The exon 1 region encodes for 5’ UTR and 26 amino acid residues with a known role in ion channel deactivation kinetics. Exon-2&-3 encompasses the PAS domain. The cytoplasmic NH2 terminal region and transmembrane segments are encoded by exon 6. The fourth transmembrane segment of protein is encoded by exon-7 region. This segment is known as the principle voltage-sensing region within domain VSD along with the majority of pore domain. It is noticeable that the boundary region between exon −7 & −8 consists of glycine residue which is by and large seen to be highly conserved in all potassium channels and also determines the boundaries in between pore cavity upper and lower proportions. The c-linker and cNBD domains are encoded by exons 8 and 10, whereas the remaining 5 exons encode distal COOH terminal part of the protein (Jiang et al. Citation2002).

The KB-R7943 is a known pharmacological agent that can inhibit Na+/Ca2+ across plasma membranes. It reversibly inhibits the Na(+)/Ca(2+) inward and outward movements in a dose-related fashion and does not affect sodium-dependent transport systems (Iwamoto et al. Citation1996). KB-R7943 is also known to suppress arrhythmias (induced by ouabain) by blocking the reverse-mode Na+ and Ca2+ exchanger (NCX). The anti-arrhythmic and cardioprotective effects of KB-R7943 are well documented in multicellular and intact heart preparations (Nakamura et al. Citation1998; Mukai et al. Citation2000). Further anti-arrhythmic effects were shown in Na+ overload induced by glycosides or sodium channel facilitators (Amran et al. Citation2004). When administered in-vivo, the KB-R7943 was able to prevent the development of atrial fibrillation (Miyata et al. Citation2002). Currently the therapeutic administration of KB-R7943 to CA patients is independent of their mutational status of KCNH2. We sought to check if KCNH2 mutations could account the variations in responding to anti-arrhythmic drug molecules. Therefore, we performed molecular docking to understand the differences in binding efficiencies of wild type and mutant KCNH2 forms with commonly used arrhythmic drug inhibitor i.e. KB-R7943. Our molecular analysis revealed that the binding energy range of KB-R7943 inhibitor with KCNH2 wild type and mutant forms [Thr613Met, Ser641Phe and Gly648Ser] is in between 7.4 Kcal to 7.7 Kcal/mol. The altered amino acid sequences in KCNH2 (Thr613Met, Ser641Phe and Gly648Ser) may lead to some changes in the hydrophobic, electrostatic and Van der Waals interactions between the ligand and protein molecules. Interestingly, both wild type and mutant forms showed same binding energies, even though the position of binding is different in different mutated forms. So, from our analysis we conclude that KB-R7943, a Na+/Ca2+ exchange inhibitor can effectively function even in CA patients with KCNH2 mutations. This concludes that the mutant KCNH2 (even with residue level structural deviation) shows same binding affinity towards KB-R7943 as that of its native protein form. Furthermore, we predict that these comparative studies create a direction to development of competitive peptide inhibitors for KCNH2 mutation positive patients.

In summary, this study has used different computational methods to screen the pathogenicity of cardiac arrhythmia linked KCNH2 missense (Thr613Met, Ser641Phe and Gly648Ser) mutations. The SIFT and PolyPhen-2 methods, which depends on sequence conservation across species, and structure-based information have correctly attributed pathogenicity to these variants. By superimposing the 3D structures of KCHN2 mutant proteins we found that these missense mutations induce some minor structural alterations at the amino acid residue level. These mutations are also seen to destabilize the KCNH2 protein molecule, hence its function. This study also points that the mutant KCNH2 (even with residue level structural deviation) shows same binding affinity towards KB-R7943 drug molecule as that of its native protein form. However, the real impact of Thr613Met, Ser641Phe and Gly648Ser missense mutations on the biophysical and chemical properties of KCNH2 protein can be better interpreted when the computational protein phenotype findings are confirmed by wet laboratory experiments. Nonetheless, the computational approaches we used in this study could help to quickly study the CA linked KCNH2 mutations before initiating time-consuming lab assays.

Supplemental material

Supplemental Material

Download MS Word (14.4 KB)

Acknowledgements

The authors, therefore, acknowledge with thanks DSR technical and financial support.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was supported by Deanship of scientific research (DSR), King Abdulaziz University, Jeddah, Saudi Arabia [grant number KEP-6-140-38].

References