771
Views
2
CrossRef citations to date
0
Altmetric
Review

Comparative genomics of HORMA domain-containing proteins in prokaryotes and eukaryotes

ORCID Icon
Pages 2531-2546 | Received 03 Jul 2018, Accepted 02 Nov 2018, Published online: 04 Dec 2018

ABSTRACT

In eukaryotes, critical regulation of cell cycle is required to ensure the integrity of cell division. HORMA-containing proteins include various proteins that contain HORMA domain and play important role in the regulation of cell cycle in eukaryotes. Many types of HORMA-containing proteins are found in eukaryotes, but their role in prokaryotes has not been proven. Therefore, we conduct an extensive search in GenBank for HORMA-containing proteins in prokaryotes to compare HORMA domain structure and architecture across eukaryotes and prokaryotes. Strikingly, genome sequencing for many prokaryotic organisms reveals that HORMA domain is present in many bacterial genomes and only two archaeal genomes. We perform sequence alignment and phylogenetic analysis to trace the evolutionary link between HORMA domain in prokaryotes and eukaryotes. HORMA domain in prokaryotes appears to vary in sequence and architecture. Interestingly, seven bacterial HORMA-containing proteins and the two archaeal HORMA-containing proteins showed close relationships with eukaryotic HORMA-containing proteins. Additionally, we uncovered remarkable close relationships between HORMA-containing protein from Chlamydia trachomatis and eukaryotic MAD2 proteins. Our results provide insights into evolutionary relationships between prokaryotic and eukaryotic systems, which facilitate our understanding of the evolution of cell cycle regulation mechanisms.

Introduction

Cell cycle in eukaryotes is a complicated process that involves multiple events performed by many proteins with various functions. In all cellular organisms, organization of the cell cycle and DNA repair is controlled by crucial mechanisms. HORMA domain-containing proteins are a group of proteins that are known to be pivotal in cell division regulation and DNA repair. This group has been identified in Schizosaccharomyces and derives its name from the initial letters of the names of three types: HOP1, REV7 and mitotic arrest deficient 2 (MAD2) [Citation1,Citation2].

HOP1 protein is essential for the synaptonemal complex assembly and chromosome synapsis in meiosis. In Arabidopsis, ASY1 is a homolog to HOP1 in Saccharomyces cerevisiae, which is involved in meiosis in male and female gametophytes [Citation3,Citation4]. In humans, other types of HORMA proteins, HORMAD1 and HORMAD2, are known to be involved in meiotic progression. They are also homologs to Saccharomyces HOP1 [Citation5Citation7]. Both HORMAD1 and HORMAD2 are important for synapsis surveillance and crucial for the male mid-pachytene checkpoint and the female meiotic prophase checkpoint [Citation6,Citation8].

REV7 is the DNA polymerase zeta (Pol ζ) processivity subunit which interacts with REV3, the catalytic subunit in Pol ζ [Citation9]. REV3 and REV7 are error-prone DNA polymerases involved in translation DNA synthesis polymerases to repair DNA damage during DNA replication [Citation10,Citation11]. Names of some HORMA-containing proteins can be different in the other eukaryotes. For instance, MAD2 is named MAD2L1 or MAD2A in other organisms. REV7 is also known as MAD2L2 or MAD2B. These proteins perform overlapping roles in the mitotic spindle assembly checkpoint, and they delay the initiation of anaphase until all chromosomes are correctly arranged in the cell midline at metaphase. Mutations in MAD2L1 and 2 in humans are related to various types of cancer and numerical chromosomal abnormalities [Citation12,Citation13]. The structure and function of REV7 and MAD2 genes in plants are highly conserved. REV7 in Arabidopsis is known to be involved in damage-tolerance mechanisms through translesion DNA synthesis [Citation14,Citation15].

Specific types of Autophagy-related (ATG) genes that control autophagy in cells have recently been discovered to contain HORMA domain [Citation16]. Autophagy-related 13 is an autophagy factor required for autophagosome formation and mitophagy. This gene was originally identified in yeast, and it has clear orthologs in Arabidopsis [Citation17], Drosophila melanogaster [Citation18], Caenorhabditis elegans [Citation19] and in humans [Citation20].

Although C. elegans has a homolog to MAD2 in yeast, HORMA proteins in the worm appear to be distinct from other eukaryotes [Citation21,Citation22]. C. elegans has four distinct HORMA proteins: Him-Three Paralog 1 (HTP1), HTP2, HTP3 and High Incidence of Males (HIM3). HTP1 and HTP2 are involved in regulation of centriole-centriole cohesion, while HTP3 performs a critical role in meiotic DNA double-strand break formation and synapsis. High Incidence of Males shares overlapping and divergent roles with HTP1, HTP2 and HTP3 in homologous chromosome segregation, DNA break formation and recombination, regulation of centriole-centriole cohesion and synaptonemal complex assembly [Citation21Citation23].

Cell cycle control in prokaryotes is less complexity than in eukaryotes. However, there are evidences about the existence of cell-cycle regulation in bacteria which cell cycle is arrested before DNA replication initiation or after chromosome segregation. Unsegregated chromosomes block assembly of the FtsZ ring, the essential proteins that triggers the accumulation membrane and cell wall proteins between the dividing bacterial cells [Citation24Citation27] during cytokinesis in prokaryotes [Citation28]. Additionally, previous studies indicate that some crenarchaea species initiate replication from multiple origins of replication as seen in eukaryotes [Citation29,Citation30]. Moreover, checkpoint-like regulation also is seen in crenarchaea via Cdv proteins that form intracellular structures during constriction. These proteins are induced in normal cell division at the initiation of genome segregation to carry out cytokinesis. In response to DNA damage, Cdv proteins are down-regulated and subsequently inhibit cell division. Cdv-based cell division systems bear some similarity to eukaryotic checkpoint systems, suggesting similar regulation mechanisms [Citation31].

Molecular mechanisms that regulate mitosis or meiosis in eukaryotes are apparently different from that in cell cycle control in prokaryotes. The high level of organization and the complexity of cell division in eukaryotes relative to prokaryotes prompts questions regarding the origin and evolution of eukaryotic cell division mechanisms. Advances in comparative genomics and bioinformatic tools may potentially improve our understanding of the evolution of organisms. It is commonly known that many proteins which are involved in cell division are conserved among prokaryotes and eukaryotes, such as ATPase family proteins [Citation32], several key enzymes of the apoptotic machinery [Citation33] and structural maintenance of chromosomes complex [Citation34]. However, no experimental reports describe the role of HORMA-containing proteins in cell division in prokaryotes. Nevertheless, sequencing for some bacterial genomes demonstrates that a conserved HORMA domain is present in many bacterial genomes [Citation35]. Therefore, we aim to investigate the structure and architecture of HORMA domain through a broad survey in GenBank for HORMA-containing proteins in prokaryotes. Comparison and phylogenetics of prokaryotic and eukaryotic proteins will help in understanding of the evolution of cell division machinery in prokaryotic and eukaryotic systems.

Experimental procedures

HORMA-containing proteins were retrieved by sequence homology searches in NCBI (http://www.ncbi.nlm.nih.gov/genome) and the UniProt database (http://www.uniprot.org/). The HORMA protein sequences of human (MAD2, NP_002349.1; REV7, NP_006332.3; HORMAD1, NP_001186758.1; HORMAD2, NM_001329457.1), S. cerevisiae (HOP1, NP_012193.3; REV7, NP_012127.1; MAD2, NP_012504.3; ATG13, AJV99010.1), Arabidopsis thaliana (REV7, NM_101522.4; MAD2, NM_001203049.1; ASY1, NM_101522.4; ASY2, NM_119372.3, ATG13, NM_114819.3) were utilized as query sequences in NCBI BlastP and UniProt to retrieve HORMA-containing proteins in eukaryotes and prokaryotes. Among all hits found, HORMA-containing proteins from specific taxonomically-representative eukaryotic organisms were selected to investigate HORMA domain structure. HORMA-containing proteins from the worms; C. elegans, the brown algae; Ectocarpus siliculosus, the euglena; Trypanosoma cruzi and the amoeba; Tieghemostelium lacteum were selected to represent eukaryotic organisms in addition to humans, yeast and Arabidopsis. All HORMA-containing proteins discovered in these eukaryotes were selected for further investigation.

Relative to NCBI, more HORMA-containing proteins in prokaryotic genomes were found in the UniProt database. From 111 hits found in bacterial genomes in UniProt, 20 bacteria were selected to represent the large taxonomic divisions of bacteria. These include HORMA-containing protein from each of the following: Chlamydia trachomatis, Polaribacter dokdonensis DSW-5, Clostridium sp. IBUN13A, Pseudomonas aeruginosa, Bacteroidales bacterium, Desulfovibrio africanus, Candidatus Delongbacteria bacterium, Streptomyces purpurogeneiscleroticus, Achromobacter spanius, Sulfitobacter geojensis, Serratia marcescens subsp. marcescens, Roseomonas rhizosphaerae, Parvibaculum lavamentivorans, Rubricoccus marinus, Sphingopyxis bauzanensis, Planctomycetaceae bacterium, Streptomyces sp., Rickettsiales bacterium, Cytophagales bacterium and Flavobacterium granuli. Only one HORMA-containing protein was found in each of the selected bacteria. A homology search in UniProt revealed only two HORMA-containing proteins in archaea. Both Halorubrum ezzemoulense and Halorientalis regularis contained only one protein. lists accession numbers for all selected proteins.

Table 1. HORMA-containing proteins from taxonomically representative eukaryotic and prokaryotic organisms. HORMA-containing proteins from human, yeast and Arabidopsis were used as query sequences in NCBI BlastP and UniProt to retrieve HORMA-containing proteins in eukaryotes and prokaryotes. Twenty-nine eukaryotic proteins were selected from NCBI and 22 prokaryotic proteins were selected from UniPro for phylogenetic analysis and investigation of HORMA-containing proteins structure and architecture (Accession numbers shown in the 3rd column).

All protein sequences were aligned by MUSCLE version 3.8 [Citation36]. The multiple sequence alignment results were imported to the UniGene software package [Citation37]. The alignment results were assessed to remove the repeated and partial sequences. The phylogenetic tree was constructed based on the alignment of 29 eukaryotic, 20 bacterial and two archaeal HORMA-containing protein sequences. MEGA 7.0.26 [Citation38] was used to construct phylogenetic trees through neighbor joining (NJ) [Citation39] and maximum likelihood (ML) [Citation40] methods using default settings. The constructed phylogenetic tree was also visualized by MEGA 7.0.26.

To compare HORMA domain structure, HORMA domain sequences from each of the selected proteins were determined using the InterPro database [Citation41] and aligned using MUSCLE version 3.8. HORMA domain architecture was retrieved in all protein sequences using ScanProsite [Citation42]. In prokaryotic proteins, HORMA domain architecture did not appear through ScanProsite searches except Chlamydia trachomatis, Cytophagales bacterium and Rickettsiales bacterium. Therefore, HORMA domain architecture in other prokaryotes was drawn using MyDomains- Image Creator (https://prosite.expasy.org/cgi-bin/prosite/mydomains/) [Citation43] based on protein structure information from InterPro.

Results and discussion

Phylogenetic relationships

All selected eukaryotic HORMA-containing proteins belong to one of the following HORMA-containing protein types; HOP1, REV7, MAD2, ATG13, ASY1, ASY2, HORMAD1 and HORMAD2, except one HORMA-containing protein from each of the following protozoa: Trypanosoma cruzi and Tieghemostelium lacteum; these were uncharacterized. All prokaryotic HORMA-containing proteins were uncharacterized except glycine dehydrogenase (GLDC) from Polaribacter dokdonensis and glycosyl transferase (GTR) protein from Bacteroidales bacterium. To clearly investigate phylogenetic relationships, proteins which are known by multiple names in NCBI or UniProt were assigned a unified name for this study. For instance, MAD2A and MAD2L1 were named “MAD2”, whereas MAD2L2 and MAD2B were named “REV7”. Protein sequences that are present in NCBI without clear protein names or which are named “hypothetical protein or HORMA protein” were assigned the names from UniProt for this study. Other proteins without clear protein names in both NCBI and UniProt were referred to as HORMA-containing proteins. To investigate the relationships between HORMA-containing proteins in prokaryotic and eukaryotic systems, we conducted a phylogenetic analysis for 29 HORMA-containing proteins from eukaryotes and 22 HORMA-containing proteins from prokaryotes using two methods, NJ and ML.

In NJ phylogeny ()), HORMA-containing proteins were divided into two major groups. The first group was composed primarily of eukaryotic proteins, with two archaeal proteins and five bacterial proteins. This group was divided into two subgroups and the first subgroup was split into two branches. The first branch consists of all eukaryotic REV7 and MAD2 proteins, in addition to the Tieghemostelium lacteum uncharacterized HORMA-containing protein and two bacterial HORMA-containing proteins from the Chlamydia trachomatis and Rickettsiales bacterium. The Chlamydia trachomatis HORMA-containing protein descended from an interior node with S. cerevisiae MAD2 protein with a bootstrap value of 59%. The Rickettsiales bacterium formed a separate branch, which was supported with a 78% bootstrap value from the first branch that included all MAD2 and REV7 proteins. Arabidopsis thaliana ASY1 and E. siliculosus HOP1 descended together from second subgroup of the first branch. The second branch consisted of Trypanosoma cruzi uncharacterized HORMA-containing protein, S. cerevisiae HOP1, C. elegans HTP1, Arabidopsis thaliana ASY2 and ATG13, and E. siliculosus ATG13. The second subgroup consisted of HORMA-containing proteins from the two archaea, Halorubrum ezzemoulense and Halorientalis regular, along with human HORMAD1 and HORMAD2 and C. elegans HIM3 with three bacterial HORMA-containing proteins from Clostridium sp., Candidatus Delongbacteria and Flavobacterium granuli.

Figure 1. Phylogenetic phenogram tree produced from the alignment of 29 eukaryotic and 22 prokaryotic HORMA-containing proteins. Multiple sequence alignment was constructed by MUSCLE 3.8 and phylogenetic trees generated using MEGA 7.0.26 software. (a) Phylogenetic tree constructed by the NJ method, (b) Phylogenetic tree constructed by ML method. Numbers on nodes are bootstrap percentages supporting a given partitioning. The proteins are designated by the genus name followed by the name of protein. Uncharacterized proteins are named by genus name followed by “HORMA”. Streptomyces purpurogeneiscleroticus # A0A0M8ZCY4 named as “StreptomycesHORMA”, Streptomyces sp. # A0A2A2Z569 named as “StreptomycesHORMA1”. Accession numbers for all proteins are listed in . Red, blue and green circles indicate for eukaryotic, bacterial and archaeal HORMA-containing proteins, respectively.

Figure 1. Phylogenetic phenogram tree produced from the alignment of 29 eukaryotic and 22 prokaryotic HORMA-containing proteins. Multiple sequence alignment was constructed by MUSCLE 3.8 and phylogenetic trees generated using MEGA 7.0.26 software. (a) Phylogenetic tree constructed by the NJ method, (b) Phylogenetic tree constructed by ML method. Numbers on nodes are bootstrap percentages supporting a given partitioning. The proteins are designated by the genus name followed by the name of protein. Uncharacterized proteins are named by genus name followed by “HORMA”. Streptomyces purpurogeneiscleroticus # A0A0M8ZCY4 named as “StreptomycesHORMA”, Streptomyces sp. # A0A2A2Z569 named as “StreptomycesHORMA1”. Accession numbers for all proteins are listed in Table 1. Red, blue and green circles indicate for eukaryotic, bacterial and archaeal HORMA-containing proteins, respectively.

The second major group in the NJ tree consisted of 15 bacterial and five eukaryotic HORMA-containing proteins. This group contained two subgroups. The first subgroup was comprised of C. elegans ATG13 and HORMA-containing protein from the Cytophagales bacterium, which descended from an interior node supported with a 65% bootstrap value. C. elegans HTP3 protein comprised a separate branch from this subgroup, with a bootstrap support value of 80%. The second subgroup branched into two clusters. The first cluster consisted of three eukaryotic ATG13 proteins from humans, S. cerevisiae and Tieghemostelium lacteum with the bacterial protein GLDC from Polaribacter dokdonensis. The second cluster consisted of 12 bacterial proteins, including Rubricoccus marinus, Planctomycetaceae bacterium, Sphingopyxis bauzanensis, Desulfovibrio africanus, Sulfitobacter geojensis, Roseomonas rhizosphaerae, Parvibaculum lavamentivorans, Pseudomonas aeruginosa, Achromobacter spanius, Serratia marcescens, Streptomyces sp. and Streptomyces purpurogeneiscleroticus. Bacteroidales bacterium GTR formed a separate branch from the second major group in the NJ tree, with an 87% bootstrap support value.

The same relationships between prokaryotic and eukaryotic HORMA-containing proteins were nearly confirmed by the ML phylogenetic tree ()). The first group was divided into two subgroups, which demonstrated the same clustering for eukaryotic, bacterial and archaeal proteins, with some exceptions. Unlike the NJ tree, the two bacterial proteins; Bacteroidales bacterium GTR and Cytophagales bacterium HORMA-containing protein, with C. elegans ATG13 clustered with the first group in ML tree instead of clustering in the second group in NJ tree. Bacteroidales bacterium GTR and the Cytophagales bacterium HORMA-containing protein were clustered with REV7 and MAD2 proteins in the ML tree, whereas C. elegans ATG13 formed a separate branch from the first subgroup in the ML tree. Trypanosoma cruzi REV7 and MAD2 were the closest proteins to Bacteroidales bacterium GTR, with bootstrap support value of 89% in the ML tree. Additionally, the HORMA-containing protein from Flavobacterium granuli appeared to be the closest protein to the archaeal proteins in the NJ tree, while it clustered with the eukaryotic REV7 in the ML tree. However, the HORMA-containing protein from Trypanosoma cruzi was the closest to the two archaea in the ML tree. Clustering of Polaribacter dokdonensis GLDC also varied between the NJ and ML trees. In the NJ tree, Polaribacter dokdonensis was the closest to human ATG13 protein, while in the ML tree, this protein clustered with the 12 bacterial proteins in the second group. Unlike the NJ tree, Arabidopsis thaliana ASY1 and E. siliculosus HOP1 clustered with the second subgroup in the ML tree rather than clustering with proteins in the first subgroup in the first group of the NJ tree.

The close relationships between the Chlamydia trachomatis HORMA-containing protein and S. cerevisiae MAD2 protein in the NJ tree were also confirmed in the ML tree with a bootstrap value of 66%. The close relationship between HORMA-containing proteins from the two archaea, Halorubrum ezzemoulense and Halorientalis regular, was confirmed by both the NJ and ML tree. These proteins descended from an interior node with a bootstrap support value of 52% in the NJ and ML trees. Similarly, the relationships between the two bacterial HORMA-containing proteins from Clostridium sp. and Candidatus Delongbacteria, and the two archaeal proteins were confirmed by both phylogenetic trees. These bacterial proteins descended from an interior node with bootstrap support value of 71% in the NJ and 82% in the ML tree. However, clustering of eukaryotic proteins in the subgroup that included these bacterial and archaeal proteins varied between the two phylogenetic trees. In the NJ tree, this subgroup included C. elegans HIM3 and human HORMAD1 and HORMAD2, in addition to Flavobacterium granuli. Conversely, the same subgroup in the ML tree included Arabidopsis ASY2 and HORMA-containing protein from Trypanosoma cruzi, together with Clostridium sp., Candidatus Delongbacteria and the two archaeal proteins.

HORMA domain architecture

The HORMA domain architecture in all the proteins investigated was retrieved and drawn through PROSITE. illustrates HORMA domain architecture for some eukaryotic and prokaryotic proteins. In eukaryotes, HORMA domain size ranged from 177 amino acids in Trypanosoma cruzi MAD2 protein, to 264 amino acids in C. elegans HTP3 ()). Domain architecture varied across various eukaryotic HORMA-containing proteins. MAD2 and REV7 proteins appeared to be composed entirely of HORMA domain, except Tieghemostelium lacteum REV7, which contained HORMA domain in its C-terminal region. Other eukaryotic HORMA-containing proteins were longer than MAD2 and REV7 and had different architecture, which contained HORMA domain in the N-terminal region. ASY2 protein from Arabidopsis was unique HORMA-containing protein (1399 aa length). The NCBI and UniProt BLAST searches did not reveal any homologs for the ASY2 protein in eukaryotes or prokaryotes. The Arabidopsis ASY1 protein contained SWIRM domain with 99 amino acids in addition to HORMA domain. SWIRM domain presents in proteins that are involved in chromatin modifications and remodeling. Previous studies have reported that SWIRM domain may be involved in the assembly of chromatin-protein complexes [Citation44]. ATG13 in eukaryotes also contained ATG13 domain, which overlapped with HORMA domain in all ATG13 proteins in the N-terminal region. ATG13 HORMA appears to has structural plasticity even at the N terminus of the protein. This plasticity is due to the role of ATG13 in regulating spatiotemporal assembly of the components in the autophagy induction complex [Citation45].

Figure 2. HORMA domain architecture in some HORMA-containing proteins from (a) eukaryotes and (b) prokaryotes. In eukaryotes, HORMA domain size ranged from 177 to 264 aa. Prokaryotic HORMA-containing proteins were classified based on domain size and multiple sequence alignment results into three groups. HORMA-containing proteins in the first group ranged in size from 165 to 167 aa and HORMA domains in these group ranged from 148–159 aa. The second group contained larger HORMA domains (from 196 to 224 aa), while the third group contained smaller HORMA domains (from 52 to 145 aa). ScanProsite search used to retrieve domain architecture in all eukaryotic HORMA-containing proteins and three bacterial proteins; Chlamydia trachomatis, Cytophagales bacterium and Rickettsiales bacterium. HORMA domains architecture in other prokaryotic proteins were drawn using MyDomains- Image Creator based on proteins structure information in InterPro.

Figure 2. HORMA domain architecture in some HORMA-containing proteins from (a) eukaryotes and (b) prokaryotes. In eukaryotes, HORMA domain size ranged from 177 to 264 aa. Prokaryotic HORMA-containing proteins were classified based on domain size and multiple sequence alignment results into three groups. HORMA-containing proteins in the first group ranged in size from 165 to 167 aa and HORMA domains in these group ranged from 148–159 aa. The second group contained larger HORMA domains (from 196 to 224 aa), while the third group contained smaller HORMA domains (from 52 to 145 aa). ScanProsite search used to retrieve domain architecture in all eukaryotic HORMA-containing proteins and three bacterial proteins; Chlamydia trachomatis, Cytophagales bacterium and Rickettsiales bacterium. HORMA domains architecture in other prokaryotic proteins were drawn using MyDomains- Image Creator based on proteins structure information in InterPro.

Based on HORMA domain size and multiple sequence alignment results, prokaryotic HORMA-containing proteins were classified into three groups. The first group contained 11 proteins ranging in size from 165 to 167 amino acids, while HORMA domains in these group ranged from 148 to 159 amino acids. This group contained 11 HORMA-containing proteins: Pseudomonas aeruginosa, Desulfovibrio africanus, Streptomyces purpurogeneiscleroticus, Achromobacter spanius, Sulfitobacter geojensis, Serratia marcescens subsp. marcescens, Roseomonas rhizosphaerae, Parvibaculum lavamentivorans, Rubricoccus marinus, Sphingopyxis bauzanensis and Planctomycetaceae bacterium. Bacterial taxonomy demonstrated that all members of this group belong to Proteobacteria except Rubricoccus marinus which belong to Rhodothermaeota and Planctomycetaceae bacterium which belong to Planctomycetes, Verrucomicrobia and Chlamydiae (PVC) superphylum [Citation46]. Because of HORMA domain in the first prokaryotic group the first group shares similar size and architecture, it is represented in ) by one protein (Desulfovibrio africanus protein). The second group consisted of larger proteins and larger HORMA domains. As displayed in , proteins in the second group ranged from 214 to 302 amino acids, while HORMA domains ranged from 196 to 224 amino acids. This group was comprised of Chlamydia trachomatis, Rickettsiales bacterium and Cytophagales bacterium. These three bacteria belong to the three taxonomic divisions of bacteria; PVC, Proteobacteria and Bacteroidetes, respectively [Citation46]. The third group contained smaller HORMA domains (from 52 to 145 amino acids) and varied protein sizes and domain architectures. This group contained five bacterial proteins and the two archaeal proteins. The bacterial proteins in the third group included Flavobacterium granuli, Candidatus Delongbacteria, Streptomyces sp., Clostridium sp., GLDC from Polaribacter dokdonensis and GTR from Bacteroidales bacterium. The last three bacteria are members of Bacteroidetes, Streptomyces sp. is member of Actinobacteria, and Clostridium sp. is member of Firmicutes, where Candidatus Delongbacteria is an unclassified bacterium [Citation46]. Clostridium sp. HORMA-containing protein, GLDC and GTR were multiple protein domains. HORMA-containing protein from Clostridium sp. contained the helix-turn-helix domain, the DNA-binding motif that exists in transcription regulatory proteins in many bacteria [Citation47]. In addition to HORMA domain, GTR Bacteroidales bacterium contained a domain of galactosyltransferase, which is involved in the biosynthesis of different glycoconjugates and saccharide structures [Citation48]. Stemphylium lycopersici bacterium owns GTR gene that appears to contain also HORMA domain beside the domain of galactosyltransferase. Polaribacter dokdonensis GLDC contained two domains; glycine cleavage system P-protein and pyridoxal phosphate-dependent transferase, in addition to HORMA domain. Glycine cleavage system P-protein domain catalyzes the degradation of glycine. It participates in glycine, serine and threonine metabolism [Citation49]. In eukaryotes, GLDC gene is a mitochondrial gene that shares conserved sequences with its bacterial homologs [Citation50,Citation51]. In contrast to Polaribacter dokdonensis GLDC, the homologs GLDC genes in eukaryotes or other prokaryotes appear not to contain HORMA domain.

Bacteria from PVC superphylum and some archaebacteria appear to have a distinct cell division mechanisms that seem to be exceptions to the dominant mode of prokaryotic cell division by binary fission [Citation52]. Cell division on Planctomycetes, Chlamydia and many archaebacteria is atypical since it occurs in the absence of a sequence homologue of FtsZ [Citation53] and peptidoglycan [Citation54,Citation55]. Moreover, both Planctomycetes and Chlamydia has condensed DNA [Citation56]. Nevertheless, Chlamydiales generally divide by binary fission [Citation57] while Planctomycetes divide by budding process [Citation58]. Recently, Chlamydia has been shown to divide by a polarized cell division similar to the budding process observed in Planctomycetes [Citation59]. Chlamydia trachomatis is sexually transmitted human pathogen. Cell division of this pathogen interests the scientists as the understanding of Chlamydia cell cycle may result in novel therapeutic antimicrobial compounds. Planctomycetes phylum possess a distinct intracellular compartmentalization and unique metabolism patterns apparently comparable to the eukaryotic mitochondrion [Citation60]. The exact mechanisms of cell division in Chlamydia and Planctomycetes are still unknown yet. Some biologists currently claim that bacteria of PVC group are considered evolutionary intermediates in the prokaryotes to eukaryotes transition [Citation56].

HORMA-containing proteins in the first and second prokaryotic groups were almost entirely composed of HORMA domain, except Cytophagales bacterium. This protein with Candidatus Delongbacteria, Clostridium sp., Streptomyces sp. and Bacteroidales bacterium contained HORMA domain in its N-terminal region. However, Flavobacterium granuli and the two archaeal proteins contained HORMA domain in C-terminal region. Polaribacter dokdonensis GLDC enzyme had a unique architecture, in which HORMA domain existed in the central region between the other two domains (glycine cleavage system P-protein and pyridoxal phosphate-dependent transferase) ().

HORMA domain multiple sequence alignment

To provide insights into the structure of HORMA domain, sequence alignment was performed exclusively for HORMA domain sequences from all investigated proteins (). Multiple sequence alignment revealed that the sequence of HORMA domain was highly conserved within the first prokaryotic group. Chlamydia trachomatis HORMA-containing also demonstrated a high degree of sequence conservation with the eukaryotic MAD2 proteins, Trypanosoma cruzi REV7 and HORMA-containing proteins from Tieghemostelium lacteum. Limited conservation was displayed by HORMA domain sequences from all C. elegans proteins, all ATG13 proteins and HORMA domain from each of; Clostridium sp.,Bacteroidales bacterium GTR, Candidatus Delongbacteria, Polaribacter dokdonensis GLDC and Flavobacterium granuli. These proteins also revealed less conservation with HORMA domain sequences from other prokaryotes and eukaryotes. HORMA domains from the two archaeal proteins also demonstrated a lower degree of conservation with bacterial and eukaryotic proteins. Remarkably, the sequence of the HORMA domain from Chlamydia trachomatis was closely related to the sequences of eukaryotic MAD2 proteins. Therefore, further sequence alignment was performed for HORMA domain from MAD2 proteins and HORMA domain from Chlamydia trachomatis to closely compare the HORMA domain structure. As shown in , the HORMA domain sequences of Chlamydia trachomatis was highly conserved with MAD2 protein sequences from unicellular and multicellular eukaryotes. The NCBI BLASTP search revealed a 43% sequence identity between HORMA domain from Chlamydia trachomatis and S. cerevisiae MAD2, and a 42% sequence identity between HORMA domain from Chlamydia trachomatis and Arabidopsis MAD2.

Figure 3. Multiple sequence alignments of HORMA domains from 29 eukaryotic, 20 bacterial and two archaeal HORMA-containing proteins. The proteins are designated by the genus name followed by the name of protein. Uncharacterized proteins are named by genus name followed by “HORMA”. Streptomyces purpurogeneiscleroticus # A0A0M8ZCY4 named as “StreptomycesHORMA”, Streptomyces sp. # A0A2A2Z569 named as “StreptomycesHORMA1”. Alignment was performed using MUSCLE 3.8 and visualized by UniGene.

Figure 3. Multiple sequence alignments of HORMA domains from 29 eukaryotic, 20 bacterial and two archaeal HORMA-containing proteins. The proteins are designated by the genus name followed by the name of protein. Uncharacterized proteins are named by genus name followed by “HORMA”. Streptomyces purpurogeneiscleroticus # A0A0M8ZCY4 named as “StreptomycesHORMA”, Streptomyces sp. # A0A2A2Z569 named as “StreptomycesHORMA1”. Alignment was performed using MUSCLE 3.8 and visualized by UniGene.

Figure 3. (Continued).

Figure 3. (Continued).

Figure 4. Multiple sequence alignments of conserved HORMA domain in some eukaryotic MAD2 protein sequences along with HORMA domain from Chlamydia trachomatis protein. The proteins are designated by the genus name followed by the name of protein. Alignment was performed using MUSCLE 3.8 and visualized by UniGene.

Figure 4. Multiple sequence alignments of conserved HORMA domain in some eukaryotic MAD2 protein sequences along with HORMA domain from Chlamydia trachomatis protein. The proteins are designated by the genus name followed by the name of protein. Alignment was performed using MUSCLE 3.8 and visualized by UniGene.

MAD2 functions as an effective molecule in the mitotic spindle assembly checkpoint. It ensures that all chromosomes are correctly attached to kinetochores before the initiation of anaphase. This function can be performed through switching between two states for MAD2. In the presence of unattached kinetochores, MAD2 protein is in its active state and can arrest cell cycle in metaphase through establishing an inhibitory mitotic checkpoint complex, which binds to the anaphase-promoting complex activator Cdc20 and locks it. In the inactive state of MAD2, it releases Cdc20 and allows the initiation of anaphase [Citation61]. The conserved sequence and architecture of Chlamydia trachomatis HORMA domain with eukaryotic MAD2 proteins introduce other evidence to the uniqueness of cell cycle of Chlamydia trachomatis. This evidence indicates for existence of a similar checkpoint before segregation of Chlamydial chromosome unlike dominant bacterial division where the chromosome separates during replication. This supports the suggestion that the eukaryotic mitotic spindle checkpoint might evolve from a similar cell cycle control mechanism during Chlamydial cell division. Alignment of eukaryotic HOP1, REV7 and MAD2 sequences, along with gene co-expression and protein–protein interaction data, provide evidence that the MAD2 gene may have evolved earlier than the REV7 and HOP1 genes, which the single-celled eukaryotes only possess the MAD2 gene [Citation62].

Altogether, the classification of prokaryotic HORMA-containing proteins into three groups was nearly exposed by phylogenetic trees. The 11 bacterial proteins in the first prokaryotic group, which shared similar HORMA domain sequences and architecture, clustered together in the NJ and ML trees. The second prokaryotic group appeared to be the closest bacterial proteins to the eukaryotic proteins. They shared similar sequences and architecture with eukaryotic HORMA domains and clustered with eukaryotic proteins in both the NJ and ML trees. Interestingly, HORMA domain from Chlamydia trachomatis was the most similar prokaryotic sequence relative to eukaryotic MAD2 proteins in both phylogenetic trees. This relationship is supported by previous comparative genomics that revealed that chlamydiae genomes are most similar to many plant proteins [Citation63]. Furthermore, these similar proteins are obtained from horizontal gene transfer between chlamydiae and primary photosynthetic eukaryotes [Citation64]. This similarity suggests a close common ancestry among plants, cyanobacteria and Chlamydia [Citation65]. 16S ribosomal RNA data has revealed that Chlamydiales and the early eukaryotes (Parachlamydia amoebophila) diverged from the common ancestor about 700 million years ago [Citation66]. The third prokaryotic group was composed of diverse HORMA domain sequences and architecture. Nevertheless, bacterial and archaeal proteins in this group demonstrated close phylogenetic relationships with specific eukaryotic HORMA-containing proteins. Our results suggest evolutionary relations between prokaryotic proteins in the second and third groups and the eukaryotic HORMA-containing proteins. The position of the two archaeal proteins with bacterial and eukaryotic proteins reflect the evolutionary relationships among the three domains; Bacteria, Archaea and Eukarya. These relationships were previously supported by many studies [Citation67Citation69]. Nevertheless, the minimal information about HORMA domain in the sequenced archaeal genomes limits our knowledge about the evolution of HORMA domain from archaeal to eukaryotes. Therefore, the role of HORMA domain in cell division in archaea and bacteria must be explained through experimental evidence.

Conclusion

HORMA domain presents in a wide range of proteins that are involved in regulating the cell cycle. Its role in the cell cycle has been investigated extensively in eukaryotes. However, the role of HORMA domain in cell division in prokaryotes is still unclear. Recently, sequencing of many bacterial and archaeal genomes has revealed that HORMA domain is present in many prokaryotes. By exploiting bioinformatic tools and HORMA-containing protein sequences from NCBI and UniProt databases, this study investigates the phylogenetic relationships between prokaryotic and eukaryotic HORMA-containing proteins, with an emphasis on the structure and architecture of prokaryotic HORMA-containing proteins. Our results indicate that many prokaryotic genomes contain HORMA-containing proteins. One HORMA-containing protein was found in each of bacterial and archaeal genome. Interestingly, prokaryotic HORMA domain varied in sequences and architectures between the different organisms. Furthermore, phylogenomic analysis reveals an evolutionary connection between some bacterial and eukaryotic HORMA-containing proteins. Relationships between Chlamydia and six other bacterial proteins with eukaryotic HORMA-containing proteins provides convincing evidence for this evolutionary link. Further experimental research is necessary to explain the functional diversification of HORMA domain in prokaryotes; this will potentially facilitate the construction of the evolutionary scenario of HORMA-containing genes.

Disclosure statement

No potential conflict of interest was reported by the author.

References

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.