1,658
Views
7
CrossRef citations to date
0
Altmetric
Research Paper

Deep sequencing of the mouse lung transcriptome reveals distinct long non-coding RNAs expression associated with the high virulence of H5N1 avian influenza virus in mice

ORCID Icon, , , , , , , , , , , & show all
Pages 1092-1111 | Received 22 Jan 2018, Accepted 08 May 2018, Published online: 27 Jul 2018

ABSTRACT

Long non-coding RNAs (lncRNAs) play multiple key regulatory roles in various biological processes. However, their function in influenza A virus (IAV) pathogenicity remains largely unexplored. Here, using next generation sequencing, we systemically compared the whole-transcriptome response of the mouse lung infected with either the highly pathogenic (A/Chicken/Jiangsu/k0402/2010, CK10) or the nonpathogenic (A/Goose/Jiangsu/k0403/2010, GS10) H5N1 virus. A total of 126 significantly differentially expressed (SDE) lncRNAs from three replicates were identified to be associated with the high virulence of CK10, whereas 94 SDE lncRNAs were related with GS10. Functional category analysis suggested that the SDE lncRNAs-coexpressed mRNAs regulated by CK10 were highly related with aberrant and uncontrolled inflammatory responses. Further canonical pathway analysis also confirmed that these targets were highly enriched for inflammatory-related pathways. Moreover, 9 lncRNAs and 17 lncRNAs-coexpressed mRNAs associated with a large number of targeted genes were successfully verified by qRT-PCR. One targeted lncRNA (NONMMUT011061) that was markedly activated and correlated with a great number of mRNAs was selected for further in-depth analysis, including predication of transcription factors, potential interacting proteins, genomic location, coding ability and construction of the secondary structure. More importantly, NONMMUT011061 was also distinctively stimulated during the highly pathogenic H5N8 virus infection in mice, suggesting a potential universal role of NONMMUT011061 in the pathogenesis of different H5 IAV. Altogether, these results provide a subset of lncRNAs that might play important roles in the pathogenesis of influenza virus and add the lncRNAs to the vast repertoire of host factors utilized by IAV for infection and persistence.

Introduction

The primary natural host reservoir of highly pathogenic avian influenza (HPAI) H5N1 virus is wild waterfowl. However, these viruses can occasionally infect other host species, including wild birds, terrestrial poultry, various mammals and even human beings. As of March 2 2018, 860 laboratory-confirmed human cases of H5N1 infection, including 454 deaths, have been reported (http://www.who.int). Although pathogenesis of the H5N1 subtype has been extensively studied, molecular events leading to disease are still obscure. Using transcriptomics or proteomics profiling, accumulated studies have demonstrated that an aberrant immune response contributes to the development of typical pneumonia and subsequent acute respiratory distress syndrome (ARDS), which ultimately lead to death [Citation1Citation4]. However, the underlying host molecular mechanisms causing the aberrant host response are largely unknown.

Traditional studies examining the host transcriptional response to pathogen infection mainly focus on protein-coding genes. However, the majority of the mammalian genome is transcribed into non-coding RNAs (ncRNAs), including the small ncRNAs (< 200 bp) and long ncRNAs (lncRNAs) (> 200 bp)[Citation5]. LncRNAs are the major regulators of gene expression and are involved in multiple biological processes, including genomic imprinting, embryonic development, cell differentiation, tumor metastasis and regulation of cell cycle [Citation6Citation10]. LncRNAs regulate gene expression at the epigenetic, transcriptional and post-transcriptional levels in diverse biological contexts[Citation6]. In addition, a variety of molecular mechanisms were used as major regulators of these function, such as modulating histone modifications[Citation11], interfering the transcription [Citation12,Citation13], regulating patterns of alternative splicing [Citation14,Citation15],generating small RNAs[Citation16], and modulating protein activation and localization[Citation17].

Although accumulated studies have identified the functional importance of lncRNAs in innate immune response [Citation13,Citation18Citation24] and host-pathogen interactions [Citation25Citation29], the specific function of lncRNAs in innate immune response to influenza A virus (IAV) infection remains largely unexplored. As far as we known, only a few lncRNAs, such as NRAV (negative regulator of antiviral response)[Citation11], NEAT1 (nuclear enriched abundant transcript 1)[Citation30], Bst2/BISPR (bone marrow stromal cell antigen 2 IFN-stimulated positive regulator) [Citation31,Citation32] and VIN (virus inducible lincRNA) [Citation33] have been demonstrated to be associated with the innate immunity against influenza virus and viral pathogenesis. Therefore, studies are still needed to better understand the potential roles of other lncRNAs in the pathogenesis of influenza virus.

Our previous study has characterized two HPAI H5N1 viruses, CK10 and GS10, that have similar genetic background but differ greatly in pathogenicity in mice[Citation34]. CK10 is a highly pathogenic strain, as evidenced by the extremely low lethal dose in mice (MLD50: 0.33 log10 50% embryo infectious dose, EID50), whereas GS10 is low pathogenic (MLD50 > 6.32 log10 EID50). Transcriptomics analysis showed that CK10 elicited a more potent innate immune response in the mouse lung compared to GS10[Citation35]. Further quantitative proteomics suggested that an early intense host response associated with the lung injury to CK10 may contribute to the high virulence of this virus in mice[Citation36]. In this study, to evaluate the role of lncRNAs in the pathogenesis of influenza virus of the H5N1 subtype, we systematically compared the expression profile of lncRNAs in the lung of mice infected with CK10 or GS10 using the RNA deep-sequencing technology. Our results demonstrated that these two viruses distinctively regulated the expression of numerous lncRNAs, suggesting that these lncRNAs may be a new class of regulatory molecules involving in determining the outcome of H5N1 virus infection.

Materials and methods

Ethics statement

This study was carried out in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the Ministry of Science and Technology of the People’s Republic of China. The protocols for animal experiments were approved by the Jiangsu Administrative Committee for Laboratory Animals (approval number: SYXK-SU-2007–0005), and complied with the guidelines of Jiangsu laboratory animal welfare and ethics of Jiangsu Administrative Committee of Laboratory Animals. All experiments involving live viruses and animals were housed in negative-pressure isolators with HEPA filters in bio-safety level 3 (BSL3) animal facilities in accordance with the institutional bio-safety manual.

Viruses

A/Chicken/Jiangsu/k0402/2010 (H5N1) (CK10) was isolated from a dead chicken in 2010, and A/Goose/Jiangsu/k0403/2010 (H5N1) (GS10) was isolated from an apparently healthy goose in live poultry market[Citation37]. CK10 is highly pathogenic in mice (MLD50 = 0.33 log10 EID50), whereas GS10 is low pathogenic (MLD50 > 6.32 log10 EID50) in this animal model[Citation34]. The entire genome of the two viruses differed by 30 amino acid (aa) distributed throughout eight genes. [Citation34] Highly pathogenic strain of H5N8 subtype, A/goose/Jiangsu/QD5/2014 (QD5), was isolated from swab samples from apparently healthy geese at a wholesale live-bird market in 2014[Citation38]. QD5 is also highly pathogenic in mice, with a MLD50 of 2.83 log10 EID50[Citation38]. Viruses were plaque-purified three times in MDCK cells and propagated once in specific-pathogen-free (SPF) embryonated chicken eggs.

Mouse experiment

To collect lung samples for deep-sequencing, groups of fourteen 6-week-old female BALB/c mice were infected intranasally (i.n.) with 105.0 EID50 of CK10 or GS10, respectively. Another group of mice were i.n. inoculated with 50 μl of PBS as mock control. At day 1, 3 and 5 post inoculations (p.i.), three mice from each group were euthanized and the lungs were collected for lncRNAs quantification, cytokine profiling, histopathological examination and virus load measurement. The remaining five mice of each group were monitored daily for weight loss and survival for 14 days.

To identify the potential role of the selected lncRNA in IAV pathogenesis, groups of nine 6-week-old female BALB/c mice were infected i.n. with 105.0 EID50 of CK10 (H5N1), QD5 (H5N8) or GS10 (H5N1), respectively. At day 1, 2 and 3 p.i., three mice from each group were euthanized and the lungs were collected for determination of lncRNAs expression and viral replication.

Poly (A)-independent and strand-specific RNA-sequencing

The lung samples were homogenized in TRIzol (Invitrogen, CA, US) using the MagNA Lyser system (Roche) according to the manufacturer’s instructions. RNA was further purified using the miRNeasy minikit (Qiagen) based on the manufacturer’s instructions. The purity of the RNA samples was verified spectroscopically, and RNA quality was assessed using Agilent Bioanalyzer 2100[Citation39]. Only samples with an RNA integrity number greater than 8 were used. rRNA was depleted from 1 mg of total RNA using RiboZero (Illumina). cDNA libraries were prepared from the remaining RNA, without poly (A) selection, using the TruSeq Stranded RNA LT kit (Illumina) following the TruSeq stranded total RNA sample preparation guide provided by the vendor (RS-122-9007DOC). The libraries underwent cluster generation using TruSeq PE Cluster Kit v3-cBot-HS and 100 cycles of paired-end sequencing using TruSeq SBS Kit v3-HS (Illumina) and an Illumina HiSeq 2000 sequencer as described previously [Citation39,Citation40].

Analysis of RNA-sequencing data

After sequencing, the raw reads that were generated by sequencers, were saved in the fastq format. To obtain reliable clean reads, the dirty raw reads were filtered according to six criteria based on fastx software(version:0.0.13): (i) reads with sequence adaptors were removed; (ii) reads with more than 5% ‘N’ bases were removed; (iii) reads with length < 20 bases were removed; (iv) 3ʹ end of Q (Q = −10 log error ratio) less than 10 of the base quality were removed; (v) low-quality reads, in which less than 50% of the quality were > 20 bases were removed; (vi) and ribosomal RNA sequences that were obtained from the ribosomal RNA database SILVA (http://www.arb-silva.de/) by the software SOAP v2.2.0 [Citation41] were removed based on an allowance of no more than three mismatched bases. All subsequent analyses were based on clean reads. Tophat v2.0.9 (http://tophat.cbcb.umd.edu/) spliced mapping was used to map the cleaned reads to the mouse mm10 reference genome with two mismatches.

The clean reads that were uniquely mapped to lncRNAs were used to calculate the expression levels. After genome mapping, Cufflinks v2.1.1 (http://cufflinks.cbcb.umd.edu/) was run with a reference annotation to generate Fragments Per Kilobase of exon model per Million mapped reads (FPKM) values for known gene models. Differentially expressed genes were identified using Cuffdiff, implemented in Cufflinks[Citation42]. The relative expression levels of lncRNAs in the CK10 or GS10 and mock control groups were measured as the number of uniquely mapped FPKM. The formula was defined as follows: FPKM = 109× C/(NL × 10−3), where C was the number of reads that uniquely mapped to the given transcript, N was the number of reads that uniquely mapped to all transcripts, and L was the total length of the given transcript. The FPKM method eliminates the influences of different transcript lengths and sequencing discrepancies on the calculation of expression. Therefore, the FPKM value was directly used to compare the differences in lncRNA expression between the samples. The fold change from the normalized expression was calculated as FPKM CK10 or GS10/FPKM mock to assess the levels. Genes satisfying the condition of FPKM CK10 or GS10 value/FPKM mock value > 1.5 were defined as those up-regulated in virus-infected group, whereas genes satisfying the condition of FPKM CK10 or GS10 value/FPKM mock value < 0.67 were defined as those down-regulated in virus-infected group. To compensate for false-positive findings at each significance threshold, the p-value significance threshold in multiple tests was further set by the false discovery rate (FDR)[Citation43]. Therefore, we identified lncRNAs that were differentially regulated between the CK10, GS10 and mock control groups based on the following criteria: p < 0.05, FDR < 0.05 and absolute value of the fold change > 1.5.

Identification and expression analysis of lncRNAs

Cufflinks was used to assemble reads into transcripts. Novel transcripts were obtained after comparing all the assembled transcript isoforms with the mouse known protein coding transcripts using Cuffcompare[Citation42]. Putative lncRNAs were defined as novel transcripts set through the following filters: length ≥ 200 bp; number of exons ≥ 2; ORF ≤ 300 bp; no or weak protein coding ability (CPC score < 0, CNCI score < 0 and no significant similarity with Pfam database)[Citation44]. Finally, to generate a unique set of lncRNAs, we used Cuffcompare to integrate the RNA-seq derived lincRNAs with the known lncRNAs previously annotated by NONCODE V4.0 (http://www.noncode.org/). Differentially expressed lncRNAs were selected for target prediction. The genes transcribed within a 10 kbp window upstream or downstream of lncRNAs were considered as cis-acting target genes. The trans-acting target genes were predicted using RNAplex software[Citation45].

Real-time PCR analysis of lncRNAs and mRNAs expression

Quantitative real-time PCR (qRT-PCR) was used to validate the expression of lncRNAs and mRNAs identified by RNA-sequencing analysis. Briefly, total RNA was isolated from the tissues using the TRIzol reagent and treated with DNase I (Invitrogen). A total of 1 μg of RNA per sample was reverse transcribed into cDNA using 400 U RevertAid Premium Reverse Transcriptase and 100 μM random primers in the presence of RNase inhibitor at 50°C for 30 min. The reaction mixture contained cDNA, 200 nM of each primer and 10 μl of 2 × SYBR Green PCR Master Mix (Takara, Shiga, Japan). PCR reactions were performed in triplicate using the ABI Prism 7300 system (Applied Biosystems, CA, US) with the following cycle profile: 1 cycle at 50°C for 2 min and 1 cycle at 95°C for 5 s followed by 40 cycles at 95°C for 5 s and 60°C for 31 s. 1 cycle for melting curve for all reactions was added to verify product specificity. Expression value of each gene, relative to the GAPDH, was calculated using the equation 2−ΔΔCt method.

Construction of the LncRNA/mRNA Coexpression Network

To construct the lncRNA/mRNA coexpression network, we calculated the Pearson correlation coefficient and R value to evaluate lncRNA-mRNA correlation[Citation46]. The network construction procedure includes: (1) Preprocess data: the same mRNAs with different transcripts taking the median value represent the gene expression values, without special treatment of lncRNAs expression value. (2) Screen data: remove the subset of data according to the lists showing the differential expression of lncRNAs and mRNAs. (3) Calculate the Pearson correlation coefficient and use R value to calculate the correlation coefficient between lncRNAs and mRNAs. (4) Screen by Pearson correlation coefficient: select the Pearson correlation coefficient ≥ 0.99 or ≤ – 0.99 as the meaningful value and draw the lncRNA/mRNA coexpression network by using the cytoscape program.

Accession numbers

All primary RNA-sequencing data have been deposited in the Gene Expression Omnibus (GEO) database (http://www.ncbi.nlm.nih.gov/geo/info/linking.html) under accession number GSE100522. The genomic sequences of the CK10 and GS10 viruses are available in GenBank under accession numbers JQ638673 to JQ638688.

Statistical analysis

Viral loads are expressed as the mean ± standard deviation (SD) from three individuals. Cytokine levels and lncRNA expression levels are expressed as the mean fold change ± standard error (SE) of the mean from three individuals. Statistical analyzes were performed using Independent-Sample T test.

Results

Pathogenicity of CK10 and GS10 in mice

To systematically compare the pathogenicity of this pair of viruses, groups of fourteen mice were infected i.n. with 105.0 EID50 of CK10 or GS10. At day 1, 3 and 5 p.i., three mice from each group were euthanized and the lungs were collected for determination of transcriptional cytokine response and viral replication. The remaining five mice of each group were monitored daily for weight loss and survival rate over the course of infection. As shown in ), mice infected with CK10 virus showed more severe weight loss compared with that of GS10 virus. In addition, no death was found in GS10-infected mice within a 14-day observation period, while all the CK10-infected mice succumbing to death by day 8 p.i. ()). Moreover, CK10 replicated at significantly higher titers in mouse lung than GS10 at all-time points ()). According to our previous study [Citation35] and the established knowledge about the role of Cxcl10, Cxcl11 and Il6 in inflammation, we determined the expression of these cytokines and found that CK10 stimulated significantly higher expression levels of these genes compared to GS10 in the mouse lung (). Mean while, we also compared the mouse lung histopathology induced by these pair of viruses. As shown in , we can see that early at day 1 p.i., the highly pathogenic CK10 virus induced a more severe lung injury than the non-virulent GS10 virus, represented by pulmonary alveolar hemorrhage, bronchial mucosa injury and accompanied by inflammatory cells infiltration around the bronchus ()). However, no obvious histopathology was observed in the GS10 virus-infected mouse lung ()). Altogether, these results demonstrated that CK10 virus is more virulent than GS10 virus in mice and is associated with higher lung titer and proinflammatory cytokine expressions.

Figure 1. Pathogenicity of CK10 and GS10 in mice. (A) Mean weight loss of mice infected with 105.0 EID50 of CK10 and GS10 viruses (= 5). Mice were humanely killed when they lost ≥ 25% of their initial body weight. Error bar represents stand deviation (SD). (B) Survival rate of mice infected with the indicated viruses (= 5). (C) Viral replication in the mouse lung. Values shown are the mean ± SD of the results from five individuals (*< 0.05). Asterisk indicates significant difference between the CK10 and the GS10 virus. (D) – (F) Cytokines and chemokines expression in the mouse lung. Levels of cytokine or chemokine were expressed as the mean fold change ± standard error (SE) of the mean. *< 0.05 and **< 0.01, asterisk or double asterisk indicates significant difference between CK10 and GS10. (G) and (H) Representative histopathological changes in H&E (hematoxylin and eosin)-stained lung tissues on day 1 p.i.. (G) CK10 virus-infected mouse lung. Pulmonary alveolar hemorrhage (shown as asterisk), bronchial mucosa injury and accompany by inflammatory cells infiltration around the bronchus (shown as black arrow). (H) GS10 virus-infected mouse lung. No obvious histopathology was observed in the GS10 virus-infected mouse lung.

Figure 1. Pathogenicity of CK10 and GS10 in mice. (A) Mean weight loss of mice infected with 105.0 EID50 of CK10 and GS10 viruses (n = 5). Mice were humanely killed when they lost ≥ 25% of their initial body weight. Error bar represents stand deviation (SD). (B) Survival rate of mice infected with the indicated viruses (n = 5). (C) Viral replication in the mouse lung. Values shown are the mean ± SD of the results from five individuals (*p < 0.05). Asterisk indicates significant difference between the CK10 and the GS10 virus. (D) – (F) Cytokines and chemokines expression in the mouse lung. Levels of cytokine or chemokine were expressed as the mean fold change ± standard error (SE) of the mean. *p < 0.05 and **p < 0.01, asterisk or double asterisk indicates significant difference between CK10 and GS10. (G) and (H) Representative histopathological changes in H&E (hematoxylin and eosin)-stained lung tissues on day 1 p.i.. (G) CK10 virus-infected mouse lung. Pulmonary alveolar hemorrhage (shown as asterisk), bronchial mucosa injury and accompany by inflammatory cells infiltration around the bronchus (shown as black arrow). (H) GS10 virus-infected mouse lung. No obvious histopathology was observed in the GS10 virus-infected mouse lung.

Whole-transcriptome analysis of mouse lung infected with CK10 and GS10

Since the pathogenicity study showed that the two viruses exhibited a remarkable difference in replication, cytokine response and histological changes early at day 1 p.i. (), we then performed a whole-transcriptome analysis of the collected lung samples at this time point using next-generation sequencing (NGS). Mapped ratio of region distribution of all the tested reads, including gene, coding region, splicing, intron, non-coding region, were shown in ). Among them, a total of 1,990 novel lncRNAs covering three major types of lncRNAs (Intergenic, Intron and Antisense) were identified. In addition, CK10 virus induced expression of 48,759 known lncRNAs and 19,491 mRNAs, whereas the corresponding numbers for GS10 virus were 49,411 and 19,843, respectively. Among these RNAs, 126 lncRNAs regulated by CK10 displayed significantly altered expression levels compared with the mock control, including 103 up-regulated lncRNAs (fold change: CK10 vs. mock > 1.5, p < 0.05, FDR< 0.05) and 23 down-regulated lncRNAs (fold change CK10 vs. mock > −1.5, p < 0.05, FDR< 0.05) ()). As for the GS10 group, a total of 94 lncRNAs were significantly differentially expressed (SDE), with 80 up-regulated and 14 down-regulated ()). Moreover, CK10 induced 223 SDE mRNAs (fold change CK10 vs. mock > 1.5 or −1.5, p < 0.05, FDR< 0.05), while only 130 SDE mRNAs were screened for GS10 ()). Interestingly, among these SDE RNAs, 64 lncRNAs (40.5%) and 119 mRNAs (50.9%) were shared by the two viruses (). Therefore, these results showed that early at day 1 p.i., CK10 induced a larger number of SDE RNAs than GS10, and there was a partial of SDE lncRNAs and mRNAs overlapped between these two groups.

Figure 2. Analysis of the lncRNA data. (A) Region distribution of the tested reads. Results shown are the region distribution of the tested reads, including gene, coding region, splicing, intron, non-coding region and intergenic. Among them, 5-UTR, 3-UTR, non-coding RNA regions are covered in non-coding region. (B) Numbers of significantly differentially expressed (SDE) lncRNAs in the process of infection with CK10 or GS10 relative to mock (< 0.05, fold change > 1.5 or < 0.67). (C) Numbers of SDE mRNAs in the process of infection with CK10 or GS10 relative to mock (< 0.05, fold change > 1.5 or < 0.67). (D) Venn diagram showing the distribution of SDE lncRNAs in the process of infection with CK10 or GS10. (E) Venn diagram showing the distribution of SDE mRNAs in the process of infection with CK10 or GS10 viruses.

Figure 2. Analysis of the lncRNA data. (A) Region distribution of the tested reads. Results shown are the region distribution of the tested reads, including gene, coding region, splicing, intron, non-coding region and intergenic. Among them, 5-UTR, 3-UTR, non-coding RNA regions are covered in non-coding region. (B) Numbers of significantly differentially expressed (SDE) lncRNAs in the process of infection with CK10 or GS10 relative to mock (p < 0.05, fold change > 1.5 or < 0.67). (C) Numbers of SDE mRNAs in the process of infection with CK10 or GS10 relative to mock (p < 0.05, fold change > 1.5 or < 0.67). (D) Venn diagram showing the distribution of SDE lncRNAs in the process of infection with CK10 or GS10. (E) Venn diagram showing the distribution of SDE mRNAs in the process of infection with CK10 or GS10 viruses.

lncRNA/mRNA coexpression network

Although accumulating studies have attempted to reveal the functional significance of lncRNAs, the biological roles of most lncRNAs are still unknown. Biological processes and cellular regulation networks are very complex, involving the interactions of various molecules, such as proteins, RNAs, and DNAs. Our RNA-sequencing data not only provided the information of lncRNAs expression, but also provided mRNAs expression patterns in the virus or mock-inoculated mouse lung. We thus constructed an lncRNA/mRNA coexpression network based on the SDE RNAs between CK10 or GS10 virus and mock control and investigated the potential interaction between mRNAs and lncRNAs of each virus-infected group. The coexpression network for the CK10 group was composed of 56 differentially expressed lncRNAs, 149 differentially expressed mRNAs and 205 network nodes (). The network showed that several lncRNAs (NONMMUT036704, NONMMUT011061, NONMMUT053065, NONMMUT058733 and NONMMUT061245) correlated with a great number of targeted mRNAs, and vice versa (). This coexpression network also indicated that one lncRNA (NONMMUT036704) could target 22 network nodes and one coding gene (Cxcl10) could target 25 network nodes. In addition, the second ranked lncRNA (NONMMUT011061) and mRNA (Cxcl11), could both target 21 network node. As for the GS10 group, the coexpression network was composed of 65 lncRNAs, 102 mRNAs and 167 network nodes (Figure S1). However, as shown in Figure S1, in the GS10-infected mouse lung, the lncRNAs were not highly correlated with the mRNAs and vice versa. Taken together, these results suggested the closer inter-regulation of lncRNAs and mRNAs in the early stage of CK10 virus infection compared with GS10 virus.

Figure 3. The lncRNA/mRNA coexpression network constructed using the cytoscape program for the CK10 group. The lncRNAs and mRNAs with Pearson correlation coefficients ≥ 0.99 or ≤ −0.99 were selected to draw the regulatory network using the cytoscape program. In gene-coexpression networks, each gene corresponds to a node. Two genes are connected by an edge, indicating a strong correlation. Within the network analysis, a degree is the simplest, most important measure of the centrality of a gene within a network and determines the relative importance. A degree is defined as the number of directly linked neighbors. In the network, the node size indicates the node degrees and the number represents the number of directly linked neighbours that are associated with each color. Therefore, the larger the node size suggested the targeted lncRNA or mRNA could directly linked with more neighboring genes.

Figure 3. The lncRNA/mRNA coexpression network constructed using the cytoscape program for the CK10 group. The lncRNAs and mRNAs with Pearson correlation coefficients ≥ 0.99 or ≤ −0.99 were selected to draw the regulatory network using the cytoscape program. In gene-coexpression networks, each gene corresponds to a node. Two genes are connected by an edge, indicating a strong correlation. Within the network analysis, a degree is the simplest, most important measure of the centrality of a gene within a network and determines the relative importance. A degree is defined as the number of directly linked neighbors. In the network, the node size indicates the node degrees and the number represents the number of directly linked neighbours that are associated with each color. Therefore, the larger the node size suggested the targeted lncRNA or mRNA could directly linked with more neighboring genes.

Biofunction analysis of the lncRNAs-coexpressed mRNAs

To predict the roles of the selected differentially expressed lncRNAs in response to CK10 and GS10, the lncRNAs-coexpressed mRNAs were then imported into the Ingenuity Pathways Analysis (IPA) software (Ingenuity Systems, Redwood, CA, US) for bio-function category construction. As a result, these mRNAs mainly clustered into some important functional groups, such as ‘Immunological Disease’, ‘Organismal Injury and Abnormalities’, ‘Antiviral Responses’ and ‘Inflammatory Response’ ()). Among of them, ‘Antiviral Responses’ and ‘Inflammatory Response’ were intensely activated (activation z-score above 2.0) both by the CK10 virus and GS10, however, more genes involved in these functions in the CK10 virus-infected group. Particularly, the lncRNAs-coexpressed mRNAs associated with ‘Inflammatory Disease’, ‘Hematological System Development and Function’, ‘Tissue Morphology’ and ‘Cellular Movement and Immune Cell Trafficking’ were more than 2-fold induced (according to the -log10 P-value) in the lungs in CK10-infected mice than that of GS10 (which indicated as ‘*’) ()). Moreover, a number of genes were more upregulated in these functions by CK10 virus than that of GS10 virus ()). Therefore, these results suggested that the SDE lncRNAs-coexpressed mRNAs activated by CK10 were highly related with the aberrant and uncontrolled inflammatory related responses.

Figure 4. Disease and bio-functional categories of the lncRNAs-coexpressed mRNAs induced by CK10 or GS10 by IPA analysis. (A) Important bio-functions associated with the lncRNAs-coexpressed mRNAs. ‘*’, indicates the P value of the CK10 virus-infected group was above 2-fold than that of GS10. (B) Crucial bio-functions related to inflammatory responses that were highly induced by CK10.

Figure 4. Disease and bio-functional categories of the lncRNAs-coexpressed mRNAs induced by CK10 or GS10 by IPA analysis. (A) Important bio-functions associated with the lncRNAs-coexpressed mRNAs. ‘*’, indicates the P value of the CK10 virus-infected group was above 2-fold than that of GS10. (B) Crucial bio-functions related to inflammatory responses that were highly induced by CK10.

Canonical pathway analysis of the lncRNAs-coexpressed mRNAs

To further gain insight into the function of the lncRNAs-coexpressed mRNAs, we next compared the top 5 canonical pathways (generated by IPA) associated with these SDE mRNAs. The results showed that the top 5 canonical pathways (according to the -log10 P-value) for CK10 virus were ‘Activation of IRF by Cytosolic Pattern Recognition Receptors’, ‘Agranulocyte Adhesion and Diapedesis’, ‘Interferon Signaling’, ‘Antigen Presentation Pathway’ and ‘Granulocyte Adhesion and Diapedesis’ was hyper-induced in CK10-infected mice ()). Among of them, ‘Activation of IRF by Cytosolic Pattern Recognition Receptors’ and ‘Interferon Signaling’ pathways were highly activated (activation z-score above 2.0). In contrast, the top 5 canonical pathways of GS10 were ‘Interferon Signaling’, ‘Activation of IRF by Cytosolic Pattern Recognition Receptors’, ‘Agranulocyte Adhesion and Diapedesisy’, ‘Role of RIG1-like Receptors in Antiviral Innate Immunity’ and ‘Antigen Presentation Pathway’, respectively ()). Therefore, there were highly overlapped between the SDE lncRNAs-coexpressed mRNAs-mediated top canonical pathway in CK10 and GS10 virus-infected mouse lung. However, further analysis demonstrated that mice infected with the CK10 virus exhibited an overall stronger expression of lncRNAs-coexpressed mRNA associated with these pathways, including ‘Activation of IRF by Cytosolic Pattern Recognition Receptors’ ()),

Figure 5. Canonical pathways of the lncRNAs-coexpressed mRNAs stimulated by CK10 or GS10 by IPA analysis. (A) The top canonical pathways associated with the lncRNAs-coexpressed mRNAs. (B) The expression profiles of the lncRNAs-coexpressed mRNAs related to the ‘Activation of IRF by Cytosolic Pattern Recognition Receptors’ top 1 pathway for the CK10 group. (C) The expression profiles of the lncRNAs-coexpressed mRNAs related to the ‘Agranulocyte Adhesion and Diapedesis’ top 2 pathway for the CK10 group. (D) The expression profiles of the lncRNAs-coexpressed mRNAs related to the ‘Interferon Signaling’ top 3 pathway for the CK10 group. (E) The expression profiles of the lncRNAs-coexpressed mRNAs related to the ‘Antigen Presentation Pathway’ top 4 pathway for the CK10 group. (F) The detail presentation of top 1 canonical pathway ‘Activation of IRF by Cytosolic Pattern Recognition Receptors’ induced by CK10.

Figure 5. Canonical pathways of the lncRNAs-coexpressed mRNAs stimulated by CK10 or GS10 by IPA analysis. (A) The top canonical pathways associated with the lncRNAs-coexpressed mRNAs. (B) The expression profiles of the lncRNAs-coexpressed mRNAs related to the ‘Activation of IRF by Cytosolic Pattern Recognition Receptors’ top 1 pathway for the CK10 group. (C) The expression profiles of the lncRNAs-coexpressed mRNAs related to the ‘Agranulocyte Adhesion and Diapedesis’ top 2 pathway for the CK10 group. (D) The expression profiles of the lncRNAs-coexpressed mRNAs related to the ‘Interferon Signaling’ top 3 pathway for the CK10 group. (E) The expression profiles of the lncRNAs-coexpressed mRNAs related to the ‘Antigen Presentation Pathway’ top 4 pathway for the CK10 group. (F) The detail presentation of top 1 canonical pathway ‘Activation of IRF by Cytosolic Pattern Recognition Receptors’ induced by CK10.

‘Agranulocyte Adhesion and Diapedesis’ ()), ‘Interferon Signaling’ ()) and ‘Antigen Presentation Pathway’ ()). Therefore, these results indicate the SDE lncRNAs-coexpressed mRNAs induced by CK10 and GS10 shared some similarities in association with the inflammatory response related pathways; however, CK10-regulated mRNAs were more upregulated in these pathways.

qRT-PCR validation of the selected lncRNAs and lncrnas-coexpressed mRNA

To verify the RNA-sequencing data, a subset of RNAs in replicate samples was examined using qRT-PCR. Two categories of RNA were selected: 9 lncRNAs that were highly coexpressed with the annotated protein-coding genes and 17 mRNAs that were tightly related with the targeted lncRNAs. Pearson correlation analysis was applied to measure the significance of the correlations. As a result, a good correlation between RNA deep-sequencing data and qRT-PCR results on the set of independent samples with multiple replicates was observed (Figure 6 and Figure S2). The analysis results revealed a highly statistically significant (p < 0.001) correlation between the qRT-PCR and RNA-sequence data and the correlation coefficient were all above 0.8 (Figure S2). QRT-PCR data also showed that the expression of NONMMUT036704, NONMMUT011061, NONMMUT058733 and NONMMUT053310 in CK10-infected mice was significantly higher than that in GS10-infected animals (Figure 6(A), Supplementary information 1 listed the sequence information for NONMMUT011061). In addition, some mRNAs that highly correlated with NONMMUT011061, including Cxcl11, Cxcl5, Ccl2, Saa3, Irg1, Ccl4, Il6, Ccl7 and Cxcl1, were also activated to significantly higher levels compared to those of the GS10 group ( and ).

Figure 6. Validation of the RNA-sequencing data. (A) Alteration of the expression of the selected lncRNAs in CK10- or GS10-infected mouse lungs at 24 h p.i. was analyzed by qRT-PCR. Values shown are the mean fold change ± SE of the results from three individuals (*p < 0.05). Asterisk indicates a significant difference between CK10 and GS10. (B) RNA-sequencing results of the targeted lncRNAs were shown as control. (C) Alteration of the expression of the selected mRNAs in CK10- or GS10-infected mouse lungs at 24 h p.i. was analyzed by qRT-PCR. Values shown are the mean ± SD of the results from three individuals (*p < 0.05, **p < 0.01). Asterisk indicates a significant difference between CK10 and GS10. (D) RNA-sequencing results of the selected mRNAs were shown for comparison.

Figure 6. Validation of the RNA-sequencing data. (A) Alteration of the expression of the selected lncRNAs in CK10- or GS10-infected mouse lungs at 24 h p.i. was analyzed by qRT-PCR. Values shown are the mean fold change ± SE of the results from three individuals (*p < 0.05). Asterisk indicates a significant difference between CK10 and GS10. (B) RNA-sequencing results of the targeted lncRNAs were shown as control. (C) Alteration of the expression of the selected mRNAs in CK10- or GS10-infected mouse lungs at 24 h p.i. was analyzed by qRT-PCR. Values shown are the mean ± SD of the results from three individuals (*p < 0.05, **p < 0.01). Asterisk indicates a significant difference between CK10 and GS10. (D) RNA-sequencing results of the selected mRNAs were shown for comparison.

Potential relevance of NONMMUT011061 with virulence of H5 IAV in mice

Our coexpression network indicated that one NONMMUT036704 could target 22 network nodes and lncRNA NONMMUT011061 could target 21 network nodes (). Moreover, a number of inflammatory genes which were highly correlated with NONMMUT011061 were highly activated during the highly pathogenic virus infection ( and )). Therefore, to reveal the potential role of NONMMUT011061 and NONMMUT036704 in the pathogenesis of H5 IAV, groups of mice were inoculated with different H5 viruses (H5N1: CK10 and GS10; H5N8: QD5) [Citation38] and the expression pattern of NONMMUT011061 and NONMMUT036704 in the mouse lung were determined at day 1, 2 and 3 p.i. The results showed that the expression of NONMMUT011061 and NONMMUT036704 in CK10-infected mice was significantly higher than that of GS10 at several time points ( and )). Meanwhile, CK10 replicated to significantly higher levels than GS10 at all three time points ()). More importantly, the highly pathogenic H5N8 virus QD5 also stimulated significantly higher expression level of NONMMUT011061 than that of GS10 at day 2 and 3 p.i. However, although significantly higher virus load was also observed in QD5-infected mice compared to that in GS10-infected mice at day 3 p.i., NONMMUT036704 is not significantly stimulated by the highly pathogenic H5N8 virus in mice ()). Taken together, these results showed that the NONMMUT011061 was distinctively stimulated during the highly pathogenic H5N1 and H5N8 virus infection in mice, suggesting a potential role of NONMMUT011061 in the pathogenesis of different H5 IAV.

Figure 7. Potential relevance of NONMMUT011061 with virulence of H5 IAV in mice. Groups of nine mice were infected with CK10, GS10 or QD5 (a H5N8 strain that is also highly pathogenic in mice). (A) The expression pattern of NONMMUT011061 was determined in the mouse lung at day 1, 2 and 3 p.i. Values are expressed as the mean fold change ± SE of the mean from three individuals. (B) Virus titers in the lungs. Values shown are the mean ± SD of the results from three individuals (*p < 0.05, **p < 0.01). Asterisk indicates a significant difference between highly pathogenic viruses (CK10 or QD5) and GS10. (C) The expression pattern of NONMMUT036704 was determined in the mouse lung at day 1, 2 and 3 p.i. Values are expressed as the mean fold change ± SE of the mean from three individuals.

Figure 7. Potential relevance of NONMMUT011061 with virulence of H5 IAV in mice. Groups of nine mice were infected with CK10, GS10 or QD5 (a H5N8 strain that is also highly pathogenic in mice). (A) The expression pattern of NONMMUT011061 was determined in the mouse lung at day 1, 2 and 3 p.i. Values are expressed as the mean fold change ± SE of the mean from three individuals. (B) Virus titers in the lungs. Values shown are the mean ± SD of the results from three individuals (*p < 0.05, **p < 0.01). Asterisk indicates a significant difference between highly pathogenic viruses (CK10 or QD5) and GS10. (C) The expression pattern of NONMMUT036704 was determined in the mouse lung at day 1, 2 and 3 p.i. Values are expressed as the mean fold change ± SE of the mean from three individuals.

Bioinformatics analysis of NONMMUT011061

Since NONMMUT011061 was also distinctively stimulated during the highly pathogenic H5N1 (CK10) and H5N8 virus (QD5) infection in mice, suggesting a potential role of NONMMUT011061 in the pathogenesis of different H5 IAV. Therefore, the bio-functions of this lncRNA were further analyzed using the well-established bioinformatics tools. Transcription factors (TFs) were recognized as important components of signaling cascades controlling all types of normal cellular processes as well as response to external stimulus[Citation47]. Here, TRANSFAC (http://www.gene-regulation.com/index2.html) was used to predict the potential TFs of NONMMUT011061. The results showed that NONMMUT011061 can combine with 74 TFs in total, including interferon regulated factor (IRF), macrophage-activating factor (MAF), heat shock factor (HSF), GATA sequence and myelocytomatosis oncogene (Myc) (). Through University of California Santa Cruz (UCSC) Genome Browser (http://genome-asia.ucsc.edu/index.html), the genomic location of NONMMUT011061 was predicted to be on chromosome 11qB5 and was assumed to be unable to encode genes (predicated through PhyloCSF tracks: https://data.broadinstitute.org/compbio1/PhyloCSFtracks/trackHub/hub.txt) ()). In silico prediction of the secondary structure of lncRNA is another useful method to define putative functions of non-coding transcripts, based on the widely-held assumption that highly-folded structures affect functionality through binding interactions with proteins/nucleotides[Citation48]. Using RNAfold minimum free energy estimations based on RNAfold webserver (http://rna.tbi.univie.ac.at/cgi-bin/RNAWebSuite/RNAfold.cgi), a highly-folded secondary structure with several hairpin loops of NONMMUT011061 was identified ()). We also conducted catRAPID analysis (http://service.tartaglialab.com) to predict the potential interacting proteins of NONMMUT011061. As a result, we found a strong interaction between NONMMUT011061 and several proteins, including cleavage and polyadenylation-specific factor 3 (CPSF3), fragile X mental retardation syndrome related protein 1 (FXR1), THO complex 1 (THOC1) and polyA-specific ribonuclease (PARN) () and ).

Table 1. Top protein that might interact with NONMMUT011061.

Figure 8. Prediction of the potential transcriptional factors (TFs) of the NONMMUT011061. The TRANSFAC database was used to predict the TFs associated with NONMMUT011061 (http://www.gene-regulation.com/index2.html).

Figure 8. Prediction of the potential transcriptional factors (TFs) of the NONMMUT011061. The TRANSFAC database was used to predict the TFs associated with NONMMUT011061 (http://www.gene-regulation.com/index2.html).

Figure 9. Bioinformatics analysis of NONMMUT011061. (A) The chromosome location of NONMMUT011061 in the mouse genome was shown. (B) Prediction of RNA secondary structure for NONMMUT011061 (RNAfold web server, University of Vienna). A minimal free energy structure (MFE = −396.90 kcal/mol) was shown. Base pairing probabilities have been color-coded using a scale from 0 (blue) to 1 (red). (C) catRAPID analysis indicated a strong interaction between CPSF3 and NONMMUT011061.

Figure 9. Bioinformatics analysis of NONMMUT011061. (A) The chromosome location of NONMMUT011061 in the mouse genome was shown. (B) Prediction of RNA secondary structure for NONMMUT011061 (RNAfold web server, University of Vienna). A minimal free energy structure (MFE = −396.90 kcal/mol) was shown. Base pairing probabilities have been color-coded using a scale from 0 (blue) to 1 (red). (C) catRAPID analysis indicated a strong interaction between CPSF3 and NONMMUT011061.

Discussion

With the recent technical advances in genome-wide studies, it is becoming increasingly obvious that more than 98% of the human genome is transcribed into non-coding RNAs (ncRNAs), while the majority of these transcripts can be categorized as lncRNAs[Citation49]. Although the pivotal role of individual lncRNA in the pathogenesis is being increasingly realized, the possible role of lncRNAs in the interaction between IAV and the host remains largely unknown. In this study, to further investigate the potential role of lncRNAs in the pathogenesis of IAV, a highly pathogenic virus CK10 and a low pathogenic virus GS10 were used to characterize the expression profile of lncRNAs in a mouse model using deep-sequencing. A total of 126 SDE lncRNAs were identified in CK10-infected mice, whereas the corresponding number for GS10 was only 94 ()). Moreover, compared to GS10, the differentially expressed lncRNAs-coexpressed mRNAs regulated by CK10 were highly related with the aberrant and uncontrolled inflammatory responses (). Interestingly, one lncRNA, NONMMUT011061, potentially interacting with a great number of inflammatory-related genes, was significantly up-regulated in both H5N1 CK10 strain and highly pathogenic H5N8 QD5 strain when compared to GS10 ( and (A)).

To date, several lncRNAs have been shown to play an important role in the innate immune response against influenza virus and be associated with viral pathogenesis, including NRAV (negative regulator of antiviral response)[Citation11], NEAT1 (nuclear enriched abundant transcript 1)[Citation30], Bst2/BISPR (bone marrow stromal cell antigen 2 IFN-stimulated positive regulator) [Citation31,Citation32] and VIN (virus inducible lincRNA)[Citation33]. NRAV inhibits the transcription of interferon-stimulated genes via affecting histone modification of these genes (mainly MxA and IFITM3), resulting in the manipulation of the antiviral response[Citation11]. The ectopic expression of human NRAV conferred the hypersensitivity of the mice to influenza virus infection, evidenced by higher virus titers in the lung, increased body weight loss and higher mortality[Citation11]. In addition, the cooperative action between the transcriptional regulators and nuclear lncRNAs also impacts the innate immune response to IAV infection. NEAT1, an essential lncRNA for the formation of nuclear body paraspeckles, is activated by influenza virus and involved in the transcriptional activation of the antiviral gene interleukin Il8 probably through relocating transcriptional regulators splicing factor proline/glutamine-rich (SFPQ, a NEAT1-binding paraspeckle protein) from the Il8 promoter to the paraspeckles[Citation30]. In addition, lncRNAs can also directly regulate the expression of specific protein-coding-genes. LncRNA Bst2/BISPR is activated upon infection with the recombinant influenza virus that is deficient in the interferon (IFN) response blocking and after treatment with type I IFN. Bst2/BISPR regulates the expression of genomically neighboring protein-coding gene in an IFN-stimulated gene, cis bone marrow stromal cell antigen 2 (bst2), while the expression of other genes adjacent to bst2 was not affected [Citation31,Citation32]. It is worth noting that the host innate immune response can be regulated by the expression of NRAV, NEAT1 or Bst2/BISPR. However, VIN, a large intergenic ncRNAs, induced by H1N1, H3N2, H7N7 influenza viruses as well as vesicular stomatitis virus, plays a role in promoting influenza virus replication and is not affected by the treatment with IFN-β or IFN inducers[Citation33].

Compared with the protein-coding sequences, the majority of lncRNAs are poorly conserved in vertebrates. Moreover, it is quite difficult to predict the functions of lncRNAs simply based on their nucleotide sequences. To reveal the functional significance of lncRNAs in IAV, we constructed the lncRNA/mRNA coexpression network based on the correlation analysis. Bio-function analysis suggested that the lncRNAs-coexpressed mRNAs induced by CK10 virus were highly associated with inflammatory response-related functions ()). Moreover, compared to GS10 virus, mice infected with CK10 virus exhibited a stronger expression of genes associated with these functions ()). Canonical pathway further demonstrated that CK10 highly activated the inflammatory cytokines-related pathways, notably, ‘Activation of IRF by Cytosolic Pattern Recognition Receptors’ and ‘Interferon Signaling’ pathways ()). In addition, mice infected with CK10 exhibited a stronger expression of genes associated with these pathways (). Moreover, qRT-PCR results of the mRNAs that highly-related with the targeted lncRNAs forcefully confirmed that the expressions of the tested inflammatory cytokine genes were significantly higher in CK10 virus infected mouse lung than that of GS10 virus ()). Thus, we surmized that the distinct expression of lncRNAs may contribute to the intense inflammatory response induced by CK10 virus which quite accordance with the histopathology observed in the mouse lung ()).

Accumulating evidence suggests that a prolonged and dysregulated host response to influenza virus infection can act deleteriously to initiate or exacerbate pathological lung damage and subsequent death [Citation50Citation52]. Our previous study also showed that CK10-mediated robust innate immune response, termed as a cytokine storm or hypercytokinemia, is potentially fatal and is a significant underlying factor for the high mortality of infected mice [Citation35,Citation36]. In this study, we found that highly-pathogenic CK10 induced higher levels of lncRNA expression than the low pathogenic strain GS10. Therefore, the large number of SDE lncRNAs induced by CK10 may play a crucial role in virus-induced hypercytokinemia. However, further studies are needed to investigate the underlying mechanism for these lncRNAs to regulate the innate immune response.

The co-expression network analysis result showed that NONMMUT011061 could correlate a number of mRNAs () and the qRT-PCR validation results showed that NONMMUT011061 was significantly higher expressed in CK10-infected mice than that of GS10 virus ()). It is worth noting that some mRNAs highly correlated with NONMMUT011061 were also expressed at significantly higher levels compared with GS10 virus, including Cxcl11, Cxcl5, Ccl2, Irg1, Il6, Saa1, Ccl7 and Cxcl1 (). In addition, NONMMUT011061 was also distinctively up-regulated by another highly pathogenic H5N8 virus in mouse lung ()). Thus, these data together showed that NONMMUT011061 may act as a key regulator in IAV-mediated inflammatory response. Further functional study on NONMMUT011061 could provide useful insights into the pathogenesis of influenza virus.

Usually, the normal execution of biological event is controlled by a combination of lncRNA-mediated regulation and TFs[Citation53]. These two mechanisms share similar regulatory logistics and cooperate in part by influencing the activity of the binding sites in target genes. Some lncRNAs, such as 7SK, can directly affect the loading and activity of general TFs and to further affect the production of mRNAs[Citation54]. Moreover, another lncRNA, NRON, can directly serve as either co-factors or inhibitors to regulate the activity of nuclear factor of activated T cells (NFAT) proteins which manipulate gene expression in many cell types[Citation55]. In this study, using TRANSFAC, 75 TFs were predicted to have the potential to combine with NONMMUT011061, including some TFs associated with the immune response and some important pathways, such as IRF, MAF, HSF, GATA sequence and Myc (). Moreover, IRF, MAF, GATA and Myc are specifically up- or down-regulated during influenza virus infection (GSE41126 and GSE53932, http://www.ncbi.nlm.nih.gov/projects/geo/), suggesting their potential role during influenza virus infection.

A large number of studies have revealed the versatile and critical functions performed by IRF family in innate immunity [Citation56Citation58], adaptive immunity [Citation57] and many other biological processes, such as immune cell development [Citation56,Citation58], regulation of gene expression in response to pathogen infection [Citation56,Citation59,Citation60],regulation of the cell cycle [Citation61,Citation62] and apoptosis [Citation63,Citation64]. The MAF is a family of transcription factor protein that belongs to the activated protein-1 super-family of transcription factors. MAF plays multiple roles in regulation of cellular development and differentiation[Citation65]. Moreover, interaction between long intergenic non-coding RNAs (lincRNAs, linc-MAF-4) and MAF involves in the T lymphocyte differentiation[Citation66]. In addition, an lncRNA-MAF transcription factor network plays an essential role in epidermal differentiation[Citation67]. Interestingly, MAF also mediates the crosstalk between the MAPK and AKT/mTOR signal pathways[Citation68]. Most importantly, a number of studies have shown that influenza virus can effectively activate the MAPK pathways and the activation of MAPK family members plays an important role in viral replication, proinflammatory and apoptotic response in various cells during influenza virus infection [Citation69Citation74]. Moreover, the PI3K/Akt/mTOR pathway also activated during influenza virus infection and function as supporting viral effective replication [Citation75Citation80].

HSF, a conserved stress-activated transcription factor for the heat shock proteins, is a key component of the heat shock response and plays a versatile function, such as modulating host inflammatory response[Citation81], regulating stress-induced gene activation[Citation82], activating the ubiquitin proteasome system to promote non-apoptotic developmental cell death[Citation83]. The GATA is a type of transcription factor that plays important roles in several diseases, such as haematopoietic, cardiovascular, gastrointestinal tract, liver and pancreas, urogenital tract and kidney, respiratory tract, mammary gland and central nervous system diseases[Citation84]. Moreover, the GATA also directly affects G protein signaling through regulating the expression of regulator of G protein signaling 4 (RGS4)[Citation85]. The transcription factor Myc involves in regulating the expression of miR-23b-27b cluster during hypoxia-induced neuronal apoptosis[Citation86]. Due to the contributions of these TFs to the regulation of the immune response and their interplay between influenza viruses, it is likely that NONMMUT011061 may serve as an important regulator of the immune response to IAV infection. Future studies are still needed to examine whether NONMMUT011061 activation is also linked to the development of autoimmune and allergic disease as well as the excessive inflammation associated with the acute lung injury caused by CK10 virus infection ()).

Using the catRAPID algorithm, several proteins were predicted to be highly interactive with NONMMUT011061, including CPSF3, FXR1, THOC1 and PARN (, )). CPSF3, an essential component for converting heteronuclear RNA to mRNA, is associated with cellular stress response 1 protein-mediated cell death [Citation87] and also can be designed as an novel target for control toxoplasmosis[Citation88]. The RNA binding protein FXR1 is a critical regulator of post-transcriptional gene expression in differentiation, development and immunity [Citation89,Citation90]. The THOC1, also known as hHpr1/p84, is a nuclear matrix component protein that binds to the tumor suppressor retinoblastoma protein (pRb). [Citation91] THOC1 can dampen cell growth via inducing cell cycle arrest at G2/M and promote apoptosis in lung cancer cells and may have important implications in the development of targeted therapies for lung cancer[Citation92]. THOC1 also plays a crucial role in embryonic development in mice [Citation93] and regulates transcriptional elongation[Citation94]. Deadenylation of eukaryotic mRNA is a crucial mechanism for mRNA function through affecting mRNA turnover and the efficiency of protein synthesis. PARN is one of the biochemically best-characterized deadenylases[Citation95]. Moreover, PARN is also required for the 3′-end maturation of the telomerase RNA component (TERC) and plays a role in the biogenesis of TERC[Citation96]. Depletion of PARN inhibits the proliferation of the gastric cancer cells and promotes cell death through arrested the gastric cancer cells at the G0/G1 phase by up-regulating the expression levels of p53 and p21[Citation97]. Next, we want to find the potential interplay of these identified proteins with influenza virus infection. As a result, unfortunately, currently, we failed to find any association of these proteins with influenza virus infection. However, as stated above, we indeed found that these proteins have multiple functions, especially in cell death (CPSF3, THOC1 and PARN) and immunity regulation (FXR1), which might serve as a connection for the influenza virus-induced cell death and inflammatory response. Therefore, based on the multiple functions of these lncRNA-binding proteins, future investigations are warranted to explore the potential role of these proteins in IAV induced cell death and inflammatory response in the process of interacting with NONMMUT011061.

In summary, our study on the potential link between lncRNAs and IAV may presents a novel direction for understanding the pathogenesis of IAV and may give some clues of the therapeutic strategies for the disease. However, intensive studies are still needed to define the expression, regulation and functioning of lncRNAs for viral pathogenesis. Moreover, further studies centered on the common and unique lncRNAs induced by CK10 or GS10 would be helpful to further elucidate the differential pathogenic mechanisms between these two strains. In addition, further in-depth analysis of the interactions between the influenza virus machineries and specific lncRNAs (such as NONMMUT011061) will provide useful information on their potential role in IAV infection cycle. Moreover, in this study, the healthy mouse was used as the mammalian model. However, for further comparison, it is very interestingly to explore the lncRNA response using immunosuppressive, obese or pregnant mice.

Supplemental material

Supplemental Material

Download Zip (1.5 MB)

Disclosure statement

No potential conflict of interest was reported by the authors.

Supplementary material

Supplementary data for this article can be accessed here

Additional information

Funding

This work was supported by the National Key Research and Development Project of China (2016YFD0500202-1 and 2016YFD0501601), by the Jiangsu Provincial Natural Science Foundation of China (BK20150444), by the National Natural Science Foundation of China (31502076), by the Postdoctoral Science Foundation of Jiangsu Province, China (1501015B), by the Special Financial Grant from the China Postdoctoral Science Foundation (2016T90515), by the ‘Qing Lan Project’ of Higher Education Institutions of Jiangsu Province, China, by the ‘High-end talent support program’ of Yangzhou University, China, by the Natural Science Foundation of the Higher Education Institutions of Jiangsu Province, China (15KJB230006), by the Earmarked Fund for China Agriculture Research System (CARS-40) and by A Project Funded by the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD).

Notes on contributors

Xiufan Liu

Conceived and designed the experiments: JH, ZH and XL. Performed the experiments: JH, ZG, ZH, YL and CM. Analyzed the data: JH, ZH, XW, MG, XL, SH, SC, DP and XJ. Contributed reagents/materials/analysis tools: XW, MG, XL, SH, SC, DP and XJ. Specifically, DP and XJ provided the IPA software and contribute to partial analysis work; XW and MG contribute to the TRANSFAC and UCSC analysis method and analysis; XL and SH involved in providing PhyloCSF and RNAfold WebServer analysis tools and contribute to partial analysis work; SC contributes to the catRAPID algorithm analysis. Wrote the paper: JH, ZH and XL.

References

  • Szretter KJ, Gangappa S, Lu X, et al. Role of host cytokine responses in the pathogenesis of avian H5N1 influenza viruses in mice. J Virol. 2007;81:2736–2744.
  • Perrone LA, Plowden JK, Garcia-Sastre A, et al. H5N1 and 1918 pandemic influenza virus infection results in early and excessive infiltration of macrophages and neutrophils in the lungs of mice. PLoS Pathog 2008; 4:e1000115.
  • Cameron CM, Cameron MJ, Bermejo-Martin JF, et al. Gene expression analysis of host innate immune responses during Lethal H5N1 infection in ferrets. J Virol. 2008;82:11308–11317.
  • Baskin CR, Bielefeldt-Ohmann H, Tumpey TM, et al. Early and sustained innate immune response defines pathology and death in nonhuman primates infected by highly pathogenic influenza virus. Proc Natl Acad Sci U S A. 2009;106:3455–3460.
  • Costa FF. Non-coding RNAs: meet thy masters. Bioessays. 2010;32:599–608.
  • Rinn JL, Chang HY. Genome regulation by long noncoding RNAs. Annu Rev Biochem. 2012;81:145–166.
  • Yoon JH, Abdelmohsen K, Gorospe M. Posttranscriptional gene regulation by long noncoding RNA. J Mol Biol. 2013;425:3723–3730.
  • Magistri M, Faghihi MA, St Laurent G 3rd, et al. Regulation of chromatin structure by long noncoding RNAs: focus on natural antisense transcripts. Trends Genet. 2012;28:389–396.
  • Tsai MC, Manor O, Wan Y, et al. Long noncoding RNA as modular scaffold of histone modification complexes. Science. 2010;329:689–693.
  • Cesana M, Cacchiarelli D, Legnini I, et al. A long noncoding RNA controls muscle differentiation by functioning as a competing endogenous RNA. Cell. 2011;147:358–369.
  • Ouyang J, Zhu X, Chen Y, et al. NRAV, a long noncoding RNA, modulates antiviral responses through suppression of interferon-stimulated gene transcription. Cell Host Microbe. 2014;16:616–626.
  • Huarte M, Guttman M, Feldser D, et al. A large intergenic noncoding RNA induced by p53 mediates global gene repression in the p53 response. Cell. 2010;142:409–419.
  • Carpenter S, Aiello D, Atianand MK, et al. A long noncoding RNA mediates both activation and repression of immune response genes. Science. 2013;341:789–792.
  • Tripathi V, Ellis JD, Shen Z, et al. The nuclear-retained noncoding RNA MALAT1 regulates alternative splicing by modulating SR splicing factor phosphorylation. Mol Cell. 2010;39:925–938.
  • Yin QF, Yang L, Zhang Y, et al. Long noncoding RNAs with snoRNA ends. Mol Cell. 2012;48:219–230.
  • Jalali S, Jayaraj GG, Scaria V. Integrative transcriptome analysis suggest processing of a subset of long non-coding RNAs to small RNAs. Biol Direct. 2012;7:25.
  • Willingham AT, Orth AP, Batalov S, et al. A strategy for probing the function of noncoding RNAs finds a repressor of NFAT. Science. 2005;309:1570–1573.
  • Rapicavoli NA, Qu K, Zhang J, et al. A mammalian pseudogene lncRNA at the interface of inflammation and anti-inflammatory therapeutics. Elife 2013; 2:e00762.
  • Wright PW, Huehn A, Cichocki F, et al. Identification of a KIR antisense lncRNA expressed by progenitor cells. Genes Immun. 2013;14:427–433.
  • Carpenter S. Long noncoding RNA: novel links between gene expression and innate immunity. Virus Res. 2015.
  • Aune TM, Spurlock CF 3rd. Long non-coding RNAs in innate and adaptive immunity. Virus Res. 2015.
  • Heward JA, Lindsay MA. Long non-coding RNAs in the regulation of the immune response. Trends Immunol. 2014;35:408–419.
  • Fitzgerald KA, Caffrey DR. Long noncoding RNAs in innate and adaptive immunity. Curr Opin Immunol. 2014;26:140–146.
  • Collier SP, Collins PL, Williams CL, et al. Cutting edge: influence of Tmevpg1, a long intergenic noncoding RNA, on the expression of Ifng by Th1 cells. J Immunol. 2012;189:2084–2088.
  • Broadbent KM, Park D, Wolf AR, et al. A global transcriptional analysis of Plasmodium falciparum malaria reveals a novel family of telomere-associated lncRNAs. Genome Biol. 2011;12:R56.
  • Scaria V, Pasha A. Long Non-Coding RNAs in Infection Biology. Front Genet. 2012;3:308.
  • Fortes P, Morris KV. Long noncoding RNAs in viral infections. Virus Res. 2015.
  • Rossetto CC, Pari GS. Kaposi’s sarcoma-associated herpesvirus noncoding polyadenylated nuclear RNA interacts with virus- and host cell-encoded proteins and suppresses expression of genes involved in immune modulation. J Virol. 2011;85:13290–13297.
  • Cazalla D, Yario T, Steitz JA. Down-regulation of a host microRNA by a Herpesvirus saimiri noncoding RNA. Science. 2010;328:1563–1566.
  • Imamura K, Imamachi N, Akizuki G, et al. Long noncoding RNA NEAT1-dependent SFPQ relocation from promoter region to paraspeckle mediates IL8 expression upon immune stimuli. Mol Cell. 2014;53:393–406.
  • Barriocanal M, Carnero E, Segura V, et al. Long non-coding RNA BST2/BISPR is induced by IFN and regulates the expression of the antiviral factor tetherin. Front Immunol. 2014;5:655.
  • Kambara H, Gunawardane L, Zebrowski E, et al. Regulation of interferon-stimulated gene BST2 by a lncRNA transcribed from a shared bidirectional promoter. Front Immunol. 2014;5:676.
  • Winterling C, Koch M, Koeppel M, et al. Evidence for a crucial role of a host non-coding RNA in influenza A virus replication. RNA Biol. 2014;11:66–75.
  • Hu J, Zhao K, Liu X, et al. Two highly pathogenic avian influenza H5N1 viruses of clade 2.3.2.1 with similar genetic background but with different pathogenicity in mice and ducks. Transbound Emerg Dis. 2012.
  • Hu J, Hu Z, Song Q, et al. The PA-gene-mediated lethal dissemination and excessive innate immune response contribute to the high virulence of H5N1 avian influenza virus in mice. J Virol. 2013;87:2660–2672.
  • Hu J, Gao Z, Wang X, et al. iTRAQ-based quantitative proteomics reveals important host factors involved in the high pathogenicity of the H5N1 avian influenza virus in mice. Med Microbiol Immunol. 2017;206:125–147.
  • Hu J, Zhao K, Liu X, et al. Two highly pathogenic avian influenza H5N1 viruses of clade 2.3.2.1 with similar genetic background but with different pathogenicity in mice and ducks. Transbound Emerg Dis. 2013;60:127–139.
  • Li J, Gu M, Liu D, et al. Phylogenetic and biological characterization of three K1203 (H5N8)-like avian influenza A virus reassortants in China in 2014. Arch Virol. 2016;161:289–302.
  • Cui Z, Hu J, He L, et al. Differential immune response of mallard duck peripheral blood mononuclear cells to two highly pathogenic avian influenza H5N1 viruses with distinct pathogenicity in mallard ducks. Arch Virol. 2014;159:339–343.
  • Kriegel AJ, Liu Y, Liu PY, et al. Characteristics of microRNAs enriched in specific cell types and primary tissue types in solid organs. Physiol Genomics. 2013;45:1144–1156.
  • Li R, Yu C, Li Y, et al. SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics. 2009;25:1966–1967.
  • Trapnell C, Williams BA, Pertea G, et al. Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010;28:511–U174.
  • Benjamini Y, Yekutieli D. The control of the false discovery rate in multiple testing under dependency. Ann Stat. 2001;29:1165–1188.
  • Sun L, Zhang ZH, Bailey TL, et al. Prediction of novel long non-coding RNAs based on RNA-Seq data of mouse Klf1 knockout study. In: Bmc Bioinformatics. 2012. p. 13.
  • Tafer H, Hofacker IL. RNAplex: a fast tool for RNA-RNA interaction search. Bioinformatics. 2008;24:2657–2663.
  • Liao Q, Liu CN, Yuan XY, et al. Large-scale prediction of long non-coding RNA functions in a coding-non-coding gene co-expression network. Nucleic Acids Res. 2011;39:3864–3878.
  • Wingender E, Chen X, Hehl R, et al. TRANSFAC: an integrated system for gene expression regulation. Nucleic Acids Res. 2000;28:316–319.
  • Washietl S, Hofacker IL, Lukasser M, et al. Mapping of conserved RNA secondary structures predicts thousands of functional noncoding RNAs in the human genome. Nat Biotechnol. 2005;23:1383–1390.
  • Djebali S, Davis CA, Merkel A, et al. Landscape of transcription in human cells. Nature. 2012;489:101–108.
  • Oldstone MB. Lessons learned and concepts formed from study of the pathogenesis of the two negative-strand viruses lymphocytic choriomeningitis and influenza. Proc Natl Acad Sci U S A. 2013;110:4180–4183.
  • Opal SM, Fedson DS. The dysfunctional host response to influenza A H7N9: a potential treatment option? Crit Care. 2014;18:135.
  • Thomas M, Mani RS, Philip M, et al. Proinflammatory chemokines are major mediators of exuberant immune response associated with Influenza A (H1N1) pdm09 virus infection. J Med Virol. 2017.
  • Wu H, Zhao M, Yoshimura A, et al. Critical link between epigenetics and transcription factors in the induction of autoimmunity: a comprehensive review. Clin Rev Allergy Immunol. 2016;50:333–344.
  • Chen R, Yang Z, Zhou Q. Phosphorylated positive transcription elongation factor b (P-TEFb) is tagged for inhibition through association with 7SK snRNA. J Biol Chem. 2004;279:4153–4160.
  • Sharma S, Findlay GM, Bandukwala HS, et al. Dephosphorylation of the nuclear factor of activated T cells (NFAT) transcription factor is regulated by an RNA-protein scaffold complex. Proc Natl Acad Sci U S A. 2011;108:11381–11386.
  • Tamura T, Yanai H, Savitsky D, et al. The IRF family transcription factors in immunity and oncogenesis. Annu Rev Immunol. 2008;26:535–584.
  • Ikushima H, Negishi H, Taniguchi T. The IRF family transcription factors at the interface of innate and adaptive immune responses. Cold Spring Harb Symp Quant Biol. 2013;78:105–116.
  • Savitsky D, Tamura T, Yanai H, et al. Regulation of immunity and oncogenesis by the IRF transcription factor family. Cancer Immunol Immunother. 2010;59:489–510.
  • Chung MC, Kawamoto S. IRF-2 is involved in up-regulation of nonmuscle myosin heavy chain II-A gene expression during phorbol ester-induced promyelocytic HL-60 differentiation. J Biol Chem. 2004;279:56042–56052.
  • Manzella L, Giuffrida MA, Pilastro MR, et al. Possible role of the transcription factor interferon regulatory factor 1 (IRF-1) in the regulation of ornithine decarboxylase (ODC) gene expression during IFN gamma macrophage activation. FEBS Lett. 1994;348:177–180.
  • Liang J, Yan R, Chen G, et al. Downregulation of ZBTB24 hampers the G0/1- to S-phase cell-cycle transition via upregulating the expression of IRF-4 in human B cells. Genes Immun. 2016;17:276–282.
  • Armstrong MJ, Stang MT, Liu Y, et al. Interferon Regulatory Factor 1 (IRF-1) induces p21(WAF1/CIP1) dependent cell cycle arrest and p21(WAF1/CIP1) independent modulation of survivin in cancer cells. Cancer Lett. 2012;319:56–65.
  • Kano A, Haruyama T, Akaike T, et al. IRF-1 is an essential mediator in IFN-gamma-induced cell cycle arrest and apoptosis of primary cultured hepatocytes. Biochem Biophys Res Commun. 1999;257:672–677.
  • Zhao J, Chen C, Xiao JR, et al. An up-regulation of IRF-1 after a spinal cord injury: implications for neuronal apoptosis. J Mol Neurosci. 2015;57:595–604.
  • Zhang C, Guo ZM. Multiple functions of Maf in the regulation of cellular development and differentiation. Diabetes Metab Res Rev. 2015;31:773–778.
  • Ranzani V, Rossetti G, Panzeri I, et al. The long intergenic noncoding RNA landscape of human lymphocytes highlights the regulation of T cell differentiation by linc-MAF-4. Nat Immunol. 2015;16:318–325.
  • Lopez-Pajares V, Qu K, Zhang J, et al. A LncRNA-MAF:MAFB transcription factor network regulates epidermal differentiation. Dev Cell. 2015;32:693–706.
  • Brundage ME, Tandon P, Eaves DW, et al. MAF mediates crosstalk between Ras-MAPK and mTOR signaling in NF1. Oncogene. 2014;33:5626–5636.
  • James SJ, Jiao H, Teh HY, et al. MAPK phosphatase 5 expression induced by influenza and other RNA virus infection negatively regulates IRF3 activation and type I interferon response. Cell Rep. 2015.
  • Hui KP, Lee SM, Cheung CY, et al. Induction of proinflammatory cytokines in primary human macrophages by influenza A virus (H5N1) is selectively regulated by IFN regulatory factor 3 and p38 MAPK. J Immunol. 2009;182:1088–1098.
  • Lee DC, Cheung CY, Law AH, et al. p38 mitogen-activated protein kinase-dependent hyperinduction of tumor necrosis factor alpha expression in response to avian influenza virus H5N1. J Virol. 2005;79:10147–10154.
  • Kujime K, Hashimoto S, Gon Y, et al. p38 mitogen-activated protein kinase and c-jun-NH2-terminal kinase regulate RANTES production by influenza virus-infected human bronchial epithelial cells. J Immunol. 2000;164:3222–3228.
  • Ludwig S, Ehrhardt C, Neumeier ER, et al. Influenza virus-induced AP-1-dependent gene expression requires activation of the JNK signaling pathway. J Biol Chem 2001; 276:10990–10998.
  • Pleschka S, Wolff T, Ehrhardt C, et al. Influenza virus propagation is impaired by inhibition of the Raf/MEK/ERK signalling cascade. Nat Cell Biol. 2001;3:301–305.
  • Ehrhardt C, Marjuki H, Wolff T, et al. Bivalent role of the phosphatidylinositol-3-kinase (PI3K) during influenza virus infection and host cell defence. Cell Microbiol. 2006;8:1336–1348.
  • Ehrhardt C, Wolff T, Pleschka S, et al. Influenza A virus NS1 protein activates the PI3K/Akt pathway to mediate antiapoptotic signaling responses. J Virol. 2007;81:3058–3067.
  • Hale BG, Jackson D, Chen YH, et al. Influenza A virus NS1 protein binds p85 beta and activates phosphatidylinositol-3-kinase signaling. P Natl Acad Sci USA 2006; 103:14194–14199.
  • Shin YK, Li Y, Liu Q, et al. SH3 binding motif 1 in influenza a virus NS1 protein is essential for PI3K/Akt signaling pathway activation. J Virol. 2007;81:12730–12739.
  • Shin YK, Liu Q, Tikoo SK, et al. Effect of the phosphatidylinositol 3-kinase/Akt pathway on influenza A virus propagation. J Gen Virol. 2007;88:942–950.
  • Shin YK, Liu Q, Tikoo SK, et al. Influenza A virus NS1 protein activates the phosphatidylinositol 3-kinase (PI3K)/Akt pathway by direct interaction with the p85 subunit of PI3K. J Gen Virol. 2007;88:13–18.
  • Tong Z, Jiang B, Zhang L, et al. HSF-1 is involved in attenuating the release of inflammatory cytokines induced by LPS through regulating autophagy. Shock. 2014;41:449–453.
  • Duarte FM, Fuda NJ, Mahat DB, Core LJ, Guertin MJ, Lis JT. Transcription factors GAF and HSF act at distinct regulatory steps to modulate stress-induced gene activation. Genes Dev. 2016;30:1731–1746.
  • Kinet MJ, Malin JA, Abraham MC, et al. HSF-1 activates the ubiquitin proteasome system to promote non-apoptotic developmental cell death in C elegans. In: Elife. 2016. p. 5.
  • Lentjes MH, Niessen HE, Akiyama Y, et al. The emerging role of GATA transcription factors in development and disease. Expert Rev Mol Med 2016; 18:e3.
  • Zhang Y, Li F, Xiao X, et al. Regulator of G protein signaling 4 is a novel target of GATA-6 transcription factor. Biochem Biophys Res Commun. 2017;483:923–929.
  • Chen Q, Zhang F, Wang Y, et al. The transcription factor c-Myc suppresses MiR-23b and MiR-27b transcription during fetal distress and increases the sensitivity of neurons to hypoxia-induced apoptosis. PLoS One 2015; 10:e0120217.
  • Zhu ZH, Yu YP, Shi YK, et al. CSR1 induces cell death through inactivation of CPSF3. Oncogene. 2009;28:41–51.
  • Palencia A, Bougdour A, Brenier-Pinchart MP, et al. Targeting Toxoplasma gondii CPSF3 as a new approach to control toxoplasmosis. EMBO Mol Med. 2017;9:385–394.
  • Raheja R, Gandhi R. FXR1: linking cellular quiescence, immune genes and cancer. Cell Cycle. 2016;15:2695–2696.
  • Qian J, Hassanein M, Hoeksema MD, et al. The RNA binding protein FXR1 is a new driver in the 3q26-29 amplicon and predicts poor prognosis in human cancers. P Natl Acad Sci USA 2015; 112:3469–3474.
  • Durfee T, Mancini MA, Jones D, et al. The amino-terminal region of the retinoblastoma gene product binds a novel nuclear matrix protein that co-localizes to centers for RNA processing. J Cell Biol. 1994;127:609–622.
  • Wan J, Zou S, Hu M, et al. Thoc1 inhibits cell growth via induction of cell cycle arrest and apoptosis in lung cancer cells. Mol Med Rep. 2014;9:2321–2327.
  • Chinnam M, Wang X, Zhang X, et al. Evaluating effects of hypomorphic Thoc1 alleles on embryonic development in Rb1 null mice. Mol Cell Biol. 2016;36:1621–1627.
  • Li Y, Wang X, Zhang X, et al. Human hHpr1/p84/Thoc1 regulates transcriptional elongation and physically links RNA polymerase II and RNA processing factors. Mol Cell Biol. 2005;25:4023–4033.
  • Katoh T, Hojo H, Suzuki T. Destabilization of microRNAs in human cells by 3ʹ deadenylation mediated by PARN and CUGBP1. Nucleic Acids Res. 2015;43:7521–7534.
  • Moon DH, Segal M, Boyraz B, et al. Poly(A)-specific ribonuclease (PARN) mediates 3ʹ-end maturation of the telomerase RNA component. Nat Genet. 2015;47:1482–1488.
  • Zhang LN, Yan YB. Depletion of poly(A)-specific ribonuclease (PARN) inhibits proliferation of human gastric cancer cells by blocking cell cycle progression. Biochim Biophys Acta. 2015;1853:522–534.