1,007
Views
0
CrossRef citations to date
0
Altmetric
Research Paper

mNFE: microbiome network flow entropy for detecting pre-disease states of type 1 diabetes

, ORCID Icon, , , & ORCID Icon
Article: 2327349 | Received 24 Jul 2023, Accepted 04 Mar 2024, Published online: 21 Mar 2024

ABSTRACT

In the development of Type 1 diabetes (T1D), there are critical states just before drastic changes, and identifying these pre-disease states may predict T1D or provide crucial early-warning signals. Unlike gene expression data, gut microbiome data can be collected noninvasively from stool samples. Gut microbiome sequencing data contain different levels of phylogenetic information that can be utilized to detect the tipping point or critical state in a reliable manner, thereby providing accurate and effective early-warning signals. However, it is still difficult to detect the critical state of T1D based on gut microbiome data due to generally non-significant differences between healthy and critical states. To address this problem, we proposed a new method – microbiome network flow entropy (mNFE) based on a single sample from each individual – for detecting the critical state before seroconversion and abrupt transitions of T1D at various taxonomic levels. The numerical simulation validated the robustness of mNFE under different noise levels. Furthermore, based on real datasets, mNFE successfully identified the critical states and their dynamic network biomarkers (DNBs) at different taxonomic levels. In addition, we found some high-frequency species, which are closely related to the unique clinical characteristics of autoantibodies at the four levels, and identified some non-differential ‘dark species’ play important roles during the T1D progression. mNFE can robustly and effectively detect the pre-disease states at various taxonomic levels and identify the corresponding DNBs with only a single sample for each individual. Therefore, our mNFE method provides a new approach not only for T1D pre-disease diagnosis or preventative treatment but also for preventative medicine of other diseases by gut microbiome.

Introduction

The majority of Type 1 diabetes (T1D) patients have autoimmune disease that targets pancreatic islet beta cells and arises from complex interactions among genetic elements, patient exposures, and the gut microbiome, usually occurring in children and young adults. And only a minority of patients with T1D have non-autoimmune pathogenesis.Citation1–3 Destruction of 90% of beta cells is a critical point at which clinical manifestations emerge.Citation4 Because of the early onset and chronicity of this disease, the detection of critical states in the development of T1D is of great importance. The development of biological processes is often viewed as a time-dependent nonlinear dynamical system or network involving non-smooth or abrupt state transitions.Citation5,Citation6 Similarly, T1D development with a critical transition point can be generally divided into three states, i.e., normal/healthy state, pre-disease/critical state, and disease state. A normal state is a relatively stable or healthy state before the transition. The pre-disease state or tipping point is the limit of the normal state just before the imminent drastic transition, which is reversible to the normal state if appropriately treated. If not appropriately treated, the system will develop into disease state, which is another stable state and irreversible to the normal state.Citation5,Citation7–9 Thus, timely detection of the pre-disease or critical state is of great importance for early diagnosis and treatment. However, it is challenging to detect the tipping point or critical state because a normal state and its critical state are similar in terms of both phenotype and gene expression, which leads to the failure of traditional biomarkers in detecting the critical state.

To address this important challenge, the concept of the dynamic network biomarker (DNB), defined by three statistical conditions, has been developed to quantify the critical state (or called “WeiBing” in traditional Chinese medicine) or provide early-warning signals of the disease state during disease development.Citation8 DNB members consist of a set of molecules, genes, or proteins that are strongly fluctuating and also strongly correlated, i.e. they can be characterized as “critical collective fluctuations” in terms of variables in contrast to “critical slowing down” in terms of states.Citation8 Unlike traditional biomarkers, the concentrations of DNB molecules fluctuate collectively when approaching the critical state, rather than exhibiting constant values or random fluctuations.Citation10 In particular, when the system is near a bifurcation point or tipping point, a dominant group or DNB appears, and the members of the dominant group or DNB satisfy the following three conditions: the correlation between DNB members rapidly increases, the standard deviation for each DNB member drastically increases, and the correlation between one DNB member and any non-DNB molecule decreases.Citation7,Citation8

Increasing evidence suggests that the gut microbiota is involved in the pathogenesis of T1D.Citation11 The gut microbiota plays an important role in the regulation of autoimmunity and tolerance.Citation12–15 It has been proposed that altered intestinal microbiota may impact T1D pathogenesis by increasing gut permeability, facilitating intestinal inflammation, and disturbing immunological maturation.Citation16–19 Most previous studies have identified key bacteria as biomarkers based on variation in abundance between healthy and diseased groups.Citation20,Citation21 However, the existing studies have mainly concentrated on revealing the role of gut bacteria in the development of T1D or identifying its disease state, which may fail in identifying its pre-disease state on a single-sample basis.

A number of DNB-based computational methods have been developed to detect the critical states of diabetes using gene expression data. The early-warning signals of T1D and its leading biomolecular networks have been detected and used to mark the time period just before the drastic deterioration to diabetes.Citation22,Citation23 The degree matrix network entropy method can detect the critical states of type 2 diabetes mellitus based on a sample-specific network.Citation24 However, these methods are mainly based on gene expression data and suffer from unsatisfactory effectiveness and robustness due to highly noisy omics data, especially on the basis of a single sample. Unlike gene expression data, gut microbiome sequencing data can be noninvasively collected from stool samples, so they are more easily accessible. The collected metagenomic data contain phylogenetic information at various taxonomic levels, which can help us detect early-warning signals of T1D, thus improving the robustness and effectiveness of the mNFE method. Therefore, developing an effective and robust method to detect the critical state of the gut microbiome in T1D with only a single sample from an individual is an important and challenging task.

To solve this problem, we proposed a novel method, i.e., microbiome network flow entropy (mNFE), to detect the critical state of T1D based on solely a single sample of gut microbiota from an individual. First, we constructed a microbial association network by using the neighborhood selection (mb) method based on SParse InversE Covariance Estimation for Ecological Association Inference (SPIEC-EASI). Then we calculated the NFE by extracting the local- or sub-network from the sample-specific association network. Finally, the global/whole network disturbance caused by a single case sample was quantified by mNFE. By transforming the microbial profile from sequencing data into NFE, the mNFE method provides a new way to detect the critical state based on a single sample from any individual. Our proposed method has the following advantages: (i) The stool samples needed for human gut microbiome sequencing are collected noninvasively and can be sampled repeatedly. Gut microbiome sequencing data contain different levels of phylogenetic information that can be utilized to detect the tipping point in a reliable manner, thereby providing more accurate and effective early-warning signals. (ii) The mNFE method reliably quantifies network fluctuation, that is, the collective molecule fluctuation induced by each single sample for a set of given reference samples, thereby reducing the noise and enhancing the robustness by mining dynamical and high-dimensional information of T1D omics data. (iii) With the mNFE method, we can successfully detect the critical states at various taxonomic levels and identify the corresponding DNBs with only a single sample for each individual, both of which contribute to the effectiveness and robustness of this method. (iv) The mNFE method is a model-free and data-driven method, which can achieve the detection of early-warning signals for T1D based on a single sample from an individual. (v) Based on the mNFE method, some high-frequency species that are closely related to the unique clinical characteristics of autoantibodies were observed at the four levels, including Glutamic Acid Decarboxylase (GADA), insulinoma antigen-2 (IA-2A), insulin (IAA) and Zinc Transporter 8 (ZnT8A) levels. And some non-differential ‘dark species’ that play important roles in T1D development were identified. Network connectivity and fragility were measured to illustrate the potential interactions among species in control, seroconversion, and T1D case groups.

The mNFE method is capable of detecting the tipping point before the onset of seroconversion and T1D symptoms at different taxonomic levels and identifying the corresponding DNBs, both of which enhance the effectiveness and accuracy of this method. The critical states of all symptomatic subjects were detected before symptoms appeared based on the high-frequency species. The numerical simulation demonstrated the robustness of mNFE. Some identified high-frequency species were found to be associated with the clinical features of autoantibodies at the four levels. We also revealed the ‘dark species’ that are related to the development of T1D.We showed that the connectivity and robustness of the microbial association network in the control and seroconversion groups were significantly stronger than in the T1D case group.

Results

To illustrate how mNFE works, we first applied it to a simulated dataset, and then to T1D. The proposed method successfully detected the early-warning signals of critical transition into an irreversible after-transition state, which validated the effectiveness of our method in identifying the tipping point just before the critical transition.

Validation based on numerical simulation

To validate the proposed mNFE method, we employed a regulatory network with eight nodes, which was inspired by previous works.Citation25,Citation26 A model of gene regulatory network of Michaelis – Menten or Hill formCitation27,Citation28 is typically used to study genetic regulations including transcription and diffusion processes,Citation29,Citation30 nonlinear biological processes, and other gene regulatory activities.Citation31,Citation32 To transform the simulation model into a complete time series, the numerical data were generated from the model network with a parameter p varying from − 0.5 to 0.25.

The gene regulatory network of eight nodes with positive or negative relationships is demonstrated in . Constructing the gene regulatory network based on the generated data with a set of parameters, we performed the numerical experiment. As shown in , the sudden increase of global mNFE score represents the imminent early-warning signals when the system is near the bifurcation parameter value p = 0. Thus, the significant increase of mNFE score implies the upcoming critical transition at p = 0. To better illustrate the distinct dynamics between the normal state and the critical state, the mNFE landscape of the eight nodes is illustrated in . In particular, when the system is far away from the tipping point, the mNFE score of each node is at a low level; when the system approaches the tipping point p = 0, the mNFE score increases sharply. In addition, we have analyzed the stability and robustness of the mNFE method under different noise strengths in . With the increase of noise strength, the mNFE method remained stable in providing the early-warning signals of critical transition, which shows that the mNFE method is stable and robust. The simulation and calculation details are presented in the Supplementary Material, Figure S1.

Figure 1. The validation of the mNFE method on a simulation dataset.

(A) The model of an eight-node network, in which the arrow represents positive regulation, and the blunt line represents negative regulation. (B) The curve of the mNFE score based on the gene regulatory network. (C) The landscape of the mNFE scores for eight nodes. (D) The robustness of the mNFE method under different noise strengths.
Figure 1. The validation of the mNFE method on a simulation dataset.

Critical states of T1D based on the mNFE

Critical states of T1D at the different taxonomical levels

We then applied the mNFE method to a previously published longitudinal dataset of T1D.Citation33 As shown in , there were 33 subjects/infants, 11 of whom seroconverted to serum autoantibody positivity, and among those 11, four developed T1D. The 16S rRNA sequencing was performed on stool samples with a median of 23 unique samples per individual (minimum 8, maximum 34), and a full operational taxonomic unit table was created. We divided the 33 subjects into a symptomatic group containing T1D cases and seroconverters, and an asymptomatic group containing non-converters. Since the number of time points with sequencing data from two of the four T1D cases was insufficient, we could only detect the tipping point of seroconversion in these two individuals, thus we put their results of early-warning signals together with the seroconverters. As shown in , the global mNFE score curves of T1D and seroconverted individuals showed obvious signals, while the score curves of nonconverters showed no significant change. The top 10% features with the largest mNFE scores at the critical state were regarded as DNBs.Citation26 For 11 symptomatic subjects, the specific mNFE scores for their DNBs are shown in . These scores demonstrated that early-warning signals for T1D and seroconverted individuals preceded the onset of symptoms, expect for the individual E026079 where the early-warning signal co-occurred with symptoms.

Figure 2. The identification of critical states for seroconversion and T1D based on mNFE at the overall taxonomic level.

(A) Table of mNFE diagnoses and clinical diagnoses for all subjects. (B) Line chart for early-warning signals in all symptomatic subjects. The red curve represents the mNFE scores for two T1D subjects. The green curve represents the mNFE score for nine seroconverters. (C) Line chart for early-warning signals in all asymptomatic subjects. The blue curve represents the mNFE score for 22 non-converters.
Figure 2. The identification of critical states for seroconversion and T1D based on mNFE at the overall taxonomic level.

Figure 3. The curves of mNFE scores for all symptomatic subjects based on their respective DNBs at the overall taxonomic level.

(A) The individual mNFE score curves of two T1D subjects based on their respective DNBs. (B) The individual mNFE score curves of nine seroconverters based on their respective DNBs. The yellow circle represents the pre-seroconversion state identified by the mNFE score, and the green square represents the time point of seroconversion onset (clinically observed). The purple circle stands for the tipping point or the critical state identified by the mNFE score, and the blue square denotes the time point of T1D onset (clinically observed).
Figure 3. The curves of mNFE scores for all symptomatic subjects based on their respective DNBs at the overall taxonomic level.

In addition, to improve the effectiveness of the mNFE method, we detected the early-warning signals of seroconversion and T1D at the genus and species levels, respectively. The results on the real dataset indicated that the critical states could be successfully detected before the onset of seroconversion and T1D at the genus and species levels. The details for the detection of tipping point at the species and genus levels are provided in the Supplementary Material, Figure S2 and S3.

Overview of the detection of critical states at different taxonomic levels

Moreover, we overviewed the tipping point detected for all individuals at three different levels, i.e., the overall taxonomic level, genus level, and species level. As shown in , for 11 symptomatic subjects, the 11/11 mNFE prediction was in advance of the symptom appearing time at all three different levels, and there were no wrong signals for 22 asymptomatic individuals, reflecting the high effectiveness and accuracy of the mNFE method in detecting the critical state on an individual basis.

Table 1. Overview of the detection of early-warning signals based on the mNFE at different taxonomic levels.

Diversity analysis and accuracy analysis of mNFE

In order to reveal the microbial richness and internal diversity of each individual sample, we firstly calculated and compared microbiome alpha diversity in the control, seroconversion and T1D case groups. We observed significant differences in alpha diversity (Chao) between T1D and the other two groups (p <.05, Wilcoxon rank-sum test) (). The alpha diversity of control individuals and seroconversion individuals were higher than that of T1D cases, which is consistent with previous studies.Citation33 Moreover, we compared mNFE with the Chao index for early-warning signals in the seroconversion and T1D individuals. We used the score indices, i.e., mNFET>1.5mNFET1andChaoT>1.5ChaoT1, as the threshold of the sharp increase. As shown in , for 11 symptomatic subjects, only 5/11 Chao prediction was in advance of symptom appearance in comparison to the 11/11 mNFE prediction, which validated the accuracy of the mNFE method in detecting the tipping point on an individual basis. Note that similar results were obtained with other indexes of alpha-diversity, such as Shannon, Simpson and richness.

Figure 4. Diversity analysis and accuracy analysis of mNFE.

(A) Alpha-diversity (Chao index) in three groups at the overall taxonomic level. (B) Alpha-diversity (Chao index) in three groups at the genus level. (C) The alpha-diversity (Chao index) curves of 11 symptomatic individuals. (D) PCA plots comparing three groups for all individuals. (E) Comparison of Bray-Curtis distance difference for T1D individuals in different groups. (F) Comparison of Bray-Curtis distance difference for seroconversion individuals in different groups.
Figure 4. Diversity analysis and accuracy analysis of mNFE.

Similarly, the microbiota beta-diversity analysis also showed a clear separation among T1D cases with control and seroconversion groups (). Finally, we calculated the difference in microbial community (Bray-Ccurtis distance) between each time point and the reference time points (the first two time points) for all symptomatic individuals. In there are significant differences between before and after the tipping point detected by mNFE for both T1D or seroconversion individuals. Moreover, the difference before and after the tipping point was higher than that before and after the onset of symptoms.

Analysis of high-frequency species of all symptomatic subjects with mNFE

Critical states of T1D based on high-frequency species

The DNBs varied among individuals even in the case of the same disease, but there still exist some common DNBs. We counted the common DNBs for all symptomatic individuals, and defined the high frequency species (Supplementary Table S1) as the DNBs common in two T1D individuals or in five or more out of 11 seroconverted individuals. Among them, there were 88 high-frequency species in two T1D infants and 77 high-frequency species in 11 seroconverted infants. To validate the effectiveness of mNFE score, we respectively warned the early-warning signals of seroconversion and T1D individuals based on the high-frequency species of these 11 symptomatic individuals, and the results showed that the critical states were all detected before the onset of symptoms (seen in Supplementary Material, Figure S4).

T1D is an autoimmune disease characterized by immune-mediated destruction of pancreatic beta cells. GADA, IA-2A, IAA, and ZnT8A are some of the most reliable biomarkers for autoimmune diabetes T1D in both children and adults. The samples form symptomatic individuals had different clinical profiles for autoantibodies including IAA, GADA, IA-2A, and ZnT8A levels. Therefore, to prioritize gut microbial species, we determined Spearman’s correlations between the abundances of microbial high-frequency species and clinical features using all symptomatic individuals. The close relationships of gut microbial high-frequency species with autoantibodies were observed, especially for GADA and ZnT8A (). Previously, Bacteroides acidifaciens was reported to be a potential agent for modulating metabolic disorders such as diabetes and obesity.Citation34 At the taxonomic level, the abundances of some high-frequency species, such as Bacteroides acidifaciens and Bacteroides ovatus, are closely related to the clinically important metadata of T1D, which validates the effectiveness of mNFE in identifying the DNBs.

Figure 5. Functional analyses of high-frequency species.

(A) Spearman’s correlations between high-frequency species and clinical metadata. (B) Heatmap of the Spearman’s correlation coefficients between high-frequency species and microbiota pathways. (C) Relative abundance of Eggerthella lenta, Bacteroides fragilis and Leuconostoc mesenteroides. Each species corresponds to a boxplot showing the p-values for the overall differences between the groups obtained from Kruskal-Wallis nonparametric tests, as well as the markers of the differential significance levels obtained from the Dunn’s post hoc tests (* p < .05, **p < .01, ***p < .001).
Figure 5. Functional analyses of high-frequency species.

Functional analysis of the high-frequency species

We used PICRUStCitation35 to predict the functional potential of microbial communities and identified 247 pathways, among which we selected differential abundance pathways. To investigate whether high-frequency species are associated with the progression of T1D, we conducted Spearman’s correlation analysis (p < .05) between the high-frequency species and selected pathways. As shown in , 14 high-frequency species were significantly related to the metabolic pathways during the progression of T1D, including alanine, aspartate, glutamate metabolism, carbohydrate digestion and absorption, fructose and mannose metabolism, adipocytokine signaling pathway valine, leucine and isoleucine biosynthesis, and lipoic acid metabolism, all of which have been previously associated with insulin resistance and glucose content.Citation36–40 Others pathways that are not directly associated with the blood glucose control and insulin resistance, they may actively influence a metabolic environment in the gut that is permissive to inflammation and promotes contributes to T1D pathogenesis.

In addition, literature searching also validates that some of these high-frequency species can improve blood glucose control and insulin resistance. For instance, Bacteroides uniformis and Bacteroides acidifaciens show promise as potential agents for modulating metabolic disorders such as diabetes and obesity, contributing to obesity prevention and improved insulin susceptibility in diabetes.Citation41,Citation42 Parabacteroides distasonis-derived nicotinic acid was reported as a vital bioactive molecule that fortifies intestinal barrier function by activating intestinal G-protein-coupled receptor 109a (GPR109a), leading to the amelioration of insulin resistance.Citation43 Collinsella aerofaciens has a positive correlation with 2−Hydroxybutyric acid (2HB) in the original study. Due to the activation of other metabolic pathways during insulin resistance, 2-hydroxybutyric acid is also synthesized as a coproduct of protein metabolism, thus the progression of insulin resistance is intrinsically related to the increase of 2HB levels.Citation44 Therefore, Collinsella aerofaciens may have an influence on increasing the 2HB levels, thereby holding predictive potential for the development of insulin resistance.

Therefore, the 14 high-frequency species likely play multiple roles in the progression of T1D, by (1) being involved in blood glucose control and insulin resistance through pathways mentioned above (and the metabolites produced); (2) pathways that may serve other functional roles in the development of T1D.

‘Dark species’ revealed by mNFE

In addition, we conducted a differential analysis of the microbial species and found that 70 species were differentially abundant among the three groups (p ≤ .05, Kruskal-Wallis test) (Supplementary Table S2). As shown in , in contrast to the T1D case group, there were significant differences in the relative abundances of some species in the control group and seroconversion group, such as Eggerthella lenta, Bacteroides fragilis and Leuconostoc mesenteroides. In particular, the relative abundance of Bacteroides fragilis and Leuconostoc mesenteroides differed significantly between the control and seroconverted groups. Twenty-five differentially abundant species are also DNBs that can facilitate the detection of critical state. We then focused on the T1D ’dark species’ (Supplementary Table S3) that were not differentially abundant in DNBs but sensitive to mNFE score. Although they are generally ignored in traditional differential abundance analysis, many of these species play important roles in T1D progression. Through a literature search, we found that some ’dark species’ are closely related to the process of diabetes. For example, Bifidobacterium adolescentis is effective in relieving diabetes and may be related to its dominant core genome and gut microbiota modulation capacity.Citation45 Moreover, Bifidobacterium adolescentis was also associated with the levels of antibodies GADA and ZnT8A. Recent studies suggest that Parabacteroides distasonis could exert protective effects against certain diseases, including multiple sclerosis, type 2 diabetes, colorectal cancer, and inflammatory bowel disease. Some evidence that Parabacteroides distasonis may be involved in the pathogenesis of diabetes has begun to emerge. But there is a lack of an established consensus on Parabacteroides distasonis’ role in modulating the human gut microbiota and, more importantly, the pathogenicity of the bacterium.Citation46 Similarly, Spearman correlation analysis (p < .05) was performed between ’dark species’ and microbiota KEGG pathways (Supplementary Material, Figure S5). These results showed that, although the ’dark species’ could not be significantly different between the control group and the seroconversion group compared with the T1D case group, they were significantly associated with the development of the T1D.

Connection properties and robustness test of microbial association network

Microbial association network analysis showed that the microbiome in the control group without seroconversion formed a more robust network than the microbiome of the seroconverted and even converted T1D case groups (). We then compared the connectivity and vulnerability of the microbial networks among these groups, i.e., the control group, seroconversion group, and T1D case group. The number of nodes and edges followed order of control > seroconversion > T1D case (). The control group had the highest numbers of edges and nodes, indicating higher connectivity of the control group microbial networks. In addition, to measure the vulnerability of these microbial networks, we simulated system collapse by sequentially removing individual nodes from the network. As shown in , the area under the curve of the control group was the largest, indicating that its association network was the most robust to the simulated disruption. By contrast, the association network in the T1D case group was the most vulnerable. Next, we investigated the connectivity of the microbiome network of each individual under different groups. As shown in , the groups could be ordered control > seroconversion > T1D case on the basis of node number and edge number of association network among the three groups. These two network properties indicated that network connectivity was the strongest in individuals belonging to the control group and the weakest in individuals belonging to the case group.

Figure 6. Analysis of the microbiome association network for each group.

(A–C) The dynamic evolution of microbial association network constructed of T1D among three groups, where gray nodes represent unrelated species that are present in the other two groups, blue nodes represent species present in each corresponding group, and red nodes represent identified DNBs. (D–E) Connectivity properties (node number, edge number) of the association network of each group. (F) Fragility of the association network of each group. The y-axis shows the fragility, which means the percentage of remaining nodes in the giant (largest) component, and the x-axis shows the percentage of nodes/vertices removed. The fragility of each network characterizes the robustness. (G–H) Violin chart reflecting the connectivity property of the association network for each group of subjects, including node number and edge number.
Figure 6. Analysis of the microbiome association network for each group.

Discussion

It is important to characterize the critical states of diseases, such as the pre-onset stage of T1D. Detecting early-warning signals for the critical transition to the disease state may provide appropriate timing for preventing or at least preparing for catastrophic deterioration. The 16S sequence data from stool samples have rich categorical information to facilitate the detection of pre-disease state at different taxonomic levels. However, detecting the critical states of the gut microbiome in T1D is important but also difficult on a single-sample basis. In this paper, we proposed a novel method, i.e., mNFE, to detect the critical state before disease deterioration on the basis of a single sample. Based on the MB method from SPIEC-EASI, we constructed the microbial association network. By calculating the NFE on the sample-specific association network, we evaluated the criticality of each species from its local- or sub-network. Finally, global perturbation brought by each individual sample against a group of given control/reference samples was quantified based on the mNFE method. We found that the mNFE method was able to quantify network fluctuation or collective fluctuation of molecules, which is the key criterion of the tipping point, rather than the random fluctuation of molecules, thereby providing reliable early-warning signals of the critical state.

The numerical simulation validated the robustness of mNFE under different noise levels. With the increase of noise strength, mNFE can still stably provide robust early-warning signals for the upcoming tipping point. Based on real datasets of human gut microbiota, mNFE successfully identified the pre-disease states as well as their DNBs consistently from the different taxonomic levels, thereby providing more accurate and effective early-warning signals during the development of T1D. The diversity analysis and accuracy analysis verified the effectiveness and accuracy of the mNFE method. We found some high-frequency species are closely associated with the diabetes-associated autoantibodies. In addition, we revealed some ‘dark species’ that without differential abundance but were impressible to mNFE score, which also play an important role in the development of T1D. By comparing the connectivity and fragility of microbial association network between the control and case groups, we revealed that the connectivity of the microbial association network in the control group was stronger than that in the case group, but the microbial association network in the case group was more fragile. It is therefore of great potential and application prospect in personalized diagnosis.

Conclusions

In summary, mNFE is an effective and robust method based on a single sample from each individual, which can not only detect the critical state during disease development but also identify the corresponding DNBs. More importantly, mNFE is applicable not only to T1D but also to other diseases. Thus, the proposed mNFE method is of great significance for precision diagnosis and treatment. Moreover, theoretically, any omics data (e.g., metagenomic data or metabolomics data) that can dynamically reflect changes in disease development can be used to detect the critical state or tipping point in a similar way.

Methods

Theoretical background

DNBs are a set of molecules (i.e., genes or proteins) or molecular modules that can reliably detect the early-warning signals of disease state transition.Citation47 Specifically, when a biological system from a stable state approaches a critical state, DNB molecules satisfy three statistical conditionsCitation8: the correlation between any two members in the DNB group sharply increases; the correlation between any one member in the DNB group and any other non-DNB member rapidly decreases; and the standard deviation for any member in the DNB group drastically increases. These are necessary conditions for phase transition in a biological system. Briefly speaking, DNB group is a set of variables with “critical collective fluctuations” (CCF), which can quantify a pre-disease state or “WeiBing” state called in traditional Chinese medicine. CCF principle in terms of variables is also consistent with “critical slowing down” (CSD) in terms of states.Citation5–8,Citation22,Citation23 According to theoretical results, the tipping points or critical states of many complex diseases have been successfully detected.Citation48–50

The mNFE method is based on SPIEC-EASICitation51 and NFE,Citation52,Citation53 which can measure the information content of a network rather than a number of random variables (i.e. Shannon entropy), thus quantifying the potential or function of a network from a system perspective. Based on the microbial profile, a sample-specific association network is built on an individual basis. Then, the NFE is calculated according to the local network information. Finally, the mNFE is used to quantify the perturbation brought by the single sample for local- or sub-network. The global mNFE score at each tipping point is used to detect the advent of critical states and provide early-warning signals of pre-disease states.

Algorithm to detect the tipping point of complex diseases based on mNFE

The mNFE is a model-free and data-driven method based on three statistical conditions detailed above.Citation7,Citation8 By calculating the NFE from these statistical conditions, mNFE provides the early-warning signals of a disease based on a single-sample and the reference samples. Note that the samples from the first two time points or the normal samples are usually taken as the reference samples. We developed the following algorithm to detect the critical state with only a single sample from each individual ().

Figure 7. Schematic illustration of detecting the pre-disease state by using the association network based mNFE.

(A) Outline of analytical study flow. This illustrates the number of serum and stool samples collected at each time-point. Based on the collected 16S samples, we construct phylogenetic trees and extract different taxonomic data to construct the network so as to develop the mNFE algorithm. (B) By extracting the local network from the association network of the reference samples (ARN) and sample-specific association network (SSAN), we calculate the joint probability and conditional probability so as to calculate the microbiome network flow entropy (mNFE).
Figure 7. Schematic illustration of detecting the pre-disease state by using the association network based mNFE.

[step1] construct sample-specific association network

We perform SParse InversE Covariance Estimation for Ecological Association Inference (SPIEC-EASI) to infer a microbial association network, using the neighborhood selection (mb) method with a minimum l threshold of 0.01.Citation54 All steps are computed using the R package SpiecEasi.

To begin with, based on the n reference samples s1,s2,sn, the species taxonomy matrix M includes m rows/species and n columns/samples. We transform matrix M into an association matrix AM based on SPIEC-EASI. Based on this matrix AM, the association network of the reference samples is obtained, i.e., ARN. Then, a case sample is added to the reference samples at each time point t=T, which forms the perturbed/mixed samples s1,s2,,sn,scaseT with the species taxonomy matrix as MT. Similarly, with association matrix AMT, a sample-specific association network of the perturbed/mixed samples is acquired at time point T, i.e.,SSANT.

[step2] extract local network from global network

Each local network Nll=1,,d is extracted from the association network ARN of the n reference samples or the sample-specific association network SSANT of the n + 1 samples, where d and dT are the number of species in the corresponding network. Local network Nlcenters on specie Slii=1,2,,Ml and the first-order neighborhood species are the other Ml1 species except center species.

[step3] calculate the joint probability and conditional probability of species under n samples

For each local network Nll=1,,d, the joint probability pxijl,xkjl of the center specie Slii=1,2,,Ml and neighborhood specie Slkk=1,i1,i+1,,Ml based on n reference samples s1,s2,,sn is defined by

pxijl,xkjl=Dxijl,xkjlk=1Ml1Dxijl,xkjl,

where Dxijl,xkjl is the Aitchison distance of the center specie Slii=1,2,,Ml and neighborhood specie Slkk=1,i1,i+1,,Ml based on n reference samples, calculated as follows:

Dxijl,xkjl=logxijlSxillogxkjlSxkl,

with

Sxil=j=1nxijl1/n,
Sxkl=j=1nxkjl1/n,

where xijl and xkjlare the taxonomies of species Sli and Slkof sample j, respectively.

And the conditional probability pxkjl/xijl of neighborhood specie Slkk=1,i1,i+1,,Ml with respect to center specie Slii=1,2,,Ml is as follows:

pxkjl/xijl=Iiklk=1Ml1Iikl,

where Iikl is the mutual information of the center specie Slii=1,2,,Ml and neighborhood specie Slkk=1,i1,i+1,,Ml under n reference samples.

[step4] calculate network flow entropy (NFE) of species under n samples

For each local network Nll=1,,d, the NFE of species Slii=1,,Ml based on n reference samples is defined as:

NFEnxl=1Ml1(i=1,ikMl1j=1nxijlpxijllogxijlpxijl

with

pxijl=pxijl,xkjlpxkjl/xijl.

[step5] calculate network flow entropy (NFE) of species under n + 1 samples

For each local network Nll=1,,d at a time point T, we repeat the previous processes for the mixed n + 1 samples, and the NFE of species Slii=1,,Ml based on n + 1 samples is as follows:

NFEn+1Txl=1Ml1(i=1,ikMl1j=1n+1xijlpxijllogxijlpxijl

[step6] calculate the microbiome network flow entropy (mNFE)

The average disturbance network flow entropy (NFE) ΔNFET of species is calculated by

ΔNFET=1dTl=1dTNFEn+1Txl1dl=1dNFEnxl

The ∆NFE score at each time point reflects the average perturbation caused by a case sample, it can also be called the Microbiome network flow entropy (mNFE).

Availability of data and materials

Metagenomic sequencing data of T1D33 can be downloaded from https://diabimmune.broadinstitute.org/diabimmune/(NCBI BioProject ID: PRJNA231909). To ensure reproducible results, the original code is available at https://github.com/G-R-0/mNFE.

Supplemental material

Supplementary Table S3.xlsx

Download MS Excel (11 KB)

Supplementary Materials.docx

Download MS Word (31.2 MB)

Supplementary Table S2.xlsx

Download MS Excel (9.9 KB)

Supplementary Table S1.xlsx

Download MS Excel (13.5 KB)

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplementary material

Supplemental data for this article can be accessed online at https://doi.org/10.1080/19490976.2024.2327349

Additional information

Funding

This work was supported by the National Natural Science Foundation of China (Nos. T2350003,T2341007, 31930022, 12131020), National Key R&D Program of China (No. 2022YFA1004800), Strategic Priority Research Program of the Chinese Academy of Sciences (No. XDB38040400), Special Fund for Science and Technology Innovation Strategy of Guangdong Province (Nos 2021B0909050004, 2021B0909060002), the Young Backbone Teacher Funding Scheme of Henan (No. 2019GGJS079), Key Research and Development and Promotion Special Program of Henan Province (No. 212102310988), JST Moonshot R&D (No. JPMJMS2021) and Deutsche Forschungsgemeinschaft (No. 390713860).

References

  • Vatanen T, Franzosa EA, Schwager R, Tripathi S, Arthur TD, Vehik K, Lernmark Å, Hagopian WA, Rewers MJ, She J-X. et al. The human gut microbiome in early-onset type 1 diabetes from the TEDDY study. Nature. 2018;562(7728):589–17. doi:10.1038/s41586-018-0620-2.
  • Zhou H, Zhao X, Sun L, Liu YJ, Lv Y, Gang XK, Wang G. Gut microbiota profile in patients with type 1 diabetes based on 16S rRNA gene sequencing: a systematic review. Dis Markers. 2020;2020:1–11. doi:10.1155/2020/3936247.
  • Catarino D, Silva D, Guiomar J, Ribeiro C, Ruas L, Cardoso L, Paiva I. Non-immune-mediated versus immune-mediated type 1 diabetes: diagnosis and long-term differences—retrospective analysis. Diabetol Metab Syndr. 2020;12(1):56. doi:10.1186/s13098-020-00563-x.
  • Islam ST, Srinivasan S, Craig ME. Environmental determinants of type 1 diabetes: a role for overweight and insulin resistance. J Paediatr Child Health. 2014;50(11):874–879. doi:10.1111/jpc.12616.
  • Liu R, Aihara K, Chen LN. Dynamical network biomarkers for identifying critical transitions and their driving networks of biologic processes. Quant Biol. 2013;1(2):105–114. doi:10.1007/s40484-013-0008-0.
  • Liu R, Wang XD, Aihara K, Chen LN. Early diagnosis of complex diseases by molecular biomarkers. Network biomarkers, and dynamical network biomarker. Med Res Rev. 2013;34(3):455–478. doi:10.1002/med.21293.
  • Liu XP, Liu R, Zhao XM, Chen LN. Detecting early-warning signals of type 1 diabetes and its leading biomolecular networks by dynamical network biomarkers. BMC Med Genomics. 2013;2(Suppl 2):S8. doi:10.1186/1755-8794-6-S2-S8.
  • Chen LN, Liu R, Liu ZP, Li MY, Aihara K. Detecting early-warning signals for sudden deterioration of complex diseases by dynamical network biomarkers. Sci Rep. 2012;2(1):342. doi:10.1038/srep00342.
  • Chen P, Zhong JY, Yang K, Zhang XH, Chen YQ, Liu R. TPD: a web tool for tipping-point detection based on dynamic network biomarker. Brief Bioinform. 2022;23(5):bbac399. doi:10.1093/bib/bbac399.
  • Gao R, Yan JL, Li PL, Chen LN. Detecting the critical states during disease development based on temporal network flow entropy. Brief Bioinform. 2022;23(5):bbac164. doi:10.1093/bib/bbac164.
  • Hu CY, Wong FS, Wen L. Type 1 diabetes and gut microbiota: friend or foe? Pharmacol Res. 2015;98:9–15. doi:10.1016/j.phrs.2015.02.006.
  • Wu HJ, Wu E. The role of gut microbiota in immune homeostasis and autoimmunity. Gut Microbes. 2012;3(1):4–14. doi:10.4161/gmic.19320.
  • Longman RS, Yang Y, Diehl GE, Kim SV, Littman DR. Microbiota: host interactions in mucosal homeostasis and systemic autoimmunity. Cold Spring Harb Symp Quant Biol. 2014;78(0):193–201. doi:10.1101/sqb.2013.78.020081.
  • Chervonsky AV. Microbiota and autoimmunity. Cold Spring Harb Perspect Biol. 2013;5(3):a007294. doi:10.1101/cshperspect.a007294.
  • Atarashi K, Honda K. Microbiota in autoimmunity and tolerance. Curr Opin Immunol. 2011;23(6):761–768. doi:10.1016/j.coi.2011.11.002.
  • Bosi E, Molteni L, Radaelli MG, Folini L, Fermo I, Bazzigaluppi E, Piemonti L, Pastore MR, Paroni R. Increased intestinal permeability precedes clinical onset of type 1 diabetes. Diabetologia. 2006;49(12):2824–2827. doi:10.1007/s00125-006-0465-3.
  • Westerholm-Ormio M, Vaarala O, Pihkala P, Ilonen J, Savilahti E. Immunologic activity in the small intestinal mucosa of pediatric patients with type 1 diabetes. Diabetes. 2003;52(9):2287–95. doi:10.2337/diabetes.52.9.2287.
  • Bendtsen KM, Fisker L, Hansen A, Hansen HF, Nielsen DS. The influence of the young microbiome on inflammatory diseases—lessons from animal studies. Birth Defects Res Part C. 2015;105(4):278–295. doi:10.1002/bdrc.21116.
  • Brown EM, Kenny DJ, Xavier RJ. Gut microbiota regulation of T cells during inflammation and autoimmunity. Annu Rev Immunol. 2019;37(1):599–624. doi:10.1146/annurev-immunol-042718-041841.
  • de Goffau MC, Fuentes S, den Bogert BV, Honkanen H, de Vos WM, Welling GW, Hyöty H, Harmsen HJM. Aberrant gut microbiota composition at the onset of type 1 diabetes in young children. Diabetologia. 2014;57(8):1569–1577. doi:10.1007/s00125-014-3274-0.
  • Mejía‐León ME, Barca AM. Diet, microbiota and immune system in type 1 diabetes development and evolution. Nutrients. 2015;7(11):9171‐9184. doi:10.3390/nu7115461.
  • Liu XP, Liu R, Zhao XM, Chen LN. Detecting early-warning signals of type 1 diabetes and its leading biomolecular networks by dynamical network biomarkers. BMC Med Genomics. 2013;6(Suppl 2):S8. doi:10.1186/1755-8794-6-S2-S8.
  • Li MY, Zeng T, Liu R, Chen LN. Detecting tissue-specific early warning signals for complex diseases based on dynamical network biomarkers: study of type 2 diabetes by cross-tissue analysis. Brief Bioinform. 2013;15(2):229–243. doi:10.1093/bib/bbt027.
  • Yang YK, Tian ZH, Song MY, Ma CX, Ge ZY, Li PL. Detecting the critical states of type 2 diabetes mellitus based on degree matrix network entropy by Cross-Tissue Analysis. Entropy. 2022;24(9):1249. doi:10.3390/e24091249.
  • Liu R, Chen P, Chen LN, Cowen L. Single-sample landscape entropy reveals the imminent phase transition during disease progression. Bioinformatics. 2020;36(5):1522–1532. doi:10.1093/bioinformatics/btz758.
  • Chen P, Li YJ, Liu XP, Liu R, Chen LN. Detecting the tipping points in a three-state model of complex diseases by temporal differential networks. J Transl Med. 2017;15(1):217. doi:10.1186/s12967-017-1320-7.
  • Liu R, Zhong JY, Yu XT, Li YJ, Chen P. Identifying critical state of complex diseases by single-sample-based hidden Markov model. Front Genet. 2019;10:285. doi:10.3389/fgene.2019.00285.
  • Srinivasan B. A guide to the Michaelis–menten equation: steady state and beyond. FEBS J. 2022;289(20):6086–6098. doi:10.1111/febs.16124.
  • Singh A, Malik P. Bifurcations in a modified Leslie–Gower predator–prey discrete model with Michaelis–menten prey harvesting. J Appl Math Comput. 2021;67(1–2):143–174. doi:10.1007/s12190-020-01491-9.
  • Khanin R, Vinciotti V, Mersinias V, Smith CP, Wit E. Statistical reconstruction of transcription factor activity using Michaelis–menten kinetics. Biometrics. 2007;63(3):816–823. doi:10.1111/j.1541-0420.2007.00757.x.
  • Ronen M, Rosenberg R, Shraiman BI, Uri A. Assigning numbers to the arrows: parameterizing a gene regulation network by using accurate expression kinetics. Proc Natl Acad Sci USA. 2002;99(16):10555–10560. doi:10.1073/pnas.152046799.
  • Chen LN, Wang RQ, Li CG, Aihara K. Modeling biomolecular networks in cells: structures and dynamics. Net Cells: Struct Dyn. 2010. doi:10.1007/978-1-84996-214-8.
  • Aleksandar DK, Gevers D, Siljander H, Tommi V, Tuulia H, Anu MH, Peet A, Tillmann V, Pöhö P, Mattila I, et al. The dynamics of the human infant gut microbiome in development and in progression toward type 1 diabetes. Cell Host Microbe. 2015;17(2):260–273. doi:10.1016/j.chom.2015.01.001.
  • Sun RX, Huang WJ, Xiao Y, Wang DD, Mu GH, Nan H, Ni B-R, Huang X-Q, Wang H-C, Liu Y-F. et al. Shenlian (SL) decoction, a Traditional Chinese Medicine Compound, May ameliorate blood glucose via mediating the gut microbiota in db/db mice. J Diabetes Res. 2022;2022:1–20. doi:10.1155/2022/7802107.
  • Langille MGI, Zaneveld J, Caporaso JG, McDonald D, Knights D, Reyes JA, Clemente JC, Burkepile DE, Vega Thurber RL, Knight R. et al. Predictive functional profiling of microbial communities using 16S rRNA marker gene sequences. Nat Biotechnol. 2013;31(9):814–821. doi:10.1038/nbt.2676.
  • Guasch-Ferré M, Santos JL, Martínez-González MA, Clish CB, Razquin C, Wang D, Liang L, Li J, Dennis C, Corella D. et al. Glycolysis/gluconeogenesis- and tricarboxylic acid cycle–related metabolites, Mediterranean diet, and type 2 diabetes. Am J Clin Nutr. 2020;111(4):835–844. doi:10.1093/ajcn/nqaa016.
  • Jin HY, Zhu BJ, Liu X, Jin J, Zou HD. Metabolic characterization of diabetic retinopathy: an 1H-NMR-based metabolomic approach using human aqueous humor. J Pharm Biomed Anal. 2019;174:414–421. doi:10.1016/j.jpba.2019.06.013.
  • Srinuanchai W, Nooin R, Pitchakarn P, Karinchai J, Suttisansanee U, Chansriniyom C, Jarussophon S, Temviriyanukul P, Nuchuchua O. Inhibitory effects of gymnema inodorum (Lour.) decne leaf extracts and its triterpene saponin on carbohydrate digestion and intestinal glucose absorption. J Ethnopharmacol. 2021;266:113398. doi:10.1016/j.jep.2020.113398.
  • Tilg H, Moschen AR. Adipocytokines: mediators linking adipose tissue, inflammation and immunity. Nat Rev Immunol. 2006;6(10):772–783. doi:10.1038/nri1937.
  • Samraj PI, Raj BS, Jeffrey S. Therapeutic benefits of alpha-lipoic acid supplementation in diabetes mellitus: a narrative review. J Diet Suppl. 2022;19(4):566–586. doi:10.1080/19390211.2021.2020387.
  • Xi Y, Xu PF. Diabetes and gut microbiota. World J Diabetes. 2021;12(10):1693–1703. doi:10.4239/wjd.v12.i10.1693.
  • Yang JY, Lee YS, Kim Y, Lee SH, Ryu S, Fukuda S, Hase K, Yang C-S, Lim HS, Kim M-S. et al. Gut commensal Bacteroides acidifaciens prevents obesity and improves insulin sensitivity in mice. Mucosal Immunol. 2017;10(1):104–116. doi:10.1038/mi.2016.42.
  • Sun Y, Nie Q, Zhang S, He H, Zuo S, Chen C, Yang J, Chen H, Hu J, Li S. et al. Parabacteroides distasonis ameliorates insulin resistance via activation of intestinal GPR109a. Nat Commun. 2023;14(1):7740. doi:10.1038/s41467-023-43622-3.
  • Sousa AP, Cunha DM, Franco C, Teixeira C, Gojon F, Baylina P, Fernandes R. Which role plays 2-hydroxybutyric acid on insulin resistance? Metabolites. 2021;11(12):835. doi:10.3390/metabo11120835.
  • Qian X, Si Q, Lin GP, Zhu MM, Lu JY, Zhang H, Wang G, Chen W. Bifidobacterium adolescentis is effective in relieving type 2 diabetes and May Be related to its dominant core genome and gut microbiota modulation capacity. Nutrients. 2022;14(12):2479. doi:10.3390/nu14122479.
  • Ezeji JC, Sarikonda DK, Hopperton A, Erkkila H, Cohen DE, Martinez SP, Cominelli F, Kuwahara T, Dichosa AEK, Good CE. et al. Parabacteroides distasonis: intriguing aerotolerant gut anaerobe with emerging antimicrobial resistance and pathogenic and probiotic roles in human health. Gut Microbes. 2021;13(1):1922241. doi:10.1080/19490976.2021.1922241.
  • Liu XP, Chang X, Leng SY, Tang H, Aihara K, Chen LN. Detection for disease tipping points by landscape dynamic network biomarkers. Natl Sci Rev. 2019;6(4):775–785. doi:10.1093/nsr/nwy162.
  • Zhong JY, Han CY, Wang YK, Chen P, Liu R, Valencia A. Identifying the critical state of complex biological systems by the directed-network rank score method. Bioinformatics. 2022;38(24):5398–5405. doi:10.1093/bioinformatics/btac707.
  • Peng H, Zhong JY, Chen P, Liu R. Identifying the critical states of complex diseases by the dynamic change of multivariate distribution. Brief Bioinform. 2022;23(5):bbac177. doi:10.1093/bib/bbac177.
  • Sun YC, Zhao HQ, Wu M, Xu JH, Zhu SS, Gao J. Identifying critical states of hepatocellular carcinoma based on landscape dynamic network biomarkers. Comput Biol Chem. 2020;85:107202. doi:10.1016/j.compbiolchem.2020.107202.
  • Kurtz ZD, Müller CL, Miraldi ER, Littman DR, Blaser MJ, Bonneau RA, von Mering C. Sparse and compositionally robust inference of microbial ecological networks. PLoS Comput Biol. 2015;11(5):e1004226. doi:10.1371/journal.pcbi.1004226.
  • Shi JF, Teschendorff AE, Chen WY, Chen LN, Li TJ. Quantifying Waddington’s epigenetic landscape: a comparison of single cell potency measures. Brief Bioinform. 2018;21:248–261.
  • Li L, Dai H, Fang ZY, Chen LN. c-CSN: single-cell RNA sequencing data analysis by conditional cell-specific network. Genomics, Proteomics Bioinf. 2021;19(2):319–329. doi:10.1016/j.gpb.2020.05.005.
  • Huang N, Mi TY, Xu SM, Dadd T, Ye X, Chen GQ, Du Y, Santhanam U. Traffic derived air pollution compromises skin barrier function and stratum corneum redox status: a population study. J Cosmet Dermatol. 2019;19(7):1751–1759. doi:10.1111/jocd.13194.