Search in:

Diabetes, Metabolic Syndrome and Obesity Volume 15, 2022 - Issue

Submit an article Journal homepage

Open access

245

Views

CrossRef citations to date

Altmetric

Listen

ORIGINAL RESEARCH

Development and Validation of a Simple Risk Model for Predicting Metabolic Syndrome (MetS) in Midlife: A Cohort Study

Musa S IbrahimInstitute for Health Research, University of Bedfordshire, Putteridge Bury Luton, Bedfordshire, LU2 8LE, EnglandCorrespondence[email protected]

https://orcid.org/0000-0002-3902-6895

Dong PangInstitute for Health Research, University of Bedfordshire, Putteridge Bury Luton, Bedfordshire, LU2 8LE, England

Gurch RandhawaInstitute for Health Research, University of Bedfordshire, Putteridge Bury Luton, Bedfordshire, LU2 8LE, England

Yannis PappasInstitute for Health Research, University of Bedfordshire, Putteridge Bury Luton, Bedfordshire, LU2 8LE, England

https://orcid.org/0000-0003-3087-860X

Pages 1051-1075 | Published online: 06 Apr 2022

Cite this article
CrossMark

In this article

Introduction
Subjects and Methods
Results
Discussion
Conclusion
References

Full Article
Figures & data
References
Citations
Metrics
Licensing
Reprints & Permissions
View PDF PDF View EPUB EPUB

Abstract

Purpose

To develop and validate a simple risk model for predicting metabolic syndrome in midlife using a prospective cohort data.

Design

Prospective cohort study.

Participants

A total of 7626 members of the 1958 British birth cohort (individuals born in the first week of March 1958) participated in the biomedical survey at age 45 and have completed information on metabolic syndrome.

Methods

Variables utilised were obtained prospectively at birth, 7, 16, 23 and 45 years. Multivariable logistic regression was used to develop a total of ten (10) MetS risk prediction models taking the life course approach. Measures of discrimination and calibration were used to evaluate the performance of the models. A pragmatic criteria developed was used to select one model with the most potential to be useful. The internal validity (overfitting) of the selected model was assessed using bootstrap technique of Stata.

Main Outcome Measure

Metabolic syndrome was defined based on the NCEP-ATP III clinical criteria.

Results

There is high prevalence of MetS among the cohort members (19.6%), with males having higher risk as compared to females (22.8% vs 16.4%, P < 0.001). Individuals with MetS are more likely to have higher levels of HbA1c and low HDL-cholesterol. Similarly, regarding the individual components of MetS, male cohort members are more likely to have higher levels of glycaemia (HbA1c), BP and serum triglycerides. In contrast, female cohort members have lower levels of HDL-cholesterol and higher levels of waist circumference. Furthermore, a total of ten (10) MetS risk prediction models were developed taking the life course approach. Of these, one model with the most potential to be applied in practical setting was selected. The model has good accuracy (AUROC 0.91 (0.90, 0.92)), is well calibrated (Hosmer-Lemeshow 6.47 (0.595)) and has good internal validity.

Conclusion

Early life factors could be included in a risk model to predict MetS in midlife. The developed model has been shown to be accurate and has good internal validity. Therefore, interventions targeting socioeconomic inequality could help in the wider prevention of MetS. However, the validity of the developed model needs to be further established in an external population.

Keywords:

metabolic syndrome
prediction model
risk score
development and validation of risk model
1958 British birth cohort
national child development study

Introduction

Worldwide, the prevalence of Metabolic Syndrome (MetS) - “a clustering of risk factors which includes hypertension, central obesity, impaired glucose metabolism with insulin resistance, and dyslipidemia”^Citation1 is high and rising, with nearly one in four adults affected.^Citation2 The diagnosis of MetS increases an individual’s risk of developing CVD by two to threefold, T2DM by fivefold and all-cause mortality by twofold.^Citation3 Furthermore, MetS is recognised as a useful tool for identifying individuals at increased risk of CVD and T2DM. Hence, the WHO stress that it should be viewed as a premorbid state of CVD and T2DM rather than a distinct clinical disease.^Citation4

There is increased popularity in the use of risk prediction models in public health/ clinical practice, partly due to availability of large datasets, advanced statistical methods and computational power.^Citation5 Furthermore, abdominal obesity is generally presumed to be central to the diagnosis of MetS, but, not all individuals with obesity have MetS and vice versa.^Citation6 Without a valid test, it may not be possible to detect or suspect MetS in non-obese individuals.^Citation7 Identifying the subset of individuals (both obese and non-obese) with MetS is necessary and could have implications on subsequent management approach.^Citation7 Thus, an accurate and reliable MetS prediction model could be used in screening individuals at increased risk of MetS. Focusing on assessing risk of MetS provides a single unifying theme which could enable clinicians to identify an individual’s global risk of CVD and T2DM (holistic approach).^Citation8 Therefore, both from clinical and public health perspective, early identification and control of MetS is of great significance, as it could result in the reduction of T2DM and CVD related morbidity and mortality.

Previous evidence suggests that the risk of MetS starts early on in life^{Citation9,Citation10} and persists into adulthood.^Citation11 This makes it appropriate to use life course approach in the studies of MetS.^Citation12 Early life risk factors are as important in the development of MetS as adult risk factors. In order to effectively prevent and manage MetS, an approach which is holistic and equitable should be employed, focusing on risk factors (both biological and psychosocial) acquired at different stages of life.^Citation13 Therefore, the purpose of this study is to develop and validate a simple prediction model taking a life course approach using a prospective birth cohort data to forecast risk of MetS in midlife.

Subjects and Methods

Participants

The 1958 British birth cohort, also known as the National Child Development Study (NCDS). The full details of the cohort are previously provided.^Citation14 Briefly, the cohort consists of 18,558 individuals born in the first week of March 1958 in Britain. It was originally set-up to investigate the factors (both obstetric and social) responsible for the high perinatal mortality (stillbirth rate). In subsequent years, the NCDS was adjusted and used for monitoring how the members are developing (educationally, physically and socially) as they grow from infancy to adulthood. Since its inception, regular prospective follow-ups have been conducted both in childhood and adulthood.

Between the years 2002 and 2003 (cohort members aged 44/45 years), follow up was conducted in a form of a biomedical survey. The main purpose of the survey was to objectively collect measures of disease and biomedical risk factors to address a wide range of specific hypotheses relating to anthropometry, cardiovascular system, respiratory system and allergic diseases, visual and hearing impairment, and mental health. This survey was carried out by registered research nurses who visited cohort members at their homes or specified research clinics within that period. A total of 9377 cohort members participated in the survey. However, only 8585 participants consented for all the four parts of the investigation.

In terms of demography, the 1958 British birth cohort is predominantly white (typical representation of Great Britain at the beginning of the cohort). Nevertheless, the sample has been shown to represent the national population concerning many socioeconomic characteristics during follow-ups at 33 years as well as the 45 year survey.^Citation15

The 1958 British birth cohort data is managed by the United Kingdom Data Service, based at the University of Essex, UK. We accessed the anonymized data after obtaining a formal permission from them. In the same vein, the ethical approval for this study was granted by Institute for Health Research Ethics Committee, University of Bedfordshire.

Variables

Predictors

This study utilised predictors that are well established in the literature and clinically relevant.

Sociodemographic Characteristics

The main demographic variables considered are gender and socioeconomic status.

Socioeconomic conditions/Social class: participants’ childhood social class is defined according to registrar general’s classification of the father’s occupation in 1958 (OPCS, 1987). Where this information is not available, the father’s occupation at age 7 is used. Furthermore, cohort members’ socioeconomic conditions at age 21/23 years and 33/ 45 years were used.^Citation12 Categorisation is same as the social categories at birth (I–V). Based on the above, a binary variable of social class was generated as (Non-manual vs manual).^Citation12

Early Life Predictors

Early-life factors were all recorded prospectively at birth (birth weight, gestational age and childhood social class) or at age 7 (BMI at 7 years, household crowding and family history of T2DM).

Birth weight: The study participants’ birth weight was recorded in pounds and ounces. Before analysis, this was converted to kilograms (kg) and categorised into quartiles.

Gestational age: participants’ gestational age as reported by mothers was recorded in days. This was converted to weeks and gestational age less than 38 weeks is defined as “Preterm”.^Citation16

Family history of T2DM: this was elicited during the survey conducted at age 7 with response Yes or No. In this analysis, family history of T2DM is treated as a binary variable.

BMI at 7 years: Participants’ weight (kg) and height (cm) measured during the follow-up at age 7 were used to compute BMI (kg/m2) at 7.

Household overcrowding: was assessed by determining the ratio of the number of individuals by the total number of rooms in a given house. Household having ≥1.5 persons per room is considered overcrowded.^Citation12

Adolescence/Early Adulthood Factors (16 and 23 Years)

BMI at 16 and 23 years: Participants’ weight (kg) and height (cm) measured during the follow-up at ages 16 and 23 years were used to compute BMI (kg/m²) for the respective ages. BMI was classified based on standard adult categories as; underweight <18.5 kg/m², normal 18.5–24.9, overweight 25–29.9 and obese ≥30 kg/m².

Socioeconomic status at 16 and 23 years: Same as above.

Adulthood Predictors (45 Years)

Blood pressure, HDL-cholesterol, triglycerides, waist circumference and high HbA1c, are the adult predictors considered in this study.^{Citation2,Citation17–20} See below for description.

Outcome Variables

In this study, the primary outcome of interest is MetS defined based on the NCEP-ATP III clinical criteria. Information regarding the individual components of MetS was collected during the NCDS biomedical survey (when cohort members were 43–45 years) by a trained registered research nurse, using standard protocols. However, because fasting plasma glucose (FPG) was not recorded in the biomedical survey, glycosylated haemoglobin (HbA1c) was used as a proxy variable.^{Citation12,Citation21,Citation22} MetS is identified at the age of 43–45 years old if three or more of the following occurred:

Blood Pressure

Blood pressure was measured in a quiet room using an Omron 705CP automated sphygmomanometer (Omron, Tokyo, Japan). A hypertension variable was generated as systolic blood pressure of 130 mmHg or higher, or diastolic blood pressure of 85 mmHg or higher. Hypertension was assessed as a binary outcome, using the above definition.

Blood Glucose (Glycated Haemoglobin (HbA1c)

In the biomedical survey, HbA1c was investigated using ion-exchange high-performance liquid chromatography (HLC-723GHbA1c 2.2; Tosoh Corp, Tokyo Japan).This technique has been adjudged to be standard and reproducible.^{Citation23,Citation24} In this analysis, high blood glucose variable was derived based on the data of glycated hemoglobin at age 45 years. Noteworthy, HbA1c ≥6.5% has been shown to be a good predictor of diabetes in previous modelling studies.^{Citation25,Citation26} But for the purpose of identifying MetS, a lower cut-off point of HbA1c ≥6.0% (which corresponds to pre- diabetes)^{Citation21,Citation22} is usually considered. Therefore, the cut-off point of HbA1c ≥6.0% is used in this study. High blood glucose was treated as a binary variable during further analysis.

HDL-Cholesterol

Enzymatic methods with an autoanalyser (Olympus AU640, Japan) were used to measure High-density lipoprotein cholesterol (HDL-C). Based on the data available for HDL-cholesterol, a low HDL cholesterol variable was generated. Low HDL-cholesterol was defined as <0.40 g/L for men and <0.50 g/L for women (NCEP, 2001) and was treated as a binary outcome variable during further statistical analysis.

Triglycerides

Triglycerides were also measured using enzymatic methods with an autoanalyser (Olympus AU640, Japan). Triglycerides level of ≥150 mg/dL was used as the cut-off level in this study and was treated as a binary outcome variable during further statistical analysis.

Abdominal Obesity

During the survey, waist circumference (WC) was measured using a tension tape at an imaginary meeting point between the lower rib and upper part of pelvis, in the mid-axillary line.^Citation27 During the procedure, participants were requested to be in a loose dress with the belt removed and relax their muscles of abdomen by breathing in and out. WC was measured and recorded to the nearest 1 centimeter (cm). In this study, abdominal obesity variable was generated as WC ≥102 cm for men or ≥88 cm for women.^Citation1

Covariates

The potential confounders included in this study were selected a priori based on the literature.

Early Life

Early life potential socioeconomic/ psychosocial stress: Proxy variables on birth and perinatal conditions were chosen to capture the heading and subsequent MetS. These include:

Maternal smoking during pregnancy: (no smoking, variable smoker, moderate smoker, heavy smoker),

Type of delivery: (vaginal, emergency caesarean, elective caesarean),

Mother’s parity: number of children previously borne by a mother in 1958,

Mother’s age at birth (23 years or less, 24 to 27 years, 28 to 31 years, and 32 years or more)

Foetal distress: No distress, Yes distress

Breastfed: collected at 7 years (no, yes for less than 1 month, yes more than 1 month).^Citation12

Adult Health Behaviours/Lifestyle

During the biomedical survey, information concerning the above mentioned was collected from the cohort members through the means of the computer-assisted self-administered interview (CASI).

Smoking status: is categorised as non-smoker (individuals who have never smoked), ex-smoker (smoked one or more cigarette per day in the past, but have currently stopped smoking) or current smoker (smoke one or more cigarettes per day).

Drinking status: is coded as non-drinker, 1–2 drinks per day, 3–4 drinks per day, 5–6 drinks per day, and ≥7 drinks per day.

Physical activity: level of physical activity is defined as

highly active (vigorous exercise at least once per week), moderately active (moderate exercise at least once per week) and not active (hardly ever/never moderate-vigorous exercise).^Citation12

BMI at 45 years: classified as; underweight <18.5 kg/m², normal 18.5–24.9, overweight 25–29.9 and obese ≥30 kg/m².

Statistical Methods

Descriptive Statistics

The study uses the information of 7626 members of the 1958 British birth cohort who have complete data regarding all components of MetS at 45 years. Chi-square test of association was initially conducted in order to test the association between MetS, its individual components, the selected predictors and confounders. The level of significance was set at P= 0.05.

Model Development

Variables Selection

Variables included in the logistic regression models were selected a priori based on the literature and clinical/public health relevance. Further, their significance to the understanding of the life course origin of MetS was also considered.

Model Specification

Using the stepwise logistic regression technique, various combination of the predictors was used to build models starting with gender and father’s social class at birth. Variables were included sequentially taking a life course approach. No pre-specified inclusion criteria were set. The only parameters of interest assessed are the stability and fit of the produced models. A total of 10 models with good fit and stability were initially selected for further assessment of performance. The analysis was performed both on complete cases and multiply imputed data.

Predictive Performance of the Developed Models

For the purpose of this study, the performance of the prediction models was assessed using the measures of discrimination and calibration.

Discrimination

This refers to the ability of a predictive risk model to distinguish participants that will develop the disease in context from those that will not.^Citation28 This is often assessed using sensitivity, specificity, and the Receiver Operator Characteristic (ROC) curve.^Citation28

In this study discrimination is assessed using measures of sensitivity, specificity and AUROC.

Calibration

Calibration is a statistical measure that evaluates whether what is being predicted by a risk model appears to be close to what is observed in reality over time.^Citation28 It is a further test of a model’s predictive power and is usually assessed along with discrimination.

In this study calibration is assessed using Hosmer–Lemeshow goodness-of-fit - estimates using deciles as well as calibration plots.

Missing Values

Missing values were imputed by multiple imputations by chain equations (MICE) method using the imputation by chained equation (ICE) programme of Stata^Citation29 in order to account for missing data. The final imputed datasets were based on m=10 imputations.

Selecting the Model with the Most Potential to Be Useful

Given the large number of models generated following the stepwise logistic regression, it is necessary to select one (with the most potentials to be used in a real-life setting). Central to this decision is the simplicity and accuracy of the model. To develop risk models which are clinically useful, the models’ simplicity and reliability of measurements are vital criteria.^{Citation30,Citation31} Complex models generated through extensive variable selection often lead to over-optimistic predictions.^Citation32 Thus, for any prediction model or risk score to be considered useful, it should be accurate (statistically significant calibration, and discrimination above 0.70), generalisable (externally validated by a separate research team on a different population) and usable (has few components that are commonly used in practical setting).^Citation33

Therefore, to prioritize risk models or scores at this stage of the analysis, pragmatic criteria were developed by the research team by modifying the criteria set by Wyatt and Altman^Citation30 and Altman et al.^Citation33 MetS risk prediction model is favored if it is accurate (has discrimination above 0.70 and statistically significant calibration), simple to use (contains few predictors that may not be difficult to obtain in a routine clinical setting). In addition to the above, the model should contain no more than two adult predictors. Finally, the model should contain predictors that can improve equity in the prevention of MetS.^Citation34

Adjusting for Potential Confounders

Potential confounders were added to the selected model in sequence; starting with those collected around birth (early life).

Internal Validity of the Selected Model

A random sample was drawn with replacement using the bootstrap command of Stata (Stata, 2015) with 1000 reps in order to test the internal validity of the selected model.

The summary of the stages involved in developing and internally validating the MetS risk prediction model is provided below:

Variables selected a priori based on the literature, clinical/ public health relevance and their significance to the understanding of the life course origin of MetS.
Multiple logistic regression models created using various combinations of early life predictors and adulthood predictors (life course approach).
Ten (10) stable models with good fit were selected for further assessment of performance.
Models’ performance was assessed using discrimination and calibration.
Model with the most potentials to be useful was selected after applying the model selection criteria.
Potential confounders were adjusted for sequentially.
Internal validity of the selected model was assessed using the bootstrapping technique of Stata.

Results

Sample Characteristics

above shows the summary characteristic of the studied sample in respect to MetS.

Development and Validation of a Simple Risk Model for Predicting Metabolic Syndrome (MetS) in Midlife: A Cohort Study

Abstract

Purpose

Design

Participants

Methods

Main Outcome Measure

Results

Conclusion

Introduction

Subjects and Methods

Participants

Variables

Predictors

Sociodemographic Characteristics

Early Life Predictors

Adolescence/Early Adulthood Factors (16 and 23 Years)

Adulthood Predictors (45 Years)

Outcome Variables

Blood Pressure

Blood Glucose (Glycated Haemoglobin (HbA1c)

HDL-Cholesterol

Triglycerides

Abdominal Obesity

Covariates

Early Life

Adult Health Behaviours/Lifestyle

Statistical Methods

Descriptive Statistics

Model Development

Variables Selection

Model Specification

Predictive Performance of the Developed Models

Discrimination

Calibration

Missing Values

Selecting the Model with the Most Potential to Be Useful

Adjusting for Potential Confounders

Internal Validity of the Selected Model

Results

Sample Characteristics

Table 1 Sample Characteristics

Table 2 Gender Differences in the Prevalence in the Individual Components of MetS

Developing MetS Risk Prediction Models

Table 3 Model 1 (MetS Prediction Model Consisting of Gender, Father’s Social Class at Birth and High Waist Circumference)

Table 4 Model 2 (MetS Prediction Model Consisting of Gender, Father’s Social Class at Birth, High Waist Circumference and HbA1c ≥6.0)

Table 5 Model 3 (MetS Prediction Model Consisting of Gender, Father’s Social Class at Birth, BMI at 7, HbA1c ≥6.0 and Low HDL-Cholesterol)

Table 6 Model 4 (MetS Prediction Model Consisting of Gender, Father’s Social Class at Birth, BMI at 7, HbA1c ≥6.0, Hypertension and Overcrowding)

Table 7 Model 5 (MetS Prediction Model Consisting of Gender, Father’s Social Class at Birth, BMI at 7, Hypertension and HDL-Cholesterol)

Table 8 Model 6 (MetS Prediction Model Consisting of Gender, Father’s Social Class at Birth, BMI at 7, HbA1c ≥6.0, Hypertension, High Serum Triglycerides and Overcrowding)

Table 9 Model 7 (MetS Prediction Model Consisting of Gender, Father’s Social Class at Birth, BMI at 7, Hypertension and High Serum Triglycerides)

Table 10 Model 8 (MetS Prediction Model Consisting of Father’s Social Class at Birth, High Waist Circumference and Hypertension)

Table 11 Model 9 (MetS Prediction Model Consisting of Gender, Father’s Social Class at Birth, Family History of T2DM and High Waist Circumference)

Table 12 Model 10 (MetS Prediction Model Consisting of Gender, BMI at 23 and High Waist Circumference)

Performance of the Developed Models

Models Discrimination

Models Calibration

Table 13 Summary of the Performance Parameters of the Ten (10) MetS Models Developed

Selecting the Model with the Most Potentials to Be Useful

Adjusting for Potential Confounders

Table 14 Selected Model (Model 7) Adjusted for Confounders

Internal Validity of the Selected Model (Model 7)

Table 15 Selected Model (Model 7) Applied on a Random Bootstrap Sample

Table 16 Summary Performance of the Selected Model (Model 7) Applied on a Random Bootstrap Sample

Model Equation

Discussion

Main Findings

Strengths and Weaknesses of the Analysis

Conclusion

Disclosure

References

Related research

To cite this article:

Download citation

Your download is now in progress and you may close this window

Login or register to access this feature

Information for

Open access

Opportunities

Help and information