1,072
Views
0
CrossRef citations to date
0
Altmetric
Introduction

Introduction to the special issue on Data Science for COVID-19

&
Pages 555-569 | Received 18 Jul 2022, Published online: 06 Sep 2022

Abstract

An introduction to this Special Issue on Data Science for COVID-19 is included in this paper. It contains a general overview about methods and applications of nonparametric inference and other flexible data science methods for the COVID-19 pandemic. Specifically, some methods existing before the COVID-19 outbreak are surveyed, followed by an account of survival analysis methods for COVID-related times. Then, several nonparametric tools for the estimation of certain COVID rates are revised, along with the forecasting of most relevant series counts, and some other related problems. Within this setup, the papers published in this special issue are briefly commented in this introductory article.

1. Introduction

Over the past decades, nonparametric statistical inference has been used to model important features in epidemiology. It has been also used for data analysis related to the evolution and side effects of pandemics. In the case of the current COVID-19 pandemic, a huge amount of data-analytical procedures have been proposed or used over the past three years.

Section 2 deals with some nonparametric statistical methods for epidemiology proposed and used before COVID-19. A short overview of nonparametric procedures proposed for COVID-19 is also included in the rest of the paper. Section 3 deals with survival analysis methods proposed or used for the study of the distribution of relevant times related to COVID-19: incubation period, time until hospitalisation, length of stay in hospital, as well as COVID-19 infection of a patient and survival time for patients. Nonparametric estimation of recovery rates and in-hospital mortality rates are also considered in this section. Nonparametric estimation of COVID-19 incidence, infection rate, transmission rate and mortality rate are considered in Section 4. Forecasting methods for COVID-19 cases, hospitalisations and deaths are presented in Section 5, while Section 6 deals with some relevant classification problems related to COVID-19. Section 7 deals with several nonparametric statistical procedures proposed for other setups related to the COVID-19 pandemic. Finally, Section 8 describes in more detail the contributions of this Special Issue.

2. Some pre-COVID-19 nonparametric statistical methods in epidemiology

Survival analysis methods, including censored and truncated data, have been widely applied in Biomedicine, and specifically in Epidemiology. Rao, Talwaker, and Kundu (Citation1991) proposed confidence intervals for the relative risk ratio under random censorship. Hoover et al. (Citation1993) proposed using events from dropouts to construct a nonparametric survival function estimate for interval-censored and left-truncated data and they applied these methods to the incubation time of AIDS. Gauderman and Thomas (Citation1994) proposed some methods for the analysis of diseases with variable age at onset. They presented a modification of the Cox proportional hazards model to include both measured (environmental) covariates and latent (genetic) variables, as well as their interactions. Three right censoring strategies were evaluated in Van Benthem et al. (Citation1997) to model AIDS incubation time. In the same setup Geskus (Citation2001) presents some methods for estimating the AIDS incubation time distribution when the date of seroconversion is censored. A method for assessing dose-response effects from case-control and cohort studies with interval-censored exposure information is presented in Cook, Brumback, Wigg, and Ryan (Citation2001). Cure models are used in Stoltenbere, Nordeng, Ystrom, and Samuelsen (Citation2020) in the context of perinatal epidemiology. They study the relation between a gestational exposure to paracetamol and attention-deficit hyperactivity disorder, a condition that can only be ascertained after several years. Recurrent events in observational studies are considered in Hernández-Herrera, Morina, and Navarro (Citation2022). Since it is common to include subjects who became at risk before follow-up, this leads to left censoring, and simply ignoring these prior episodes can lead to biased and inefficient estimates. The authors proposed statistical methods in this context, that are included in the R package miRecSurv.

Statistical methods original from other fields have been also used for Epidemiology. For instance, Liu, Yue, Lai, Huang, and Zhang (Citation2019) discussed the EWMA control chart based on rank methods for a multivariate process, and showed that the proposed method is efficient in detecting shifts for multivariate processes, by applying it to Japanese influenza data.

The kernel method for curve estimation has been used in epidemiology over the past few years. Adaptive and fixed bandwidth-based kernel density estimates are used in Lemke, Mattauch, Heidinger, Pebesma, and Hense (Citation2015) for spatial cancer epidemiology. A kernel weighting approach has been used by Wang, Graubard, Katki, and Li (Citation2020) to improve the external validity of epidemiologic cohort analyses. Kernel density estimation and approximate Bayesian computation have been also used in Irvine and Hollingsworth (Citation2018) to produce flexible epidemiological model fitting.

The Bayesian paradigm was also present in several other papers using nonparametric techniques for epidemiology. The paper (Xu, Kypraios, and O'Neill Citation2016) develops a Bayesian nonparametric approach using Gaussian Processes to estimate the infection process in an epidemic. A specific problem of estimating a disease risk from highly correlated environmental exposure covariates and a censored survival outcome is considered in Belloni, Laurent, Guihenneuc, and Ancelet (Citation2020). Bayesian profile regression mixture models were extended by assuming an instantaneous excess hazard ratio disease sub-model. A specific adaptive Metropolis-Within-Gibbs algorithm-including label switching moves was implemented to infer the model. This nonparametric Bayesian approach was applied to the estimation of the risk of death by lung cancer in a cohort of French uranium miners. Nonparametric Bayesian meta-regression, incorporating functional meta-predictors and spatial dependency was proposed by Yu et al. (Citation2021) in the context of environmental epidemiology. The method was applied to a temperature-mortality association study and to a suicide seasonality study. A Bayesian penalised B-spline estimation approach was proposed in Meng and Tao (Citation2017) to estimate the parameters and initial values for ordinary differential equations used in epidemiology. The approach was applied to the transmission dynamics of hepatitis C virus in mainland China.

Splines and penalised splines have been also considered in many more papers. For instance, Bergen, Sheppard, Kaufman, and Szpiro (Citation2016) considered measurement errors in air pollution epidemiology studies. They developed analytic bias correction methods when using penalised regression splines to predict exposure. The method is applied to analyse the association of systolic blood pressure with levels of two pollutants, PM2.5 and NO2. In the presence of unmeasured spatial confounding, Bobb et al. (Citation2022) studied the bias increase. These authors proposed an exposure-penalised spline approach that selects the degree of spatial smoothing to explain spatial variability in the exposure. Confounder control is also considered in Howe et al. (Citation2011), where splines are also used for trend analysis. Smoothing splines are proposed in semiparametric ANOVA models in Wahba, Wang, Gu, Klein, and Klein (Citation1995). The methods are applied to the data from the Wisconsin Epidemiologic Study of Diabetic Retinopathy to model the risk of progression of diabetic retinopathy as a function of glycosylated haemoglobin, duration of diabetes and body mass index.

Nonparametric and semiparametric forecasting is of primary interest in Epidemiology. Nonparametric estimation of time-dependent transmission rates using SEIR models are proposed in Smirnova, de Camp, and Chowell (Citation2019). The method is illustrated using case incidence data for various epidemics including the 1918 influenza pandemic in San Francisco and the 2014–2015 Ebola epidemic in West Africa. Forecasting methods originally proposed for epidemics have also been used to predict other social indicators. For instance, Cortes, Sánchez-Sánchez, Santonja, and Villanueva (Citation2013) considered nonparametric probabilistic forecasting of academic performance in Spanish high school using ordinary differential equations coming from epidemiological modelling. A bootstrap approach is employed for forecasting.

Some other relevant problems in Epidemiology have been addressed using nonparametric methods (Savilov and Astafyev Citation1987, among others). Nonparametric exact tests based on the number of records for early detection of emerging events in epidemiology have been proposed in Khraibani, Jacob, Ducrot, Charras-Garrido, and Sala (Citation2015). A nonparametric approach for estimating cut-offs in continuous risk indicators was applied to type 2 diabetes in Klotsche, Ferger, Pieper, Rehm, and Wittchen (Citation2009). Nonparametric backcasting methods have been used in Punyacharoensin and Viwatwongkasem (Citation2009) to estimate the trends along three decades of the HIV/AIDS epidemic in Thailand. The infection rate and its derivatives of the 2003 Severe Acute Respiratory Syndrome (SARS) epidemic in Beijing, China, were estimated nonparametrically in Chen, Huggins, Yip, and Lam (Citation2008) considering multiplicative counting processes.

3. Survival analysis methods for the study of the distribution of relevant times

Survival analysis has been a key tool for analysing the distribution of relevant times during the current COVID-19 pandemic. This includes incubation period, time until going to hospital, length-of-stay in hospital or in ICU, duration of COVID-19 of a patient and survival times. Related quantities such as recovery rates or mortality rates can be estimated using information about some of these relevant times.

Parametric methods (e.g. lognormal and Gamma distribution) have been recently proposed for the estimation of the distribution of incubation period of COVID-19 based on the doubly interval-censored data (Yin, Zhu, and Lu Citation2021). But more flexible methods have been presented as well. They consist of semiparametric regression analysis of doubly censored (Wong, Zhou, and Hu Citation2022) and a simple semiparametric sieve-estimation method based on Laguerre Polynomials (Kreiss and Van Keilegom Citation2022). Also Deng, You, Liu, Qin, and Zhou (Citation2021) use theory from renewal process by considering the incubation period as the interarrival time, and the duration between departure from Wuhan and onset of symptoms as the mixture of forward time and interarrival time with censored intervals. As a consequence, a consistent estimator for the distribution of the generation time based on the incubation period and a serial interval is proposed for incubation-infectious diseases. A right truncation mechanism is considered by Linton et al. (Citation2020), who carried out a statistical analysis of publicly available case data.

Different times related to COVID-19 hospital admission have been studied using flexible statistical tools. For instance, time until hospitalisation in Galicia (NW Spain), has been studied by Pedrosa-Laza, López-Cheda, and Cao (Citation2022), who used nonparametric mixture cure models selecting relevant covariates such as sex and age. The distribution of length-of-stay has been considered by López-Cheda, Jácome, Cao, and De Salazar (Citation2021), who also used nonparametric mixture cure models and showed that the proposed model outperformed standard approaches, providing more accurate ICU and hospital ward length-of-stay distribution estimates. Adjusting for sex and age is key for accurately forecasting hospital ward and ICU occupancy, as well as discharge or death outcomes. Concerning home-isolated patients, the duration of SARS-CoV-2 RNA detection in COVID-19 patients was studied by Omar et al. (Citation2020), using interval-censored survival analysis. The method was applied to a 2020 database from Rhineland-Palatinate, Germany.

Several methods are reviewed in Seaman, Presanis, and Jackson (Citation2022) for estimating a time-to-event distribution from right-truncated data in an epidemic. For example, one may be interested in estimating the distribution of time from onset of disease symptoms to death. Since at the beginning of an epidemic the only data available are coming from individuals who have died, right truncation causes the distribution of times to event in the sample to be biased towards shorter times compared to the population distribution. These methods are illustrated by applying them to data from individuals who had died from coronavirus disease by 5 April 2020. Other works have considered explanatory covariates as risk factors for death caused by COVID-19. Cox regression and mixture cure models (Seif, Sharafi, Ghaem, and Kasraei Citation2022) were used to study factors associated with the survival of Iranian patients with COVID-19. The findings revealed that important factors were age, history of diabetes and hypertension.

A new empirical likelihood method is proposed in Hu, Liang, Dai, and Bao (Citation2022) to analyse the recovery rate of COVID-19 based on a doubly censored dataset. The method provides much less biased estimators than previous approaches when the censoring percentage is large. The analysis of COVID-19 in-hospital mortality has been carried out in Xue, Saeed, Castagna, Jorde, and Agalliu (Citation2022). Competing risk models, mixture cure models and promotion time cure models have been considered and compared for estimating in-hospital mortality. These methods were applied to a cohort of COVID-19 in-hospital patients with diabetes, showing that statin use improved the overall survival of older but not younger patients.

4. Nonparametric estimation of COVID-19 incidence and infection, transmission and mortality rates

Nonparametric smoothing techniques are widely used to estimate COVID-19-related curves. This is the case of the incidence and infection, transmission and mortality rates.

Spline-based Bayesian hierarchical models have been used in Wistuba, Mayr, and Staerk (Citation2022) to estimate the incidence of the COVID-19 pandemic in Germany via death counts. Kernel principal component analysis control charts and K-nearest neighbour control chart were used in Fawzy and Ghalib (Citation2022) to monitor the number of infections of COVID-19. These methods were applied to official data coming from the Public Health Department of the Iraqi Ministry of Health. Generalised additive models were used by de Oliveira, Binner, Mandal, Kelly, and Power (Citation2021) to model cumulative and daily curves for confirmed cases and deaths. The pure Markov-Switching and the family of Markov-Switching GARCH models were used to identify structural breaks in the COVID-19 time series. These methods were applied in the time series of 189 countries, collected from the Centre for Systems Science and Engineering at Johns Hopkins University.

A space-time epidemic modelling framework was proposed in Wang et al. (Citation2022) to study the spatial-temporal pattern in the spread of infectious diseases. A quasi-likelihood approach via the penalised spline approximation and alternatively reweighted least-squares techniques are considered to estimate the model. The proposed method is applied to the COVID-19 pandemic.

Parametric regression methods – including the linear, log-linear, polynomial, generative additive regression, and spline regression – and nonparametric ones – including K-nearest neighbourhood, support vector machines and decision trees – have been used in Atteia, Mengash, and Samee (Citation2021) for estimating models for the total number of COVID-19 cases and total deaths.

A semiparametric Bayesian approach is presented in Schweinberger, Bomiriya, and Babkin (Citation2022) for flexible semiparametric modelling of epidemics. The method was applied to model the spread of the coronavirus MERS in South Korea in 2015.

Official data on the number of people infected with SARS-CoV-2-have been released in most of the countries just on the basis of a non-representative sample of population which tested positive. However a reliable estimation of the number of infected, including asymptomatic people, turns out to be crucial. In order to overcome the current data shortcoming, a bootstrap-driven, estimation procedure for the number of people infected with the SARS-CoV-2 is proposed in Fenga (Citation2021). This method was applied to data from Italy.

Estimation of viral load in wastewater samples is a proxy that has been used worldwide to overcome the problem of asymptomatic patients. In Courbariaux et al. (Citation2022), a flexible smoother adapted to censored data with outliers was used to monitor the viral load of SARS-CoV-2 in wastewaters arriving at wastewater treatment plants (WWTPs) in France.

Generalised additive models, kernel smoothing and LOESS have been used in Vallejo et al. (Citation2022) to monitor the viral load of SARS-CoV-2 in a wastewater plant located in Northwest Spain and to formulate statistical models to predict the number of infected people (symptomatic and asymptomatic) based on the smoothed viral load and some other environmental variables.

Compartmental models that allow the infection rate to assume any continuous-time profile are considered by Bisiacco, Pillonetto, and Cobelli (Citation2022). Closed-form expressions of the infection rate time-course can be derived for such models. As a consequence, a nonparametric estimate of the infection rate is proposed. Using real data collected in Italy, this technique proved to be a useful tool to monitor COVID-19 transmission dynamics.

In Scrucca (Citation2022), a near real-time COVID-19 index is proposed for monitoring the evolution of the pandemic. The index is computed from predictions obtained from a GAM beta regression for modelling the test-positive rate as a function of time. The proposal is illustrated using data on the COVID-19 pandemic in Italy. A minimax approximation criterion and polynomial splines are used in Abbasov et al. (Citation2022) for estimating and predicting dynamics of the spread of the coronavirus infection. The method was applied to model the spread of the coronavirus infection in Russia for a period of more than one year.

Case fatality ratios during COVID-19 have been estimated nonparametrically in Ghosh, Samanta, and Nieto (Citation2021) by means of the Kaplan–Meier estimator. Furthermore, a nonparametric cure model has been estimated using Nadaraya–Watson weights to estimate the cure rate. The methods are illustrated by estimating relative risks and cumulative mortality rates of COVID-19 in Spain, Italy and Israel.

In Pillonetto, Bisiacco, Palu, and Cobelli (Citation2021), a new class of nonparametric compartmental models is developed to describe how the impact of the lockdown varies in time. Hospitalised data are mapped into an infinite-dimensional space, hence obtaining a function which also takes into account how social distancing measures and people's growing awareness of infection's risk evolves as time progresses. This also allows reconstructing a continuous-time profile of the SARS-CoV-2 reproduction number. When applied to data collected in Lombardy, the model illustrates how people behaviour changed during the restrictions and its importance to contain the epidemic.

The trend pattern of COVID-19 has been studied in Kumar, Agiwal, and Yau (Citation2022) using a spline-based time series model. The estimation of the model parameters is obtained under the Bayesian setup for the best-fitted model. In Pijpers (Citation2021), a nonparametric method for determining epidemiological reproduction numbers is proposed and applied to the timeline of hospitalisation admissions for COVID-19 in the Netherlands up to 20 May 2020. A state-space model is considered in Zhou and Ji (Citation2020) for the transmission dynamics of COVID-19. Semiparametric Bayesian inference is used to fit the model. The approach is applied to COVID-19 data from six states of the United States. An analysis of COVID-19 with spline regression at the province level during the first-level response to a major public health emergency out of Hubei, China, was carried out in Liang and Shen (Citation2021). The 22 Chinese provinces considered from 19 January to 12 March 2020 were grouped into 3 regions, namely eastern, central and western provinces, and the trends between adjacent knots were compared among the 3 regions. Spline regression was applied to the data.

In Shi et al. (Citation2021aCitation2021b) an extended weight kernel density estimation model is used to forecast COVID-19 onset risk and to identify spatiotemporal variations of lockdown effects in China. Other simple nonparametric methods, including time-delay correlation analysis, have been considered in Diebner and Timmesfeld (Citation2021) to explore COVID-19 daily records of diagnosed cases and fatalities. The estimates for Germany, France, Italy, United States of America, United Kingdom, Spain, Switzerland, and Brazil are obtained and compared.

5. Forecasting COVID-19 cases, hospitalisations and deaths

Forecasting the most relevant series for a pandemic like COVID-19 is an important task. Public health decision making can benefit from a reliable forecast. This implies that forecasting techniques are one of the priorities of health authorities.

A semiparametric mixed model for short-term projection of daily COVID-19 incidence in Canada is proposed in Mullah and Yan (Citation2022). The proposed model is shown to describe the historical trend very well with an excellent ability to predict the short-term trajectory. In O'Dea and Drake (Citation2022), a semiparametric, Gaussian infection state-space compartmental model with time-dependent parameters has been formulated. The model is useful for forecasting several COVID-19 relevant series. Its performance is evaluated one to four weeks ahead of forecasts of COVID-19 cases, hospital admissions and deaths in the state of California.

An exploratory approach based on the complex network-defined splines is considered in Demertzis, Tsiotas, and Magafas (Citation2020). The method is applied to the evolution of COVID-19 in Greece. A VAR epidemiological model is proposed in Shang, Galow, and Galow (Citation2021) for regional forecasting of COVID-19 caseload. Nonparametric regression is also used. The method is applied across seven major demographically representative New York City metropolitan region counties. Forecasting COVID-19 onset risk is also considered in Shi et al. (Citation2021a) using an extended weight kernel density estimation model.

6. COVID-19 diagnosis, outcomes and other related classification problems

Several classification problems related to COVID-19 are reviewed in this section. It covers from side effects of the pandemic and COVID-19 sentiment to the diagnosis of COVID-19 and prediction of COVID-19 outcomes for a given patient.

Motivation of teleworkers and non-teleworkers in times of COVID-19 in Spain is analysed in Romeo, Yepes-Baldo, and Beltra (Citation2022). Nonparametric statistical analysis and classification and regression trees are used to explore the motivation of employees who teleworked and who did not, during the lockdown. In Rochim, Widyaningrum, and Eridani (Citation2020), COVID-19 sentiment is classified using support vector machines with different kernel functions. The methods are applied to a dataset about comments on Youtube to analyse public sentiment on the increase in cases at the beginning of the COVID-19 pandemic in Indonesia.

A computer-aided detection method for COVID-19 from CT images has been proposed in Saygili (Citation2022). The method is based on a Gaussian mixture model and a kernel support vector machines classifier. Chest CT images have been also used by Turkoglu (Citation2021) to produce a COVID-19 detection system using multiple kernels-extreme learning machines based on deep convolutional neural networks.

In Alotaibi et al. (Citation2022), an optimal kernel extreme learning machine for COVID-19 classification on an epidemiology dataset is performed. Bayesian nonparametric dimensionality reduction was used in Ajirak et al. (Citation2021) for predicting the severity of COVID-19 in pregnant women using categorical data. The method was applied to a dataset of 155 test-positive COVID-19 pregnant women collected at Stony Brook University Hospital. A nonparametric fuzzy hypothesis testing for quantiles applied to clinical characteristics of COVID-19 was developed in Chukhrova and Johannssen (Citation2021). An additional comprehensive case study is performed on COVID-19 in HIV-infected individuals with a focus on human body temperature and related measurement problems. In Gude-Sampedro et al. (Citation2021), a prognostic model based on comorbidities to predict COVID-19 severity is developed and validated. Predicting progression to disease severity, hospitalisation, admission to intensive care unit (ICU) and mortality in patients with COVID-19 infection has been performed for a dataset of COVID-19 patients collected in Galicia (Northwest Spain). A game-theoretic approach was adopted by Davila-Pena, García-Jurado, and Casas-Méndez (Citation2022) to assess the influence of features on classification problems. The method is applied to the same dataset of COVID-19 patients in Galicia.

7. Nonparametric statistical methods for other setups related to COVID-19

Several issues related to COVID-19 are considered in this section. They include under-reporting probability, estimation of highest density regions for COVID-19, and impacts of COVID-19 in meteorology and mobility.

Truncation data analysis has been proposed in Liang, Dai, and Restaino (Citation2022) for the under-reporting probability of the COVID-19 pandemic. Hypothesis testing on the differences in truncation probabilities, that are related to the under-reporting rates, is also considered. These methods are applied to COVID-19 data in several countries, where under-reporting probabilities are expected to be high. In Saavedra-Nieves (Citation2022), a nonparametric estimation of the highest density regions for COVID-19 is proposed. Set estimation methods are used. The behaviour of classical plug-in methods and a recently proposed hybrid algorithm for highest density regions estimation are compared through an extensive simulation study. Both methods are applied to analyse a real data set about COVID-19 cases in the United States.

The meteorological impact of COVID-19 lockdown is analysed in Hua et al. (Citation2021) using generalised additive models. Reductions in nitrogen dioxide (NO2) and fine particulate matter (PM2.5) at 34 sites in the Beijing area are studied. In López-Oriona, D'Urso, Vilar, and Lafuente-Rego (Citation2022), spatial weighted robust clustering of multivariate time series based on quantile dependence is proposed. The method is applied to multivariate time series of mobility indicators concerning the COVID-19 pandemic.

8. Data science for COVID-19

This Special Issue on Data Science for COVID-19 gathers together five important contributions covering different aspects of the study of the COVID-19 pandemic. In this section, a brief overview of the featured papers is provided.

8.1. Semiparametric estimation of features of the incubation and generation times

A situation of interest in epidemics arises when it is known that a person has infected another one (a transmission pair). The exact infection times of both individuals are assumed to be unobserved, and the difference between them (also unobserved) is known as generation time. The distribution of this random variable and some of its features are of great importance in the evolution of epidemics. For instance, the well-known basic reproduction number (the average number of new infections caused by a single individual) can be expressed as a function of the density of this generation time.

Another unobserved random variable of interest is the incubation time, i.e. the difference between the observed time at which one person has symptoms and the infection time. Kreiss and Van Keilegom (Citation2022) exploit the relationship between the observed symptoms time and some related truncation and location variables to propose a semiparametric approach that allows estimating the densities of the incubation and generation times via a sieve of Laguerre polynomials. The new model is then applied to analyse a real dataset of transmission pairs from the early period of the COVID-19 pandemic.

8.2. Testing for equal under-reporting in different groups

Under-reporting of infected cases and deaths are likely to occur in the early stages of a pandemic. It might be due, for instance, to people dying from the virus before being properly diagnosed to be infected or, on the contrary, to people that recover from the virus after having only mild symptoms, hence equally failing to be properly diagnosed.

Liang et al. (Citation2022) propose a truncation model to study the difference in under-reporting probabilities in different population groups, based on the nonparametric product limit estimator of their corresponding survival functions. They apply this methodology to test the equality of under-reporting probabilities of different age groups across different countries and conclude that, in some countries, elder people had a significantly higher under-reporting probability than younger people.

8.3. Epidemics modelling with a network-based semiparametric framework

The network of contacts in a population crucially influences the way that infectious diseases spread, and so, a network-based approach to epidemics has been explored by many authors (see, for instance, Britton and O'Neill (Citation2002) or Groendyke, Welch, and Hunter (Citation2011)). Such a population contact network is generated by a random graph model, which is usually based on parametric assumptions. However, the parametric models available in the existing literature usually induce short-tailed distributions of the number of contacts of the population members, whereas there is some empirical evidence that in real life these distributions are long-tailed (Jones and Handcock Citation2003).

Therefore, Schweinberger et al. (Citation2022) propose a semiparametric Bayesian approach in which the prior over the epidemiological parameter is parametric, but the prior over the network parameter is nonparametric. A simple numerical example shows that this approach can accommodate both short-tailed and also long-tailed distributions of the number of contacts. The new methodology is then successfully applied to data from the partially observed MERS epidemic in South Korea in 2015, where the coronavirus MERS is related to the coronaviruses SARS and COVID-19.

8.4. COVID-19 hotspot identification through highest density regions

Saavedra-Nieves (Citation2022) suggests to study the expansion of COVID-19 in the United States by analysing the distribution of the locations of confirmed cases of coronavirus. In this regard, areas of high concentration of confirmed cases are specifically detected via the regions of high density of such a distribution.

This immediately poses the problem of estimation of the highest density regions. A recent, fully automatic and nonparametric method of estimation based on geometric features (r-convexity) is considered. That methodology was introduced in Rodríguez-Casal and Saavedra-Nieves (Citation2022), where its theoretical properties were studied in depth. In Saavedra-Nieves (Citation2022) we can find a detailed simulation study comparing its finite-sample performance with the plug-in alternatives, and also an application to COVID-19 data, which shows its usefulness at the time of revealing hotspots in the spreading of the pandemic.

8.5. Spatiotemporal modelling of COVID-19 spread

Susceptible-Infectious-Recovered (SIR) models (see Brauer, Van den Driessche, and Wu Citation2008) provide widely acknowledged tools to describe the dynamics of the spread of an infectious disease. See also Tang et al. (Citation2020) for a recent extensive review. However, they are mostly deterministic and are not intended for handling stochastic variability, or for allowing incorporating auxiliary information in terms of covariates.

Wang et al. (Citation2022) propose to remedy this by considering a nonparametric spatiotemporal model that combines the use of the SIR model as the deterministic skeleton with statistical time series regression models, in a novel spatiotemporal epidemic model (STEM). Consistency and asymptotic normality of the coefficient estimators is shown, and this facilitates making inferences on the proposed models, such as constructing confidence bands for the true model component functions. The new methodology is put into practice with COVID-19 data from the United States.

Acknowledgments

The Guest Editor of this special issue on Data Science for COVID-19 and the Past Editor-in-Chief of the Journal of Nonparametric Statistics would like to thank the members of the Editorial Board of the journal and all the anonymous referees who took part in the evaluation of the manuscripts submitted for possible publication in this special issue.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

The research of the first author has been supported by the MICINN grant PID2020-113578RBI00 and by the Xunta de Galicia (Grupos de Referencia Competitiva ED431C-2020-14 and Centro de Investigación del Sistema Universitario de Galicia ED431G 2019/01), all of them through the European Regional Development Fund (ERDF). The research of the second author has been supported by the MICINN grant PID2019-109387GB-I00 and by the Junta de Extremadura grant GR21044.

References

  • Abbasov, A.M., Mamedov, Z.F., Tvaronaviciene, M., Borodin, A., Vygodchikova, I.Y., and Aliev, A.A. (2022), ‘Estimation and Prediction Dynamics of the Spread of the Coronavirus Infection Using Minimax Approximation Criterion and Polynomial Splines’, Applied and Computational Mathematics, 21(1), 101–117.
  • Ajirak, M., Heiselman, C., Fuchs, A., Heiligenstein, M., Herrera, K., Garretto, D., and Djuric, P.M. (2021), ‘Bayesian Nonparametric Dimensionality Reduction of Categorical Data for Predicting Severity of COVID-19 in Pregnant Women’, in 29th European Signal Processing Conference (EUSIPCO), pp. 1980–1984.
  • Alotaibi, S.S., Al-Rasheed, A., Althahabi, S., Hamza, M.A., Mohamed, A., Zamani, A., Motwakel, A., and Eldesouki, M.I. (2022), ‘Optimal Kernel Extreme Learning Machine for COVID-19 Classification on Epidemiology Dataset’, CMC – Computers Materials & Continua, 73(2), 3305–3318.
  • Atteia, G.E., Mengash, H.A., and Samee, N.A. (2021), ‘Evaluation of Using Parametric and Non-parametric Machine Learning Algorithms for COVID-19 Forecasting’, International Journal of Advanced Computer Science and Applications, 12(10), 647–657.
  • Belloni, M., Laurent, O., Guihenneuc, C., and Ancelet, S. (2020), ‘Bayesian Profile Regression to Deal with Multiple Highly Correlated Exposures and a Censored Survival Outcome. First Application in Ionizing Radiation Epidemiology’, Frontiers in Public Health, 8, 557006.
  • Bergen, S., Sheppard, L., Kaufman, J.D., and Szpiro, A.A. (2016), ‘Multipollutant Measurement Error in Air Pollution Epidemiology Studies Arising from Predicting Exposures with Penalized Regression Splines’, Journal of the Royal Statistical Society Series C - Applied Statistics, 65(5), 731–753.
  • Bisiacco, M., Pillonetto, G., and Cobelli, C. (2022), ‘Closed-form Expressions and Nonparametric Estimation of COVID-19 Infection Rate’, Automatica, 140, 110265.
  • Bobb, J.F., Cruz, M.F., Mooney, S.J., Drewnowski, A., Arterburn, D., and Cook, A.J. (2022), ‘Accounting for Spatial Confounding in Epidemiological Studies with Individual-level Exposures: An Exposure-penalized Spline Approach,’ Journal of the Royal Statistical Society Series A – Statistics in Society, 185(3), 1271–1293.
  • Brauer, F., Van den Driessche, P., and Wu, J. (2008), Mathematical Epidemiology, Berlin: Springer.
  • Britton, T., and O'Neill, P.D. (2002), ‘Statistical Inference for Stochastic Epidemics in Populations with Network Structure’, Scandinavian Journal of Statistics, 29, 375–390.
  • Chen, F., Huggins, R.M., Yip, P.S.F., and Lam, K.F. (2008), ‘Nonparametric Estimation of Multiplicative Counting Process Intensity Functions with An Application to the Beijing SARS Epidemic’, Communications in Statistics - Theory and Methods, 37(2), 294–306.
  • Chukhrova, N., and Johannssen, A. (2021), ‘Nonparametric Fuzzy Hypothesis Testing for Quantiles Applied to Clinical Characteristics of COVID-19’, International Journal of Intelligent Systems, 36(6), 2922–2963.
  • Cook, R.J., Brumback, B.B., Wigg, M.B., and Ryan, L.M. (2001), ‘Synthesis of Evidence from Epidemiological Studies with Interval-censored Exposure Due to Grouping’, Biometrics, 57(3), 671–680.
  • Cortes, J.C., Sánchez-Sánchez, A., Santonja, F.J., and Villanueva, R.J. (2013), ‘Non-parametric Probabilistic Forecasting of Academic Performance in Spanish High School Using an Epidemiological Modelling Approach’, Applied Mathematics and Computation, 221, 648–661.
  • Courbariaux, M., Cluzel, N., Wang, S.Y., Marechal, V., Moulin, L., Wurtzer, S., Mouchel, J.M., Maday, Y., and Nuel, G. (2022), ‘A Flexible Smoother Adapted to Censored Data with Outliers and Its Application to SARS-CoV-2 Monitoring in Wastewater’, Frontiers in Applied Mathematics and Statistics, 8, 836349.
  • Davila-Pena, L., García-Jurado, I., and Casas-Méndez, B. (2022), ‘Assessment of the Influence of Features on a Classification Problem: An Application to COVID-19 Patients’, European Journal of Operational Research, 299(2), 631–641.
  • de Oliveira, A.M.B., Binner, J.M., Mandal, A., Kelly, L., and Power, G.J. (2021), ‘Using GAM Functions and Markov-Switching Models in an Evaluation Framework to Assess Countries’ Performance in Controlling the COVID-19 Pandemic', BMC Public Health, 21(1), 2173.
  • Demertzis, K., Tsiotas, D., and Magafas, L. (2020), ‘Modeling and Forecasting the COVID-19 Temporal Spread in Greece: An Exploratory Approach Based on Complex Network Defined Splines’, International Journal of Environmental Research and Public Health, 17(13), 4693.
  • Deng, Y.H., You, C., Liu, Y.K., Qin, J., and Zhou, X.H. (2021), ‘Estimation of Incubation Period and Generation Time Based on Observed Length-biased Epidemic Cohort with Censoring for COVID-19 Outbreak in China’, Biometrics, 77(3), 929–941.
  • Diebner, H.H., and Timmesfeld, N. (2021), ‘Exploring COVID-19 Daily Records of Diagnosed Cases and Fatalities Based on Simple Nonparametric Methods’, Infectious Disease Reports, 13(2), 302–328.
  • Fawzy, H.M., and Ghalib, A. (2022), ‘Predicting of Infected People with Corona Virus (COVID-19) by Using Non-parametric Quality Control Charts’, International Journal of Nonlinear Analysis and Applications, 13(1), 2115–2126.
  • Fenga, L. (2021), ‘COVID-19: An Automatic, Semiparametric Estimation Method for the Population Infected in Italy’, PeerJ, 9, e10819.
  • Gauderman, W.J., and Thomas, D.C. (1994), ‘Censored Survival Models for Genetic Epidemiology: A Gibbs Sampling Approach’, Genetic Epidemiology, 11(2), 171–188.
  • Geskus, R.B. (2001), ‘Methods for Estimating the AIDS Incubation Time Distribution when Date of Seroconversion is Censored’, Statistics in Medicine, 20(5), 795–812.
  • Ghosh, S., Samanta, G., and Nieto, J.J. (2021), ‘Application of Non-parametric Models for Analyzing Survival Data of COVID-19 Patients’, Journal of Infection and Public Health, 14(10), 1328–1333.
  • Groendyke, C., Welch, D., and Hunter, D.R. (2011), ‘Bayesian Inference for Contact Networks Given Epidemic Data’, Scandinavian Journal of Statistics, 38, 600–616.
  • Gude-Sampedro, F., Fernández-Merino, C., Ferreiro, L., Lado-Baleato, O., Espasandín-Domínguez, J., Hervada, X., Cadarso, C.M., and Valdés, L. (2021), ‘Development and Validation of a Prognostic Model Based on Comorbidities to Predict COVID-19 Severity: A Population-based Study’, International Journal of Epidemiology, 50(1), 64–74.
  • Hernández-Herrera, G., Morina, D., and Navarro, A. (2022), ‘Left-censored Recurrent Event Analysis in Epidemiological Studies: A Proposal for when the Number of Previous Episodes is Unknown’, Medical Research Methodology, 22(1), 20.
  • Hoover, D.R., Munoz, A., Carey, V., Taylor, J.M.G., Vanraden, M., Chmiel, J.S., Kingsley, L., Bacellar, H., Jacobson, L.P., Kuo, V., Park, L., Saah, A., Farzadegan, H., Graham, N., Margolick, J., McArthur, J., Odaka, N., Phair, J.P., Cohen, B., Rinaldo, C.R., Armstrong, J., Gupta, P., Ho, M., Detels, R., Visscher, B.R., Dudley, J., Fahey, J.L., Giorgi, J.V., Imagawa, D., Vermund, S.H., Schrager, L.K., and Kaslow, R.A. (1993), ‘Using Events from Dropouts in Nonparametric Survival Function Estimation with Application to Incubation of AIDS’, Journal of the American Statistical Association, 88(421), 37–43.
  • Howe, C.J., Cole, S.R., Westreich, D.J., Greenland, S., Napravnik, S., and Eron, J.J. (2011), ‘Splines for Trend Analysis and Continuous Confounder Control in Epidemiologic Research’, American Journal of Epidemiology, 173(11), S273.
  • Hu, J., Liang, W., Dai, H.S., and Bao, Y.C. (2022), ‘Efficient Empirical Likelihood Inference for Recovery Rate of COVID19 Under Double-censoring’, Journal of Statistical Planning and Inference, 221, 172–187.
  • Hua, J.X., Zhang, Y.X., de Foy, B., Shang, J., Schauer, J.J.D., Mei, X.D., Sulaymon, I.D., and Han, T.T. (2021), ‘Quantitative Estimation of Meteorological Impacts and the COVID-19 Lockdown Reductions on NO 2 and PM 2.5 oer the Beijing Area Using Generalized Additive Models (GAM)’, Journal of Environmental Management, 291, 112676.
  • Irvine, M.A., and Hollingsworth, T.D. (2018), ‘Kernel-density Estimation and Approximate Bayesian Computation for Flexible Epidemiological Model Fitting in Python’, Epidemics, 25, 80–88.
  • Jones, J.H., and Handcock, M.S. (2003), ‘Social Networks: Sexual Contacts and Epidemic Thresholds’, Nature, 423, 605–606.
  • Khraibani, Z., Jacob, C., Ducrot, C., Charras-Garrido, M., and Sala, C. (2015), ‘A Non Parametric Exact Test Based on the Number of Records for an Early Detection of Emerging Events: Illustration in Epidemiology’, Communications in Statistics - Theory and Methods, 44(4), 726–749.
  • Klotsche, J., Ferger, D., Pieper, L., Rehm, J., and Wittchen, H.U. (2009), ‘A Novel Nonparametric Approach for Estimating Cut-offs in Continuous Risk Indicators with Application to Diabetes Epidemiology’, BMC Medical Research Methodology, 9, 63.
  • Kreiss, A., and Van Keilegom, I. (2022), ‘Semi-parametric Estimation of Incubation and Generation Times by Means of Laguerre Polynomials,’ Journal of Nonparametric Statistics. https://doi.org/10.1080/10485252.2022.2028281.
  • Kumar, J., Agiwal, V., and Yau, C.Y. (2022), ‘Study of the Trend Pattern of COVID-19 Using Spline-based Time Series Model: A Bayesian Paradigm’, Japanese Journal of Statistics and Data Science, 5(1), 363–377.
  • Lemke, D., Mattauch, V., Heidinger, O., Pebesma, E., and Hense, H.W. (2015), ‘Comparing Adaptive and Fixed Bandwidth-based Kernel Density Estimates in Spatial Cancer Epidemiology’, International Journal of Health Geographics, 14, 15.
  • Liang, W., Dai, H.S., and Restaino, M. (2022), ‘Truncation Data Analysis for the Under-reporting Probability in COVID-19 Pandemic,’ Journal of Nonparametric Statistics. https://doi.org/10.1080/10485252.2021.1989426.
  • Liang, C., and Shen, L. (2021), ‘An Analysis of Coronavirus Disease 2019 with Spline Regression at Province Level During First-level Response to Major Public Health Emergency out of Hubei, China’, Epidemiology and Infection, 149, e10.
  • Linton, N.M., Kobayashi, T., Yang, Y.C., Hayashi, K., Akhmetzhanov, A.R., Jung, S.M., Yuan, B.Y., Kinoshita, R., and Nishiura, H. (2020), ‘Incubation Period and Other Epidemiological Characteristics of 2019 Novel Coronavirus Infections with Right Truncation: A Statistical Analysis of Publicly Available Case Data’, Journal of Clinical Medicine, 9(2), 538.
  • Liu, L., Yue, J., Lai, X., Huang, J.P., and Zhang, J. (2019), ‘Multivariate Nonparametric Chart for Influenza Epidemic Monitoring’, Scientific Reports, 9, 17472.
  • López-Cheda, A., Jácome, M.A., Cao, R., and De Salazar, P.M. (2021), ‘Estimating Lengths-of-stay of Hospitalised COVID-19 Patients Using a Non-parametric Model: A Case Study in Galicia (Spain)’, Epidemiology and Infection, 149, e102.
  • López-Oriona, A., D'Urso, P., Vilar, J.A., and Lafuente-Rego, B. (2022), ‘Spatial Weighted Robust Clustering of Multivariate Time Series Based on Quantile Dependence with An Application to Mobility During COVID-19 Pandemic,’ IEEE Transactions on Fuzzy Systems. https://doi.org/10.1109/TFUZZ.2021.3136005.
  • Meng, L.X., and Tao, J. (2017), ‘Bayesian Penalized B-spline Estimation Approach for Epidemic Models’, Journal of Statistical Computation and Simulation, 87(1), 88–99.
  • Mullah, M.A., and Yan, P. (2022), ‘A Semi-parametric Mixed Model for Short-term Projection of Daily COVID-19 Incidence in Canada’, Epidemics, 38, 100537.
  • O'Dea, E.B., and Drake, J.M. (2022), ‘A Semi-parametric, State-space Compartmental Model with Time-dependent Parameters for Forecasting COVID-19 Cases, Hospitalizations and Deaths’, Journal of the Royal Society Interface, 19(187), 20210702.
  • Omar, S., Bartz, C., Becker, S., Basenach, S., Pfeifer, S., Trapp, C., Hamm, H., Schlichting, H.C., Friederichs, M., Koch, U., Jestrabek, C., Hilger, E., Vogt, M., Jahn, K., Chen, S., Barnighausen, T., and Zanger, P. (2020), ‘Duration of SARS-CoV-2 RNA Detection in COVID-19 Patients in Home Isolation, Rhineland-Palatinate, Germany, 2020-an Interval-censored Survival Analysis’, Eurosurveillance, 25(30), 5–12.
  • Pedrosa-Laza, M., López-Cheda, A., and Cao, R. (2022), ‘Cure Models to Estimate Time Until Hospitalization Due to COVID-19’, Applied Intelligence, 52, 794–807.
  • Pijpers, F.P. (2021), ‘A Non-parametric Method for Determining Epidemiological Reproduction Numbers’, Journal of Mathematical Biology, 82(5), 37.
  • Pillonetto, G., Bisiacco, M., Palu, G., and Cobelli, C. (2021), ‘Tracking the Time Course of Reproduction Number and Lockdown's Effect on Human Behaviour During SARS-CoV-2 Epidemic: Nonparametric Estimation’, Scientific Reports, 11(1), 9772.
  • Punyacharoensin, N., and Viwatwongkasem, C. (2009), ‘Trends in Three Decades of HIV/AIDS Epidemic in Thailand by Nonparametric Backcalculation Method’, AIDS, 23(9), 1143–1152.
  • Rao, B.R., Talwaker, S., and Kundu, D. (1991), ‘Confidence-intervals for the Relative Risk Ratio Parameter from Survival-data Under a Random Censorship Model in Biomedical and Epidemiologic Studies (by Simulation)’, Biometrical Journal, 33(8), 959–984.
  • Rochim, A.F., Widyaningrum, K., and Eridani, D. (2020), ‘Performance Comparison of Support Vector Machine Kernel functions in Classifying COVID-19 Sentiment’, in 4th International Seminar on Research of Information Technology and Intelligent Systems (ISRITI).
  • Rodríguez-Casal, A., and Saavedra-Nieves, P. (2022), ‘A Data-adaptive Method for Estimating Density Level Sets Under Shape Conditions’, Annals of Statistics, 50, 1653–1668.
  • Romeo, M., Yepes-Baldo, M., and Beltra, L. (2022), ‘Motivation of Teleworkers and Non-teleworkers in Times of COVID-19 in Spain: An Exploratory Study Using Non-parametric Analysis and Classification and Regression Trees’, Frontiers in Psychology, 13, 852758.
  • Saavedra-Nieves, P. (2022), ‘Nonparametric Estimation of Highest Density Regions for COVID-19,’ Journal of Nonparametric Statistics. https://doi.org/10.1080/10485252.2021.1988083.
  • Savilov, E.D., and Astafyev, V.A. (1987), ‘Use of Nonparametric Statistical Criteria in Epidemiological Investigations’, Zhurnal Mikrobiologii Epidemiologii i Immunobiologii, 11, 36–38.
  • Saygili, A. (2022), ‘Computer-aided Detection of COVID-19 from CT Images Based on Gaussian Mixture Model and Kernel Support Vector Machines Classifier’, Arabian Journal for Science and Engineering, 47(2), 2435–2453. https://doi.org/10.1080/10485252.2021.1972294.
  • Schweinberger, M., Bomiriya, R.P., and Babkin, S. (2022), ‘A Semiparametric Bayesian Approach to Epidemics, with Application to the Spread of the Coronavirus MERS in South Korea in 2015,’ Journal of Nonparametric Statistics. To appear.
  • Scrucca, L. (2022), ‘A COVINDEX Based on a GAM Beta Regression Model with An Application to the COVID-19 Pandemic in Italy,’ Statistical Methods and Applications. https://doi.org/10.1007/s10260-021-00617-y.
  • Seaman, S.R., Presanis, A., and Jackson, C. (2022), ‘Estimating a Time-to-event Distribution from Right-truncated Data in An Epidemic: A Review of Methods,’ Statistical Methods in Medical Research. https://doi.org/10.1177/09622802211023955.
  • Seif, M., Sharafi, M., Ghaem, H., and Kasraei, F. (2022), ‘Factors Associated with Survival of Iranian Patients with COVID-19: Comparison of Cox Regression and Mixture Cure Model’, Tropical Diseases Travel Medicine and Vaccines, 8(1), 4.
  • Shang, A.C., Galow, K.E., and Galow, G.G. (2021), ‘Regional Forecasting of COVID-19 Caseload by Non-parametric Regression: A VAR Epidemiological Model’, Aims Public Health, 8(1), 124–136.
  • Shi, W.Z., Tong, C.Z., Zhang, A.S., Wang, B., Shi, Z.C., Yao, Y.P., and Jia, P. (2021a), ‘An Extended Weight Kernel Density Estimation Model Forecasts COVID-19 Onset Risk and Identifies Spatiotemporal Variations of Lockdown Effects in China’, Communications Biology, 4(1), 126.
  • Shi, W.Z., Tong, C.Z., Zhang, A.S., Wang, B., Shi, Z.C., Yao, Y.P., and Jia, P. (2021b), ‘An Extended Weight Kernel Density Estimation Model Forecasts COVID-19 Onset Risk and Identifies Spatiotemporal Variations of Lockdown Effects in China (vol 4, 126, 2021)’, Communications Biology, 4(1), 418.
  • Smirnova, A., de Camp, L., and Chowell, G. (2019), ‘Forecasting Epidemics Through Nonparametric Estimation of Time-dependent Transmission Rates Using the SEIR Model’, Bulletin of Mathematical Biology, 81(11), 4343–4365.
  • Stoltenbere, E.A., Nordeng, H.M.E., Ystrom, E., and Samuelsen, S.O. (2020), ‘The Cure Model in Perinatal Epidemiology’, Statistical Methods in Medical Research, 29(10), 2783–2794.
  • Tang, L., Zhou, Y., Wang, L., Purkayastha, S., Zhang, L., He, J., Wang, F., and Song, P.X.-K. (2020), ‘A Review of Multi-compartment Infectious Disease Models’, International Statistical Review, 88, 462–513.
  • Turkoglu, M. (2021), ‘COVID-19 Detection System Using Chest CT Images and Multiple Kernels-extreme Learning Machine Based on Deep Neural Network’, IRBM, 42(4), 207–214.
  • Vallejo, J.A., Trigo, N., Rumbo-Feal, S., Conde-Pérez, K., López-Oriona, A., Barbeito, I., Vaamonde, M., Tarrío-Saavedra, J., Reif, R., Ladra, S., Rodiño-Janeiro, B.K., Nasser, M., Cid, A., Veiga, M.C., Acevedo, A., Lamora, C., Bou, G., Cao, R., and Poza, M. (2022), ‘Modeling the Number of People Infected with SARS-COV-2 from Wastewater Viral Load in Northwest Spain’, Science of the Total Environment, 811, 152334.
  • Van Benthem, B.H.B., Veugelers, P.J., Schechter, M.T., Kaldor, J.M., PageShafter, K.A., and Van Griensven, G.J.P. (1997), ‘Modelling the AIDS Incubation Time: Evaluation of Three Right Censoring Strategies’, AIDS, 11(6), 834–835.
  • Wahba, G., Wang, Y.D., Gu, C., Klein, R., and Klein, B. (1995), ‘Smoothing Spline ANOVA for Exponential Families, with Application to the Wisconsin Epidemiologic Study of Diabetic Retinopathy’, Annals of Statistics, 23(6), 1865–1895.
  • Wang, L.X., Graubard, B.I., Katki, H.A., and Li, A.Y. (2020), ‘Improving External Validity of Epidemiologic Cohort Analyses: A Kernel Weighting Approach’, Journal of the Royal Statistical Society Series A – Statistics in Society, 183(3), 1293–1311.
  • Wang, Y.Y., Kim, M., Yu, S., Li, X.Y., Wang, G.N., and Wang, L. (2022), ‘Nonparametric Estimation and Inference for Spatiotemporal Epidemic Models,’ Journal of Nonparametric Statistics. https://doi.org/10.1080/10485252.2021.1988084.
  • Wistuba, T., Mayr, A., and Staerk, C. (2022), ‘Estimating the Course of the COVID-19 Pandemic in Germany Via Spline-based Hierarchical Modelling of Death Counts’, Scientific Reports, 12(1), 9784.
  • Wong, K.Y., Zhou, Q.N., and Hu, T. (2022), ‘Semiparametric Regression Analysis of Doubly-censored Data with Applications to Incubation Period Estimation,’ Lifetime Data Analysis. https://doi.org/10.1007/s10985-022-09567-3.
  • Xu, X.G., Kypraios, T., and O'Neill, P.D. (2016), ‘Bayesian Non-parametric Inference for Stochastic Epidemic Models Using Gaussian Processes’, Biostatistics, 17(4), 619–633.
  • Xue, X.N., Saeed, O., Castagna, F., Jorde, U.P., and Agalliu, I. (2022), ‘The Analysis of COVID-19 in-hospital Mortality: A Competing Risk Approach Or a Cure Model?,’ Statistical Methods in Medical Research. https://doi.org/10.1177/09622802221106300.
  • Yin, M.Z., Zhu, Q.W., and Lu, X. (2021), ‘Parameter Estimation of the Incubation Period of COVID-19 Based on the Doubly Interval-censored Data Model’, Nonlinear Dynamics, 106(2), 1347–1358.
  • Yu, J., Park, J., Choi, T., Hashizume, M., Kim, Y., Honda, Y., and Chung, Y. (2021), ‘Nonparametric Bayesian Functional Meta-regression: Applications in Environmental Epidemiology’, Journal of Agricultural Biological and Environmental Statistics, 26(1), 45–70.
  • Zhou, T.J., and Ji, Y. (2020), ‘Semiparametric Bayesian Inference for the Transmission Dynamics of COVID-19 with a State-space Model’, Contemporary Clinical Trials, 97, 106146.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.