834
Views
0
CrossRef citations to date
0
Altmetric
Editorial

Editorial to special issue Frontiers of Data Analysis

&

The special issue Frontiers of Data Analysis of the Journal of Applied Statistics (JAS), Taylor & Francis, contains papers that were presented in the 2019 ICSA China Conference which took place during July 1–4, 2019 on the campus of Nankai University, Tianjin, China. The conference was organized jointly by the International Chinese Statistical Association, Nankai University and Shanghai Jiaotong University. The conference was also co-sponsored by the K.C. Wong Education Foundation of Hong Kong and Chern Institute of Mathematics of Nankai University. The scientific program committee of the conference focused on creating collaboration opportunities and identifying new directions for further research. The conference attracted more than 500 participants and offered two plenary keynote lectures, 102 invited scientific sessions, and social events that included the opening mixer and banquet. The scientific sessions covered a wide range of topics including biostatistics, bioinformatics, statistics, engineering, finance, economics, genetics and genomics, big data computing, clinical trials, health policy and data science. This special issue of JAS includes 10 papers that were presented in the conference and were carefully reviewed by referees under the guideline of the journal. The selected papers cover various topics of state of art statistical methods and applications.

He, Sun and Shao [Citation1] considered a failure time data analysis problem under the context of gene identification and presented a network-based survival analysis approach for identifying target genes that can be used for developing cancer immunotherapies and predicting patient survival. In the method, the Cox model was used and the LASSO estimation procedure was applied for estimation. Then the authors applied the approach for identifying candidate genes as possible targets for new intervention of gliomas based on both animal and human data.

Sub-cohort sampling designs including nested case-control (NCC) and case-cohort (CC) studies are common in biomedical studies. Lee, Zeleniuch-Jacquotte and Liu [Citation2] presented Monte Carlo simulation studies on risk prediction performance under NCC, CC and full-cohort studies to illustrate the importance of the matching procedure in these designs.

In statistical process control (SPC), the quality of a process can be characterized by a functional relationship between two or more variables, particularly by a profile monitoring. Liu, Zhu and Lin [Citation3] studied a generalized likelihood ratio test for monitoring a profile data through a nonparametric regression by estimating the on-line profiles without a prespecified functional form for the profiles.

Wang, Jia and Jin [Citation4] discussed regression analysis of bivariate right-censored failure time data or univariate right-censored failure time data with dependent censoring with the focus on estimation of the cumulative baseline hazard function. More specifically, they investigated the situation where the failure time of interest follows the Cox model and the relationship between two correlated failure times or the failure time of interest and the censoring time can be described by an Archimedean copula model. In addition to providing a useful formula for the estimation, the authors also developed a graphical model checking procedure.

Wang et al. [Citation5] investigated the inference about the class of Box-Cox transformation models, which is commonly used for regression analysis of failure time data due to its flexibility, when one observes left-truncated and right-censored failure time data. In particular, the proportional hazards model and the additive hazards model are two special cases. The authors proposed a Bayesian estimation approach for the situation where the baseline hazards function can be approximated by the piecewise function. In the method, a conditional marginal prior was employed and a MCMC sampling procedure was developed.

Yang et al. [Citation6] addressed the multiplicity issue in the multiple comparisons in correlated binary data. They present asymptotic simultaneous confidence intervals (SCIs) for many-to-one comparisons of proportion differences adjusting for multiplicity and the correlation, which can be applied in the analysis of paired data arising in ophthalmological, orthopedic and otolaryngologic studies.

The Cox proportional hazards regression model has been used extensively in social science, biomedical sciences, economics and other fields. Zhao and Su [Citation7] extended the Cox proportional hazards regression model with a generalized concept of relative risk and presented a novel application on how to identify factors associated with the housing price. With the housing data in Tianjin, a metropolitan city in China, they presented the results that how the macro regulation policies, qualities of public schools, the structure and neighborhood characteristics and distance of the residential property to downtown Tianjin were associated with the housing price.

The order-of-addition experiment arises in many areas including biochemistry, pharmaceutical science and food and nutritional science. It is to identify the optimal order of adding components for optimizing the response of interest. With large m components, it is not feasible to test all m! possible adding order. Zhao et al. [Citation8] provided a study on pair-wise ordering designs and presented a new class of minimal-point pair-wise ordering designs using a recursive relation between two successive full pair-wise ordering designs.

Interval-censored failure time data are a general type of failure time data and include right-censored failure time data as a special case. Correspondingly a great deal of literature has been established for regression analysis of interval-censored data. In contrast, only limited literature exists on regression analysis of mixed interval-censored, the topic of Zhu et al. [Citation9], a combination of exact and interval-censored observations. For the problem, the authors investigated the fitting of the proportional odds model to the data and derived the maximum likelihood estimation procedure. Furthermore, they established the asymptotic properties of the proposed estimators and applied the method a motivating set of data on child cancer survivors.

As failure time data, longitudinal data also often occur in various studies and in various forms. Yu and Zhong [Citation10] discussed regression analysis of high-dimensional longitudinal data under the time-varying linear mixed effect model and presented a penalized estimation procedure with the use of both lasso penalty and fusion penalty. In particular, an efficient two-stage parameter estimation algorithm was provided for the estimation of longitudinal trajectories of fixed effects coefficients, and the approach was applied to a set of health and retirement survey data.

Acknowledgments

We would like to thank the authors and reviewers for their time and effort. We are indebted to the Editor-in-Chief, Dr. Jie Chen, and her editorial team for their editorial assistance.

References

  • X. He, X. Sun, and Y. Shao, Network-based survival analysis to discover target genes for developing cancer immunotherapies and predicting patient survival. J. Appl. Stat. 48 (2021), pp. 1352–1373.
  • M. Lee, A. Zeleniuch-Jacquotte, and M. Liu, Empirical evaluation of sub-cohort sampling designs for risk prediction modeling. J. Appl. Stat. 48 (2021), pp. 1374–1401.
  • Y. Liu, J. Zhu, and D.K.L. Lin, A generalized likelihood ratio test for monitoring profile data. J. Appl. Stat. 48 (2021), pp. 1402–1415.
  • A. Wang, X. Jia, and Z. Jin, Estimation of the cumulative baseline hazard function for dependently right-censored failure time data. J. Appl. Stat. 48 (2021), pp. 1416–1428.
  • C. Wang, J. Jiang, L. Luo, and S. Wang, Bayesian analysis of the Box-Cox transformation model based on left-truncated and right-censored data. J. Appl. Stat. 48 (2021), pp. 1429–1441.
  • Z. Yang, G.-L. Tian, X. Liu, and C.-X. Ma, Simultaneous confidence interval construction for many-to-one comparisons of proportion differences based on correlated paired data. J. Appl. Stat. 48 (2021), pp. 1442–1456.
  • B.B. Zhao, and R. Su, Determinants of the heavily right-tailed residential housing price in Tianjin. J. Appl. Stat. 48 (2021), pp. 1457–1474.
  • Y. Zhao, D.K.J. Lin, and M.-Q. Liu, Designs for order-of-addition experiments. J. Appl. Stat. 48 (2021), pp. 1475–1495.
  • L. Zhu, X. Tong, D. Cai, Y. Li, R. Sun, D.K. Srivastava, and M.M. Hudson, Maximum likelihood estimation for the proportional odds model with mixed interval-censored failure time data. J. Appl. Stat. 48 (2021), pp. 1496–1512.
  • J. Yu, and H. Zhong, Time varying mixed effects model with fused lasso regularization. J. Appl. Stat. 48 (2021), pp. 1513–1526.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.