Research Article

Markov chain composite likelihood and its application in genetic recombination model

Pages 1389-1415 | Received 08 Sep 2023, Accepted 19 Nov 2023, Published online: 29 Nov 2023
 

Abstract

Phylogenetic trees are critical in human genome research for investigating human evolution and identifying disease-associated genetic markers. New high-throughput genome sequencing technologies create an urgent need for statistical methods that can construct phylogenetic trees from long genome sequences quickly while accounting for various biological complexities. Although an ancestral mixture model has been proposed to this end [Chen SC, Lindsay BG. Building mixture trees from binary sequence data. Biometrika. 2006;93(4):843–860. doi: 10.1093/biomet/93.4.843], it accounts for genetic mutations over generations but omits another essential evolutionary factor, genetic recombination. In this paper, we therefore develop a novel genetic recombination model for tree construction and propose using Markov chain composite likelihood (MCCL) to make model estimation computationally feasible. To further reduce computational complexity, we construct a hierarchical estimator that estimates the unknown ancestral distributions through MCCL. Simulation studies and a real data example show that the proposed methods perform well and run quickly, and so have the potential to be applied to long-sequence genome data.

Acknowledgments

The authors express their sincere gratitude for the reviewer's insightful comments and valuable suggestions, which have significantly improved this manuscript. During the preparation of this work, Dr. Bruce G. Lindsay passed away due to an illness. We dearly miss this brilliant statistician, wise and excellent advisor, and warm friend. The first author is particularly grateful for Professor Lindsay's invaluable mentorship, support, and encouragement of her early-stage research, which she will treasure forever.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Additional information

Funding

This work was partially supported by the NSF [grant number 1950549].
