Abstract
In this work, we analyze the long-range dependence parameter for a nucleotide sequence in several different transformations. The long-range dependence parameter is estimated by the approximated maximum likelihood method, by a novel estimator based on the spectral envelope theory, by a regression method based on the periodogram function, and also by the detrended fluctuation analysis method. We study the length distribution of coding and noncoding regions for all Homo sapiens chromosomes available from the European Bioinformatics Institute. The parameter of the tail rate decay is estimated by the Hill estimator ˆα. We show that the tail rate decay is greater than 2 for coding regions, while for almost all noncoding regions it is less than 2.
AMS Classification :
Acknowledgements
N. Crato's research was partially supported by FCT-Fundaçcão para a Ciência e Tecnologia (Programme FEDER/POCI 2010), Portugal. R.R. Linhares was supported by CNPq-Brazil. S.R.C. Lopes research was partially supported by CNPq-Brazil, by CAPES-Brazil, by INCT-Matemática and also by Pronex Probabilidade e Processos Estocásticos - E-26/170.008/2008 -APQ1. The authors thank the editor and two anonymous referees for valuable comments that greatly improved the paper.