77
Views
0
CrossRef citations to date
0
Altmetric
Original Articles

A Note on Distinguishing Random Trees Populations

Pages 239-251 | Received 03 Aug 2009, Accepted 30 Mar 2011, Published online: 07 Dec 2012
 

Abstract

This article addresses the problem of identifying differences between populations of trees. Recently, a sophisticated test was proposed by Balding et al. (Citation2009), the BFFS test, a Kolmogorov type of test that maximizes the differences between the information of the samples, but it does not have a naive computation, since it involves a search over the set of trees that grows exponentially fast. An algorithm for computing the test statistic was devised in Balding et al. (Citation2009), considering a search for a minimum cut over a transport network in a Ford Fulkerson type routine. The test was shown powerful but complex at the time to apply it in practice. On the contrary, we propose a very simple statistical test based on the distance between empirical mean trees, as an analog of the two sample Z statistic for comparing two means. Despite its simplicity, we can report that the test is quite powerful to separate distributions with different means, but it does not distinguish between different populations with the same means. In that case, the BFFS test should be applied. Nevertheless, on a real data set from proteomics, also discussed on Balding et al. (Citation2009), our test obtained the same results, making it a valuable preliminary evaluation tool for random trees population discrimination.

Mathematics Subject Classification:

Acknowledgments

We thank Antonio Galves for illuminating discussions about the discriminative power of context trees. We also thank Ricardo Fraiman for his continuos encouragement and support, and the Universidad de San Andres for its hospitality when part of this article was produced. We would like to thank Florencia Leonardi for providing the data used in our example of determination of protein functionality, which was also analyzed in Leonardi (Citation2007). This work was partially supported by PICT 2005-31659 and Secyt grant 69/08 and 05/B352.

Log in via your institution

Log in to Taylor & Francis Online

PDF download + Online access

  • 48 hours access to article PDF & online version
  • Article PDF can be downloaded
  • Article PDF can be printed
USD 61.00 Add to cart

Issue Purchase

  • 30 days online access to complete issue
  • Article PDFs can be downloaded
  • Article PDFs can be printed
USD 1,069.00 Add to cart

* Local tax will be added as applicable

Related Research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.