1,215
Views
51
CrossRef citations to date
0
Altmetric
Articles

Coalescent-based species delimitation is sensitive to geographic sampling and isolation by distance

, , , &
 

Abstract

Species are a fundamental unit of biodiversity that are delimited via genetic data and coalescent-based methods with increasing frequency. Despite the widespread use of coalescent-based species delimitation, we do not fully understand the sensitivity of these methods to potential sources of bias and violations of their underlying assumptions. One implicit assumption of coalescent-based species delimitation is that geographic sampling is adequate and representative of genetic variation among populations within the lineage of interest. Yet exhaustive geographic sampling is logistically difficult, if not impossible, for many taxa that span large geographic expanses or occupy remote regions. Here, we examine the impact of geographic sampling on the output of Bayes-factor delimitation with SNAPP, a popular coalescent-based species delimitation pipeline. First, we demonstrate the problematic nature of sparse geographic sampling and isolation by distance for species delimitation using simulated data sets of populations connected by different levels of gene flow. We then examine whether similar trends are present in an empirical dataset of Andesiops mayflies (Ephemeroptera: Baetidae) from a high elevation transect in the Ecuadorian Andes. In both the simulated and empirical analyses, we systematically exclude geographically intermediate sites to quantify the impact of geographic sampling and isolation by distance on coalescent-based species delimitation. We find that removing intermediate sites with genetically admixed individuals incorrectly favors multi-species delimitation scenarios. Oversplitting is especially pronounced when isolation by distance is strong, but exists even when gene flow among neighboring populations is relatively high. These findings highlight the importance of adequate geographic sampling in species delimitation and urge caution in interpreting the output of such methods when species’ distributions are sparsely sampled and in systems characterized by strong patterns of isolation by distance.

Acknowledgments

We thank L. Knowles and D. Hillis for organizing a symposium on species delimitation at the Portland 2017 Evolution meetings that inspired our study. We also thank M. Hahn, T. Reeder, A. Leache, and S. Harrington for useful feedback and input on earlier versions of our study.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplemental data

Supplemental data for this article can be accessed here: https://doi.org/10.1080/14772000.2020.1730475.

Associate Editor: Dimitar Dimitrov

Additional information

Funding

This research was supported by National Science Foundation (NSF) Dimensions of Biodiversity Grants DEB-1046408 and DEB-1045960. N. Mason was supported by a NSF Postdoctoral Fellowship in Biology (DBI-1710739).

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.