763
Views
0
CrossRef citations to date
0
Altmetric
Research Article

Filling reference libraries with diatom environmental sequences: strengths and weaknesses

ORCID Icon, , ORCID Icon, ORCID Icon, , ORCID Icon, , ORCID Icon, ORCID Icon, ORCID Icon, ORCID Icon & ORCID Icon show all
Pages 103-127 | Received 16 Mar 2022, Accepted 26 Jun 2023, Published online: 12 Sep 2023
 

Abstract

Diatom species identification with DNA metabarcoding is an economical, fast and reliable alternative to identification via light microscopy for river quality monitoring. Using a short DNA sequence of the rbcL gene and ‘Diat.barcode’, a reference barcode library, enables the identification of more than 90% of the environmental sequences to species level in French rivers. But the completeness of this library is much lower in other regions, such as the tropical French overseas departments. A barcode library completion method using high-throughput sequencing data combined with microscopy count data from natural samples (Rimet et al. Citation2018) was applied and tested in rivers of Martinique and Guadeloupe (West Indies), for which only 45% of the environmental sequences could be identified to species level using Diat.barcode v9. Assigning barcodes to the most abundant species in the islands by this method is illustrated with Ulnaria goulardii and two new species belonging to Nupela and Epithemia, which are also described in this paper. The more complex situation of morphologically similar species is illustrated by reference to Gomphonema designatum and G. bourbonense. Using a combination of molecular and morphological data, their conspecificity, as G. bourbonense, is demonstrated with their reference barcodes. However, when several morphologically similar species and several environmental sequences belonging to the same clade are present, it is not possible to relate the barcodes to corresponding morphological species.

Applying this method enabled the Diat.barcode library (v.10) to be updated, with 84% of the environmental sequences from the West Indies now identifiable at the species level. However, many morphological species still lack barcodes. In these cases, more classical methods, such as cell isolation, Sanger sequencing and morphological observations of cultures, must be applied.

Acknowledgments

The authors gratefully acknowledge Anne Eulin-Garrigue for providing the results of her previous work on the same sites. Zlatko Levkov received support from the Alexander von Humboldt Foundation.

Disclosure statement

No potential conflict of interest was reported by the author(s).

Supplementary data

Supplemental data for this article can be accessed at https://doi.org/10.1080/0269249X.2023.2237977. Anne Eulin-Garrigue and Estelle Lefrancois provided the supplemental data when floristic counts were made in the framework of the monitoring of rivers (Eulin et al. Citation2017a, Citationb, Citationc, Citationd, Citatione, Citationf). Figure legends for Supplementary data: Supplementary data 1. LM micrographs of Nupela boucheziae sp. nov. Figs 1–23. Valves with raphe and Figs. 1–19. Valves without raphe, from a site called Blanche-Pont de l'Alma in Martinique. Scale bar = 10 m. Supplementary data 2. LM micrographs of Epithemia boucheziae sp. nov. Figs 1–10. Valve view. Figs 11–13c. Girdle view, from river Riviere Bras David in Guadeloupe. Scale bar = 10 m. Supplementary data 3. SEM micrographs of Epithemia boucheziae sp. nov. Figs 1–2. External valve view. Figs. 3–4 Internal valve view. Scale bars = 10 m. Supplementary data 4. Spearman's correlation between frustule counts of suspected species and percentage of reads of ASVs. (Correlation coefficient: bottom left of the table, p-value: upper right of the table.)

Additional information

Funding

This work was funded by OFB: Office Français de la Biodiversité.