A total crapshoot? Evaluating bioinformatic decisions in animal diet metabarcoding analyses.

Abstract

Metabarcoding studies provide a powerful approach to estimate the diversity and abundance of organisms in mixed communities in nature. While strategies exist for optimizing sample and sequence library preparation, best practices for bioinformatic processing of amplicon sequence data are lacking in animal diet studies. Here we evaluate how decisions made in core bioinformatic processes, including sequence filtering, database design, and classification, can influence animal metabarcoding results. We show that denoising methods have lower error rates compared to traditional clustering methods, although these differences are largely mitigated by removing low-abundance sequence variants. We also found that available reference datasets from GenBank and BOLD for the animal marker gene cytochrome oxidase I (COI) can be complementary, and we discuss methods to improve existing databases to include versioned releases. Taxonomic classification methods can dramatically affect results. For example, the commonly used Barcode of Life Database (BOLD) Classification API assigned fewer names to samples from order through species levels using both a mock community and bat guano samples compared to all other classifiers (vsearch-SINTAX and q2-feature-classifier's BLAST + LCA, VSEARCH + LCA, and Naive Bayes classifiers). The lack of consensus on bioinformatics best practices limits comparisons among studies and may introduce biases. Our work suggests that biological mock communities offer a useful standard to evaluate the myriad computational decisions impacting animal metabarcoding accuracy. Further, these comparisons highlight the need for continual evaluations as new tools are adopted to ensure that the inferences drawn reflect meaningful biology instead of digital artifacts.

Authors

O'Rourke, Devon R

Bokulich, Nicholas A

Jusino, Michelle A

MacManes, Matthew

Foster, Jeffrey T

Publication Date

September 2020

Published In

Ecology and Evolution Journal

Digital Object Identifier (doi)

https://doi.org/10.1002/ece3.6594

Start Page

9721

End Page

9739

A total crapshoot? Evaluating bioinformatic decisions in animal diet metabarcoding analyses.

Overview

Abstract

Authors

Publication Date

Published In

Identity

Digital Object Identifier (doi)

Additional Document Info

Start Page

End Page

Volume

Issue