The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly.

Abstract

Characterizing transcriptomes in non-model organisms has resulted in a massive increase in our understanding of biological phenomena. This boon, largely made possible via high-throughput sequencing, means that studies of functional, evolutionary, and population genomics are now being done by hundreds or even thousands of labs around the world. For many, these studies begin with a de novo transcriptome assembly, which is a technically complicated process involving several discrete steps. The Oyster River Protocol (ORP), described here, implements a standardized and benchmarked set of bioinformatic processes, resulting in an assembly with enhanced qualities over other standard assembly methods. Specifically, ORP produced assemblies have higher Detonate and TransRate scores and mapping rates, which is largely a product of the fact that it leverages a multi-assembler and kmer assembly process, thereby bypassing the shortcomings of any one approach. These improvements are important, as previously unassembled transcripts are included in ORP assemblies, resulting in a significant enhancement of the power of downstream analysis. Further, as part of this study, I show that assembly quality is unrelated with the number of reads generated, above 30 million reads. Code Availability: The version controlled open-source code is available at https://github.com/macmanes-lab/Oyster_River_Protocol. Instructions for software installation and use, and other details are available at http://oyster-river-protocol.rtfd.org/.

Authors

MacManes, Matthew

Publication Date

2018

Published In

PeerJ Journal

Keywords

Assembly

Bioinformatics

Transcriptome

Digital Object Identifier (doi)

https://doi.org/10.7717/peerj.5428

Start Page

e5428

The Oyster River Protocol: a multi-assembler and kmer approach for de novo transcriptome assembly.

Overview

Abstract

Authors

Publication Date

Published In

Research

Keywords

Identity

Digital Object Identifier (doi)

Additional Document Info

Start Page

Volume