Protein-level assembly increases protein sequence recovery from metagenomic samples manyfold

Cited 160 times in Web of Science; cited 174 times in Scopus
Authors

Steinegger, Martin; Mirdita, Milot; Soeding, Johannes

Issue Date
2019-07
Publisher
Nature Publishing Group
Citation
Nature Methods, Vol.16 No.7, pp.603-609
Abstract
The open-source de novo protein-level assembler, Plass (https://plass.mmseqs.com), assembles six-frame-translated sequencing reads into protein sequences. It recovers 2-10 times more protein sequences from complex metagenomes and can assemble huge datasets. We assembled two redundancy-filtered reference protein catalogs, 2 billion sequences from 640 soil samples (soil reference protein catalog) and 292 million sequences from 775 marine eukaryotic metatranscriptomes (marine eukaryotic reference catalog), the largest free collections of protein sequences.
ISSN
1548-7091
URI
https://hdl.handle.net/10371/202573
DOI
https://doi.org/10.1038/s41592-019-0437-4

Related Researcher

  • College of Natural Sciences
  • School of Biological Sciences
Research Area: Development of algorithms to search, cluster, and assemble sequence data; metagenomic analysis; pathogen detection in sequencing data

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.
