1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life.

Mukherjee, Supratim, Seshadri, Rekha, Varghese, Neha J., Eloe-Fadrosh, Emiley A., Meier-Kolthoff, Jan P., Göker, Markus, Coates, R Cameron, Hadjithomas, Michalis, Pavlopoulos, Georgios A., Paez-Espino, David, Yoshikuni, Yasuo, Visel, Axel, Whitman, William B., Garrity, George M., Eisen, Jonathan A., Hugenholtz, Philip, Pati, Amrita, Ivanova, Natalia N., Woyke, Tanja, Klenk, Hans-Peter and Kyrpides, Nikos C. (2017) 1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life.. Nature Biotechnology, 35 7: 676-683. doi:10.1038/nbt.3886

Author Mukherjee, Supratim
Seshadri, Rekha
Varghese, Neha J.
Eloe-Fadrosh, Emiley A.
Meier-Kolthoff, Jan P.
Göker, Markus
Coates, R Cameron
Hadjithomas, Michalis
Pavlopoulos, Georgios A.
Paez-Espino, David
Yoshikuni, Yasuo
Visel, Axel
Whitman, William B.
Garrity, George M.
Eisen, Jonathan A.
Hugenholtz, Philip
Pati, Amrita
Ivanova, Natalia N.
Woyke, Tanja
Klenk, Hans-Peter
Kyrpides, Nikos C.
Title 1,003 reference genomes of bacterial and archaeal isolates expand coverage of the tree of life.
Journal name Nature Biotechnology   Check publisher's open access policy
ISSN 1546-1696
Publication date 2017-06-12
Year available 2017
Sub-type Article (original research)
DOI 10.1038/nbt.3886
Open Access Status DOI
Volume 35
Issue 7
Start page 676
End page 683
Total pages 8
Place of publication New York, NY, United States
Publisher Nature Publishing Group
Language eng
Subject 1305 Biotechnology
1502 Bioengineering
2402 Applied Microbiology and Biotechnology
2204 Biomedical Engineering
1313 Molecular Medicine
Abstract We present 1,003 reference genomes that were sequenced as part of the Genomic Encyclopedia of Bacteria and Archaea (GEBA) initiative, selected to maximize sequence coverage of phylogenetic space. These genomes double the number of existing type strains and expand their overall phylogenetic diversity by 25%. Comparative analyses with previously available finished and draft genomes reveal a 10.5% increase in novel protein families as a function of phylogenetic diversity. The GEBA genomes recruit 25 million previously unassigned metagenomic proteins from 4,650 samples, improving their phylogenetic and functional interpretation. We identify numerous biosynthetic clusters and experimentally validate a divergent phenazine cluster with potential new chemical structure and antimicrobial activity. This Resource is the largest single release of reference genomes to date. Bacterial and archaeal isolate sequence space is still far from saturated, and future endeavors in this direction will continue to be a valuable resource for scientific discovery.
Keyword Reference genomes
Genomic Encyclopedia of Bacteria and Archaea (GEBA)
Q-Index Code C1
Q-Index Status Provisional Code
Grant ID DE-AC02-05CH11231
Institutional Status UQ

Document type: Journal Article
Sub-type: Article (original research)
Collections: HERDC Pre-Audit
School of Chemistry and Molecular Biosciences
Institute for Molecular Bioscience - Publications
Version Filter Type
Citation counts: TR Web of Science Citation Count  Cited 16 times in Thomson Reuters Web of Science Article | Citations
Scopus Citation Count Cited 19 times in Scopus Article | Citations
Google Scholar Search Google Scholar
Created: Fri, 23 Jun 2017, 01:08:28 EST