AUTHOR
Rob Phillips
MCRL: using a reference library to compress a metagenome into a non-redundant list of sequences, considering viruses as a case study
Abstract. Motivation: Metagenomes offer a glimpse into the total genomic diversity contained within a sample. Currently, however, there is no straightforward way to obtain a non-redundant list of all putative homologs of a set of reference sequences present in a metagenome. Results: To address this problem, we developed a novel clustering approach called ‘metagenomic clustering by reference library’ (MCRL), where a reference library containing a set of reference genes is clustered with respect to an assembled metagenome. According to our proposed approach, reference genes homologous to similar sets of metagenomic sequences, termed ‘signatures’, are iteratively clustered in a greedy fashion, re…
Human Phageprints: A high-resolution exploration of oral phages reveals globally-distributed phage families with individual-specific and temporally-stable community compositions
Abstract. Metagenomic studies have revolutionized the study of novel phages. However, these studies trade depth of coverage for breadth. In this study, we show that targeted sequencing of a phage genomic region as small as 200-300 base pairs can provide sufficient sequence diversity to serve as an individual-specific barcode, or “Phageprint”. The targeted approach reveals a high-resolution view of phage communities that is not available through metagenomic datasets. By creating instructional videos and collection kits, we enabled citizen scientists to gather ∼700 oral samples spanning ∼100 individuals residing in different parts of the world. In examining phage communities at 6 differen…