Search results for "metagenomic"
showing 10 items of 177 documents
CROSSMAPPER: estimating cross-mapping rates and optimizing experimental design in multi-species sequencing studies
2020
Motivation Numerous sequencing studies, including transcriptomics of host-pathogen systems, sequencing of hybrid genomes, xenografts, mixed species systems, metagenomics and meta-transcriptomics, involve samples containing genetic material from divergent organisms. A crucial step in these studies is identifying from which organism each sequencing read originated, and the experimental design should be directed to minimize biases caused by cross-mapping of reads to incorrect source genomes. Additionally, pooling of sufficiently different genetic material into a single sequencing library could significantly reduce experimental costs but requires careful planning and assessment of the impact of…
MCRL: using a reference library to compress a metagenome into a non-redundant list of sequences, considering viruses as a case study
2019
Abstract Motivation Metagenomes offer a glimpse into the total genomic diversity contained within a sample. Currently, however, there is no straightforward way to obtain a non-redundant list of all putative homologs of a set of reference sequences present in a metagenome. Results To address this problem, we developed a novel clustering approach called ‘metagenomic clustering by reference library’ (MCRL), where a reference library containing a set of reference genes is clustered with respect to an assembled metagenome. According to our proposed approach, reference genes homologous to similar sets of metagenomic sequences, termed ‘signatures’, are iteratively clustered in a greedy fashion, re…
Adaptive reference-free compression of sequence quality scores
2014
Motivation: Rapid technological progress in DNA sequencing has stimulated interest in compressing the vast datasets that are now routinely produced. Relatively little attention has been paid to compressing the quality scores that are assigned to each sequence, even though these scores may be harder to compress than the sequences themselves. By aggregating a set of reads into a compressed index, we find that the majority of bases can be predicted from the sequence of bases that are adjacent to them and hence are likely to be less informative for variant calling or other applications. The quality scores for such bases are aggressively compressed, leaving a relatively small number at full reso…
Metagenomics reveals our incomplete knowledge of global diversity
2008
Metagenomic sequencing obtains huge amounts of sequences from environmental and clinical samples, thus providing a glimpse of the global prokaryotic diversity of both species and genes in these sources. The current trend in metagenomic analysis follows the so-called gene-centric approach, focused on describing the environments by the study of the functional roles of the proteins encoded in the sequenced genes. In this way, it is clear that metagenomic analysis relies heavily on the accurate knowledge of the universe of proteins stored in the databases. Nevertheless, it is known that some biases exist in the composition of databases (which are rich in sequences from common, cultivable and ea…
Two hundred and fifty-four metagenome-assembled bacterial genomes from the bank vole gut microbiota.
2020
Abstract Vertebrate gut microbiota provide many essential services to their host. To better understand the diversity of such services provided by gut microbiota in wild rodents, we assembled metagenome shotgun sequence data from a small mammal, the bank vole Myodes glareolus (Rodentia, Cricetidae). We were able to identify 254 metagenome assembled genomes (MAGs) that were at least 50% ( n = 133 MAGs), 80% ( n = 77 MAGs) or 95% ( n = 44 MAGs) complete. As typical for a rodent gut microbiota, these MAGs are dominated by taxa assigned to the phyla Bacteroidetes ( n = 132 MAGs) and Firmicutes ( n = 80), with some Spirochaetes ( n = 15) and Proteobacteria ( n = 11). Based on coverage over…
Animal rennets as sources of dairy lactic acid bacteria
2014
ABSTRACT The microbial composition of artisan and industrial animal rennet pastes was studied by using both culture-dependent and -independent approaches. Pyrosequencing targeting the 16S rRNA gene allowed to identify 361 operational taxonomic units (OTUs) to the genus/species level. Among lactic acid bacteria (LAB), Streptococcus thermophilus and some lactobacilli, mainly Lactobacillus crispatus and Lactobacillus reuteri , were the most abundant species, with differences among the samples. Twelve groups of microorganisms were targeted by viable plate counts revealing a dominance of mesophilic cocci. All rennets were able to acidify ultrahigh-temperature-processed (UHT) milk as shown by pH …
Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut
2013
Abstract Background The main limitations in the analysis of viral metagenomes are perhaps the high genetic variability and the lack of information in extant databases. To address these issues, several bioinformatic tools have been specifically designed or adapted for metagenomics by improving read assembly and creating more sensitive methods for homology detection. This study compares the performance of different available assemblers and taxonomic annotation software using simulated viral-metagenomic data. Results We simulated two 454 viral metagenomes using genomes from NCBI's RefSeq database based on the list of actual viruses found in previously published metagenomes. Three different ass…
Comprehensive dataset of shotgun metagenomes from stratified freshwater lakes and ponds
2020
AbstractStratified lakes and ponds featuring steep oxygen gradients are significant net sources of greenhouse gases and hotspots in the carbon cycle. Despite their significant biogeochemical roles, the microbial communities, especially in the oxygen depleted compartments, are poorly known. Here, we present a comprehensive dataset including 267 shotgun metagenomes from 41 stratified lakes and ponds mainly located in the boreal and subarctic regions, but also including one tropical reservoir and one temperate lake. For most lakes and ponds, the data includes a vertical sample set spanning from the oxic surface to the anoxic bottom layer. The majority of the samples were collected during the o…
What is in a lichen? A metagenomic approach to reconstruct the holo-genome of Umbilicaria pustulata
2019
AbstractLichens are valuable models in symbiosis research and promising sources of biosynthetic genes for biotechnological applications. Most lichenized fungi grow slowly, resist aposymbiotic cultivation, and are generally poor candidates for experimentation. Obtaining contiguous, high quality genomes for such symbiotic communities is technically challenging. Here we present the first assembly of a lichen holo-genome from metagenomic whole genome shotgun data comprising both PacBio long reads and Illumina short reads. The nuclear genomes of the two primary components of the lichen symbiosis – the fungus Umbilicaria pustulata (33 Mbp) and the green alga Trebouxia sp. (53 Mbp) – were assemble…
Extremophilic taxa predominate in a microbial community of photovoltaic panels in a tropical region
2021
ABSTRACT Photovoltaic panels can be colonized by a highly diverse microbial diversity, despite life-threatening conditions. Although they are distributed worldwide, the microorganisms living on their surfaces have never been profiled in tropical regions using 16S rRNA high-throughput sequencing and PICRUst metagenome prediction of functional content. In this work, we investigated photovoltaic panels from two cities in southeast Brazil, Sorocaba and Itatiba, using these bioinformatics approach. Results showed that, despite significant differences in microbial diversity (p < 0.001), the taxonomic profile was very similar for both photovoltaic panels, dominated mainly by Proteobacteria,…