Search results for "metagenomic"

showing 10 items of 177 documents

CROSSMAPPER: estimating cross-mapping rates and optimizing experimental design in multi-species sequencing studies

2020

Motivation Numerous sequencing studies, including transcriptomics of host-pathogen systems, sequencing of hybrid genomes, xenografts, mixed species systems, metagenomics and meta-transcriptomics, involve samples containing genetic material from divergent organisms. A crucial step in these studies is identifying from which organism each sequencing read originated, and the experimental design should be directed to minimize biases caused by cross-mapping of reads to incorrect source genomes. Additionally, pooling of sufficiently different genetic material into a single sequencing library could significantly reduce experimental costs but requires careful planning and assessment of the impact of…

Statistics and Probability:Informàtica::Aplicacions de la informàtica::Bioinformàtica [Àrees temàtiques de la UPC]Computer sciencecomputer.software_genreBiochemistryGenomeTranscriptome03 medical and health sciencesResource (project management)GenomesTranscriptomicsMolecular BiologyOrganismGenòmica -- Informàtica030304 developmental biology0303 health sciences030306 microbiologyHigh-Throughput Nucleotide SequencingGenomicsSequence Analysis DNADNAGenome analysisGenome AnalysisAnàlisis de seqüènciesComputer Science ApplicationsApplications NoteComputational MathematicsComputational Theory and MathematicsCross-mappingResearch DesignMetagenomicsRNAData miningLine (text file)computerSoftwareGenèticaparametres
researchProduct

MCRL: using a reference library to compress a metagenome into a non-redundant list of sequences, considering viruses as a case study

2019

Abstract Motivation Metagenomes offer a glimpse into the total genomic diversity contained within a sample. Currently, however, there is no straightforward way to obtain a non-redundant list of all putative homologs of a set of reference sequences present in a metagenome. Results To address this problem, we developed a novel clustering approach called ‘metagenomic clustering by reference library’ (MCRL), where a reference library containing a set of reference genes is clustered with respect to an assembled metagenome. According to our proposed approach, reference genes homologous to similar sets of metagenomic sequences, termed ‘signatures’, are iteratively clustered in a greedy fashion, re…

Statistics and ProbabilityContigComputer scienceRobustness (evolution)Computational biologyOriginal PapersBiochemistryComputer Science ApplicationsSet (abstract data type)Computational MathematicsComputational Theory and MathematicsMetagenomicsReference genesGene familyHuman viromeCluster analysisMolecular BiologyBioinformatics
researchProduct

Adaptive reference-free compression of sequence quality scores

2014

Motivation: Rapid technological progress in DNA sequencing has stimulated interest in compressing the vast datasets that are now routinely produced. Relatively little attention has been paid to compressing the quality scores that are assigned to each sequence, even though these scores may be harder to compress than the sequences themselves. By aggregating a set of reads into a compressed index, we find that the majority of bases can be predicted from the sequence of bases that are adjacent to them and hence are likely to be less informative for variant calling or other applications. The quality scores for such bases are aggressively compressed, leaving a relatively small number at full reso…

Statistics and ProbabilityFOS: Computer and information sciencesComputer sciencemedia_common.quotation_subjectReference-freecomputer.software_genreBiochemistryDNA sequencingSet (abstract data type)Redundancy (information theory)BWTComputer Science - Data Structures and AlgorithmsCode (cryptography)AnimalsHumansQuality (business)Data Structures and Algorithms (cs.DS)Quantitative Biology - GenomicsCaenorhabditis elegansMolecular Biologymedia_commonGenomics (q-bio.GN)SequenceGenomeSettore INF/01 - Informaticareference-free compressionHigh-Throughput Nucleotide SequencingGenomicsSequence Analysis DNAData CompressioncompressionComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsFOS: Biological sciencesData miningquality scoreMetagenomicscomputerBWT; compression; quality score; reference-free compressionAlgorithmsReference genome
researchProduct

Metagenomics reveals our incomplete knowledge of global diversity

2008

Metagenomic sequencing obtains huge amounts of sequences from environmental and clinical samples, thus providing a glimpse of the global prokaryotic diversity of both species and genes in these sources. The current trend in metagenomic analysis follows the so-called gene-centric approach, focused on describing the environments by the study of the functional roles of the proteins encoded in the sequenced genes. In this way, it is clear that metagenomic analysis relies heavily on the accurate knowledge of the universe of proteins stored in the databases. Nevertheless, it is known that some biases exist in the composition of databases (which are rich in sequences from common, cultivable and ea…

Statistics and ProbabilityGeneticsPhylogenetic treebiologyPhylumGenetic VariationGenomicsBiodiversityGenomicsGenome Analysisbiology.organism_classificationBiochemistryComputer Science ApplicationsComputational MathematicsTaxonComputational Theory and MathematicsEvolutionary biologyMetagenomicsGenBankCIENCIAS DE LA COMPUTACION E INTELIGENCIA ARTIFICIALTaxonomic rankLetter to the EditorMolecular BiologyEcosystemAcidobacteria
researchProduct

Two hundred and fifty-four metagenome-assembled bacterial genomes from the bank vole gut microbiota.

2020

Abstract Vertebrate gut microbiota provide many essential services to their host. To better understand the diversity of such services provided by gut microbiota in wild rodents, we assembled metagenome shotgun sequence data from a small mammal, the bank vole Myodes glareolus (Rodentia, Cricetidae). We were able to identify 254 metagenome assembled genomes (MAGs) that were at least 50% ( n  = 133 MAGs), 80% ( n  = 77 MAGs) or 95% ( n  = 44 MAGs) complete. As typical for a rodent gut microbiota, these MAGs are dominated by taxa assigned to the phyla Bacteroidetes ( n  = 132 MAGs) and Firmicutes ( n  = 80), with some Spirochaetes ( n  = 15) and Proteobacteria ( n  = 11). Based on coverage over…

Statistics and Probabilitymetagenomicsbacterial genomicsGenomeBacteriametsämyyräArvicolinaesuolistomikrobistoBacterialsequencinggenomiikkaLibrary and Information Sciencesmicrobial ecologybakteeritComputer Science ApplicationsEducationGastrointestinal MicrobiomemikrobiekologiaAnimalslcsh:QStatistics Probability and Uncertaintylcsh:ScienceInformation Systems
researchProduct

Animal rennets as sources of dairy lactic acid bacteria

2014

ABSTRACT The microbial composition of artisan and industrial animal rennet pastes was studied by using both culture-dependent and -independent approaches. Pyrosequencing targeting the 16S rRNA gene allowed to identify 361 operational taxonomic units (OTUs) to the genus/species level. Among lactic acid bacteria (LAB), Streptococcus thermophilus and some lactobacilli, mainly Lactobacillus crispatus and Lactobacillus reuteri , were the most abundant species, with differences among the samples. Twelve groups of microorganisms were targeted by viable plate counts revealing a dominance of mesophilic cocci. All rennets were able to acidify ultrahigh-temperature-processed (UHT) milk as shown by pH …

Streptococcus thermophilusColony CountColony Count MicrobialApplied Microbiology and BiotechnologyAcidification; Animal rennet pastes; Autolysis; Lactic acid bacteria; Microbial ecology; PyrosequencingMicrobial ecologyMicrobialCheeseRNA Ribosomal 16SLactobacillusEnterococcus casseliflavusLactic acid bacteriaCluster AnalysisPhylogenyEcologybiologyLactobacillus crispatusBacterialAnimal rennet pastefood and beveragesPyrosequencingHydrogen-Ion ConcentrationAutolysiBiotaAnimals; Cluster Analysis; Colony Count Microbial; DNA Bacterial; DNA Ribosomal; Enterococcus; Hydrogen-Ion Concentration; Lactobacillus; Microbial Viability; Milk; Molecular Sequence Data; Phylogeny; RNA Ribosomal 16S; Sequence Analysis DNA; Biota; ChymosinMilkSequence AnalysisChymosinBiotechnologyDNA Bacterial16SMolecular Sequence DataDNA RibosomalEnterococcus faecalisMicrobiologyAcidificationAnimalsRibosomalMicrobial ViabilitySequence Analysis DNADNAbiology.organism_classificationLactobacillus reuteriLactobacillusEnterococcusFood MicrobiologyRNAMetagenomicsEnterococcusFood ScienceEnterococcus faeciumSettore AGR/16 - Microbiologia Agraria
researchProduct

Comparison of different assembly and annotation tools on analysis of simulated viral metagenomic communities in the gut

2013

Abstract Background The main limitations in the analysis of viral metagenomes are perhaps the high genetic variability and the lack of information in extant databases. To address these issues, several bioinformatic tools have been specifically designed or adapted for metagenomics by improving read assembly and creating more sensitive methods for homology detection. This study compares the performance of different available assemblers and taxonomic annotation software using simulated viral-metagenomic data. Results We simulated two 454 viral metagenomes using genomes from NCBI's RefSeq database based on the list of actual viruses found in previously published metagenomes. Three different ass…

Taxonomic classificationComputational biologyBiologyGenomeContig MappingContig MappingUser-Computer Interface03 medical and health sciencesAnnotationDatabases GeneticGeneticsRefSeqCluster AnalysisHumansComputer SimulationTaxonomic rank030304 developmental biologyDe Bruijn sequenceInternetPrincipal Component Analysis0303 health sciencesBacteriaContigChimera identification030306 microbiologyComputational BiologyFunctional annotationViral metagenomeIntestinesAssembler performanceMetagenomicsVirusesMetagenomicsAlgorithmsResearch ArticleBiotechnologyBMC Genomics
researchProduct

Comprehensive dataset of shotgun metagenomes from stratified freshwater lakes and ponds

2020

AbstractStratified lakes and ponds featuring steep oxygen gradients are significant net sources of greenhouse gases and hotspots in the carbon cycle. Despite their significant biogeochemical roles, the microbial communities, especially in the oxygen depleted compartments, are poorly known. Here, we present a comprehensive dataset including 267 shotgun metagenomes from 41 stratified lakes and ponds mainly located in the boreal and subarctic regions, but also including one tropical reservoir and one temperate lake. For most lakes and ponds, the data includes a vertical sample set spanning from the oxic surface to the anoxic bottom layer. The majority of the samples were collected during the o…

Total organic carbonBiogeochemical cycleOceanographyBorealMetagenomicsEnvironmental scienceEcosystemAnoxic watersSubarctic climateCarbon cycle
researchProduct

What is in a lichen? A metagenomic approach to reconstruct the holo-genome of Umbilicaria pustulata

2019

AbstractLichens are valuable models in symbiosis research and promising sources of biosynthetic genes for biotechnological applications. Most lichenized fungi grow slowly, resist aposymbiotic cultivation, and are generally poor candidates for experimentation. Obtaining contiguous, high quality genomes for such symbiotic communities is technically challenging. Here we present the first assembly of a lichen holo-genome from metagenomic whole genome shotgun data comprising both PacBio long reads and Illumina short reads. The nuclear genomes of the two primary components of the lichen symbiosis – the fungus Umbilicaria pustulata (33 Mbp) and the green alga Trebouxia sp. (53 Mbp) – were assemble…

TrebouxiaAposymbioticbiologyMetagenomicsShotgun sequencingHorizontal gene transferComputational biologybiology.organism_classificationLichenGeneGenome
researchProduct

Extremophilic taxa predominate in a microbial community of photovoltaic panels in a tropical region

2021

ABSTRACT Photovoltaic panels can be colonized by a highly diverse microbial diversity, despite life-threatening conditions. Although they are distributed worldwide, the microorganisms living on their surfaces have never been profiled in tropical regions using 16S rRNA high-throughput sequencing and PICRUst metagenome prediction of functional content. In this work, we investigated photovoltaic panels from two cities in southeast Brazil, Sorocaba and Itatiba, using these bioinformatics approach. Results showed that, despite significant differences in microbial diversity (p < 0.001), the taxonomic profile was very similar for both photovoltaic panels, dominated mainly by Proteobacteria,…

Tropical Climatefood.ingredientbiologyConstruction MaterialsEcologyPhylumMicrobiotaCyanobacteriabiology.organism_classificationSphingomonasMicrobiologyExtremophilesfoodMicrobial population biologyMetagenomicsGenusRNA Ribosomal 16SHymenobacterSolar EnergyGeneticsMetagenomeDeinococcusProteobacteriaMolecular BiologyFEMS Microbiology Letters
researchProduct