Search results for "reference genome"

showing 7 items of 27 documents

Phylogenetic Distribution of Polysaccharide-Degrading Enzymes in Marine Bacteria

2021

Deconstruction is an essential step of conversion of polysaccharides, and polysaccharide-degrading enzymes play a key role in this process. Although there is recent progress in the identification of these enzymes, the diversity and phylogenetic distribution of these enzymes in marine microorganisms remain largely unknown, hindering our understanding of the ecological roles of marine microorganisms in the ocean carbon cycle. Here, we studied the phylogenetic distribution of nine types of polysaccharide-degrading enzymes in marine bacterial genomes. First, we manually compiled a reference sequence database containing 961 experimentally verified enzymes. With this reference database, we annota…

Microbiology (medical)ecological differentiationPhylogenetic treePhylumcarbohydrate active enzymeslcsh:QR1-502polysaccharide-degrading enzymesGenomicsBacterial genome sizeCellulaseBiologyphylogenyMicrobiologylcsh:MicrobiologyMarine bacteriophagemarine bacteriaEvolutionary biologyPhylogeneticsbiology.proteingenomicsReference genomeOriginal ResearchFrontiers in Microbiology
researchProduct

The era of reference genomes in conservation genomics

2022

Progress in genome sequencing now enables the large-scale generation of reference genomes. Various international initiatives aim to generate reference genomes representing global biodiversity. These genomes provide unique insights into genomic diversity and architecture, thereby enabling comprehensive analyses of population and functional genomics, and are expected to revolutionize conservation genomics.

QH301 Biology580 Plants (Botany)Genetics -- ResearchEvolutionsbiologibiodiversity conservation; conservation genetics; ERGA; European Reference Genome AtlasConservation genetics; Biodiversity conservation; European Reference Genome Atlas; ERGAAnimal genome mappingudc:630*1GenomeGEERGA[SDV.BID.EVO]Life Sciences [q-bio]/Biodiversity/Populations and Evolution [q-bio.PE][SDE.BE.BIOD]Environmental Sciences/Biodiversity and Ecology/domain_sde.be.biodERGA ; Biodiversity [MeSH] ; Genomics [MeSH] ; Ecology Evolution Behavior and Systematics ; conservation genetics ; Genome [MeSH] ; biodiversity conservation ; European Reference Genome Atlas3rd-DASGenomicsBiodiversityreferenčni genomi[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM][SDE.BE.BEC]Environmental Sciences/Biodiversity and Ecology/domain_sde.be.becChemistry10121 Department of Systematic and Evolutionary BotanygenomikaGE Environmental Sciences:Informàtica::Aplicacions de la informàtica::Bioinformàtica [Àrees temàtiques de la UPC]biodiverzitetaSettore BIO/18 - GENETICAeducationQH426 GeneticsQH301European Reference Genome AtlasVDP::Matematikk og Naturvitenskap: 400::Basale biofag: 470[SDE.BE.EVO]Environmental Sciences/Biodiversity and Ecology/domain_sde.be.evoGeneticsconservation genetics ; biodiversity conservation ; European Reference Genome Atlas ; ERGAgenomi10211 Zurich-Basel Plant Science CenterGenomesGenetikBiologyQH426Ecology Evolution Behavior and SystematicsEvolutionary BiologyBiodiversity conservation; Conservation genetics; European Reference Genome AtlasAmbientaleEcologíaGenética1105 Ecology Evolution Behavior and Systematicsconservation geneticsWildlife conservation570 Life sciences; biologyHuman medicinebiodiversity conservationAnimal genetics[SDE.BE]Environmental Sciences/Biodiversity and EcologyGenètica
researchProduct

Adaptive reference-free compression of sequence quality scores

2014

Motivation: Rapid technological progress in DNA sequencing has stimulated interest in compressing the vast datasets that are now routinely produced. Relatively little attention has been paid to compressing the quality scores that are assigned to each sequence, even though these scores may be harder to compress than the sequences themselves. By aggregating a set of reads into a compressed index, we find that the majority of bases can be predicted from the sequence of bases that are adjacent to them and hence are likely to be less informative for variant calling or other applications. The quality scores for such bases are aggressively compressed, leaving a relatively small number at full reso…

Statistics and ProbabilityFOS: Computer and information sciencesComputer sciencemedia_common.quotation_subjectReference-freecomputer.software_genreBiochemistryDNA sequencingSet (abstract data type)Redundancy (information theory)BWTComputer Science - Data Structures and AlgorithmsCode (cryptography)AnimalsHumansQuality (business)Data Structures and Algorithms (cs.DS)Quantitative Biology - GenomicsCaenorhabditis elegansMolecular Biologymedia_commonGenomics (q-bio.GN)SequenceGenomeSettore INF/01 - Informaticareference-free compressionHigh-Throughput Nucleotide SequencingGenomicsSequence Analysis DNAData CompressioncompressionComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsFOS: Biological sciencesData miningquality scoreMetagenomicscomputerBWT; compression; quality score; reference-free compressionAlgorithmsReference genome
researchProduct

SeqEditor: an application for primer design and sequence analysis with or without GTF/GFF files

2021

[Motivation]: Sequence analyses oriented to investigate specific features, patterns and functions of protein and DNA/RNA sequences usually require tools based on graphic interfaces whose main characteristic is their intuitiveness and interactivity with the user’s expertise, especially when curation or primer design tasks are required. However, interface-based tools usually pose certain computational limitations when managing large sequences or complex datasets, such as genome and transcriptome assemblies. Having these requirments in mind we have developed SeqEditor an interactive software tool for nucleotide and protein sequences’ analysis.

Statistics and ProbabilityInterface (Java)Sequence analysisComputer sciencePcr assayBiochemistryGenomeTranscriptome03 medical and health sciencesSequence Analysis ProteinMultiplex polymerase chain reactionHumansNucleotideAmino Acid SequenceMolecular Biology030304 developmental biologychemistry.chemical_classification0303 health sciencesGenomeInformation retrievalContig030302 biochemistry & molecular biologyChromosomeComputer Science ApplicationsComputational MathematicsComputingMethodologies_PATTERNRECOGNITIONComputational Theory and MathematicschemistryLine (text file)Primer (molecular biology)Sequence AnalysisSoftwareReference genome
researchProduct

A web application for the unspecific detection of differentially expressed DNA regions in strand-specific expression data

2015

Abstract Genomic technologies allow laboratories to produce large-scale data sets, either through the use of next-generation sequencing or microarray platforms. To explore these data sets and obtain maximum value from the data, researchers view their results alongside all the known features of a given reference genome. To study transcriptional changes that occur under a given condition, researchers search for regions of the genome that are differentially expressed between different experimental conditions. In order to identify these regions several algorithms have been developed over the years, along with some bioinformatic platforms that enable their use. However, currently available appli…

Statistics and ProbabilitySequence analysisADNGenomicsComputational biologyBiologycomputer.software_genreBiochemistryGenomeComputer GraphicsExpressió genèticaWeb applicationHumansMolecular BiologyGeneInternetMicroarray analysis techniquesbusiness.industryGenome HumanGene Expression ProfilingComputational BiologyHigh-Throughput Nucleotide SequencingDNAGenomicsSequence Analysis DNAComputer Science ApplicationsGene expression profilingComputational MathematicsGenòmicaComputingMethodologies_PATTERNRECOGNITIONComputational Theory and MathematicsData miningbusinesscomputerAlgorithmsGenèticaReference genome
researchProduct

One is not enough: On the effects of reference genome for the mapping and subsequent analyses of short-reads.

2020

Mapping of high-throughput sequencing (HTS) reads to a single arbitrary reference genome is a frequently used approach in microbial genomics. However, the choice of a reference may represent a source of errors that may affect subsequent analyses such as the detection of single nucleotide polymorphisms (SNPs) and phylogenetic inference. In this work, we evaluated the effect of reference choice on short-read sequence data from five clinically and epidemiologically relevant bacteria (Klebsiella pneumoniae, Legionella pneumophila, Neisseria gonorrhoeae, Pseudomonas aeruginosa and Serratia marcescens). Publicly available whole-genome assemblies encompassing the genomic diversity of these species…

Systematic errorSingle Nucleotide PolymorphismsPathology and Laboratory MedicineGenomeKlebsiella PneumoniaeDatabase and Informatics MethodsData sequencesKlebsiellaMedicine and Health SciencesBiology (General)CladePhylogenyData ManagementEcologyPhylogenetic treeBacterial GenomicsMicrobial GeneticsChromosome MappingHigh-Throughput Nucleotide SequencingPhylogenetic AnalysisGenomicsBacterial PathogensPhylogeneticsLegionella PneumophilaComputational Theory and MathematicsMedical MicrobiologyModeling and SimulationPathogensSequence AnalysisResearch ArticleComputer and Information SciencesBioinformaticsQH301-705.5LegionellaSequence alignmentSingle-nucleotide polymorphismGenomicsComputational biologyMicrobial GenomicsBiologyResearch and Analysis MethodsPolymorphism Single NucleotideMicrobiologyCellular and Molecular NeurosciencePhylogeneticsGeneticsSNPBacterial GeneticsEvolutionary SystematicsMolecular BiologyMicrobial PathogensEcology Evolution Behavior and SystematicsTaxonomyEvolutionary BiologyBacteriaOrganismsBiology and Life SciencesBacteriologySequence AlignmentGenome BacterialReference genomePLoS Computational Biology
researchProduct

Whole genome sequencing of the black grouse (Tetrao tetrix): reference guided assembly suggests faster-Z and MHC evolution

2014

Background The different regions of a genome do not evolve at the same rate. For example, comparative genomic studies have suggested that the sex chromosomes and the regions harbouring the immune defence genes in the Major Histocompatability Complex (MHC) may evolve faster than other genomic regions. The advent of the next generation sequencing technologies has made it possible to study which genomic regions are evolutionary liable to change and which are static, as well as enabling an increasing number of genome studies of non-model species. However, de novo sequencing of the whole genome of an organism remains non-trivial. In this study, we present the draft genome of the black grouse, wh…

Tetrao tetrixMaleGenome evolutionBiologyGenomePolymorphism Single NucleotideChromosomesBirdsEvolution MolecularMajor Histocompatibility ComplexGene densityGeneticsAnimalsGenetikGenome sizeRepetitive Sequences Nucleic AcidGeneticsComparative genomicsWhole genome sequencingteeriGenomeComputational BiologyHigh-Throughput Nucleotide SequencingMolecular Sequence AnnotationGenome projectGenomicsEvolutionary biologyReference genomeBiotechnologyResearch ArticleBMC Genomics
researchProduct