Search results for "Sequence analysis"

showing 10 items of 1349 documents

Identification and visualization of differential isoform expression in RNA-seq time series

2018

Motivation: As sequencing technologies improve their capacity to detect distinct transcripts of the same gene and to address complex experimental designs such as longitudinal studies, there is a need to develop statistical methods for the analysis of isoform expression changes in time series data. Results: Iso-maSigPro is a new functionality of the R package maSigPro for transcriptomics time series data analysis. Iso-maSigPro identifies genes with a differential isoform usage across time. The package also includes new clustering and visualization functions that allow grouping of genes with similar expression patterns at the isoform level, as well as those genes with a shift in major …

0301 basic medicine; Statistics and Probability; Gene isoform; Identification; Computer science; Sequence analysis; Gene Expression; RNA-Seq; Computational biology; Biochemistry; Bioconductor; Transcriptome; Mice; 03 medical and health sciences; 0302 clinical medicine; Estadística e Investigación Operativa; RNA Isoforms; Animals; Molecular Biology; Gene; Visualization; Regulation of gene expression; B-Lymphocytes; Sequence Analysis RNA; Gene Expression Profiling; Cell Differentiation; Applications Notes; Computer Science Applications; Visualization; Computational Mathematics; 030104 developmental biology; Gene Expression Regulation; Computational Theory and Mathematics; RNA-seq time series; Software; 030217 neurology & neurosurgery; Isoform expression

panISa: ab initio detection of insertion sequences in bacterial genomes from short read sequence data.

2018

Motivation: The advent of next-generation sequencing has boosted the analysis of bacterial genome evolution. Insertion sequence (IS) elements play a key role in prokaryotic genome organization and evolution, but their repetitions in genomes complicate their detection from short-read data. Results: PanISa is a software pipeline that identifies IS insertions ab initio in bacterial genomes from short-read data. It is a highly sensitive and precise tool based on the detection of read-mapping patterns at the insertion site. PanISa performs better than existing IS detection systems as it is based on a database-free approach. We applied it to a high-risk clone lineage of the pathogenic spec…

0301 basic medicine; Statistics and Probability; Lineage (genetic); Computer science; Ab initio; Computational biology; Bacterial genome size; [INFO.INFO-SE]Computer Science [cs]/Software Engineering [cs.SE]; Biochemistry; Genome; [INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computing; 03 medical and health sciences; [INFO.INFO-CR]Computer Science [cs]/Cryptography and Security [cs.CR]; [SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry Molecular Biology/Genomics [q-bio.GN]; Insertion sequence; Molecular Biology; Genomic organization; High-Throughput Nucleotide Sequencing; Sequence Analysis DNA; [SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]; [SDV.MP.BAC]Life Sciences [q-bio]/Microbiology and Parasitology/Bacteriology; Pipeline (software); [INFO.INFO-MO]Computer Science [cs]/Modeling and Simulation; Computer Science Applications; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; [INFO.INFO-MA]Computer Science [cs]/Multiagent Systems [cs.MA]; DNA Transposable Elements; [INFO.INFO-ET]Computer Science [cs]/Emerging Technologies [cs.ET]; [INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]; Genome Bacterial; Software; Bioinformatics (Oxford, England)

dAPE: a web server to detect homorepeats and follow their evolution.

2016

Summary: Homorepeats are low complexity regions consisting of repetitions of a single amino acid residue. There is no current consensus on the minimum number of residues needed to define a functional homorepeat, nor even if mismatches are allowed. Here we present dAPE, a web server that helps follow the evolution of homorepeats based on orthology information, using a sensitive but tunable cutoff to help in the identification of emerging homorepeats. Availability and Implementation: dAPE can be accessed from http://cbdm-01.zdv.uni-mainz.de/~munoz/polyx. Supplementary information: Supplementary data are available at Bioinformatics online.

0301 basic medicine; Statistics and Probability; Repetitive Sequences Amino Acid; Web server; Internet; Computer science; computer.software_genre; Biochemistry; Applications Notes; Computer Science Applications; World Wide Web; Evolution Molecular; 03 medical and health sciences; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; Animals; Humans; Data mining; Molecular Biology; computer; Sequence Alignment; Sequence Analysis; Software; Bioinformatics (Oxford, England)

AFS: identification and quantification of species composition by metagenomic sequencing

2017

Summary: DNA-based methods to detect and quantify taxon composition in biological materials are often based on species-specific polymerase chain reaction, limited to detecting species targeted by the assay. Next-generation sequencing overcomes this drawback by untargeted shotgun sequencing of whole metagenomes at affordable cost. Here we present AFS, a software pipeline for quantification of species composition in food. AFS uses metagenomic shotgun sequencing and sequence read counting to infer species proportions. Using Illumina data from a reference sausage comprising four species, we reveal that AFS is independent of the sequencing assay and library preparation protocol. Cost-sav…

0301 basic medicine; Statistics and Probability; Sequence analysis; Library preparation; Computational biology; Biology; Bioinformatics; Biochemistry; law.invention; 03 medical and health sciences; 0404 agricultural biotechnology; law; Molecular Biology; Polymerase chain reaction; Shotgun sequencing; High-Throughput Nucleotide Sequencing; Sequence Analysis DNA; 04 agricultural and veterinary sciences; Accession number (bioinformatics); 040401 food science; Biological materials; Computer Science Applications; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; Metagenomics; Food Microbiology; Identification (biology); Metagenomics; Software; Bioinformatics

In vitro versus in vivo compositional landscapes of histone sequence preferences in eucaryotic genomes

2018

Motivation: Although the nucleosome occupancy along a genome can be in part predicted by in vitro experiments, it has been recently observed that the chromatin organization presents important differences in vitro with respect to in vivo. Such differences mainly regard the hierarchical and regular structures of the nucleosome fiber, whose existence has long been assumed, and in part also observed in vitro, but that does not apparently occur in vivo. It is also well known that the DNA sequence has a role in determining the nucleosome occupancy. Therefore, an important issue is to understand if, and to what extent, the structural differences in the chromatin organization between in vit…

0301 basic medicine; Statistics and Probability; ved/biology.organism_classification_rank.species; Computational biology; Saccharomyces cerevisiae; Genome; Biochemistry; DNA sequencing; Histones; 03 medical and health sciences; 0302 clinical medicine; In vivo; Computational Theory and Mathematic; Nucleosome; Animals; Model organism; Caenorhabditis elegans; Molecular Biology; Sequence (medicine); Genome; biology; Settore INF/01 - Informatica; ved/biology; Computer Science Application; Chromatin; Computer Science Applications; Chromatin; Nucleosomes; Computational Mathematics; 030104 developmental biology; Histone; Eukaryotic Cells; Computational Theory and Mathematics; biology.protein; Computer Vision and Pattern Recognition; Sequence Analysis; 030217 neurology & neurosurgery

A Clonal Lineage of Fusarium oxysporum Circulates in the Tap Water of Different French Hospitals.

2016

Fusarium oxysporum is typically a soilborne fungus but can also be found in aquatic environments. In hospitals, water distribution systems may be reservoirs for the fungi responsible for nosocomial infections. F. oxysporum was previously detected in the water distribution systems of five French hospitals. Sixty-eight isolates from water representative of all hospital units that were previously sampled and characterized by translation elongation factor 1α sequence typing were subjected to microsatellite analysis and full-length ribosomal intergenic spacer (IGS) sequence typing. All but three isolates shared common microsatellite loci and a common two-locus sequence type (ST). This S…

0301 basic medicine; System; Veterinary medicine; Lineage (genetic); Sequence analysis; 030106 microbiology; Biology; Infections; Applied Microbiology and Biotechnology; Microbiology; 03 medical and health sciences; Intergenic region; Origin; Peptide Elongation Factor 1; Fusarium; Phylogenetics; Fusarium oxysporum; [SDV.IDA]Life Sciences [q-bio]/Food engineering; Humans; Typing; Drinking-water; DNA Fungal; Phylogeny; Vegetative compatibility groups; Diversity; Ecology; Public and Environmental Health Microbiology; Drinking Water; [ SDV.IDA ] Life Sciences [q-bio]/Food engineering; Fungi; Australia; food and beverages; Sequence Analysis DNA; Ribosomal RNA; biology.organism_classification; Hospitals; 030104 developmental biology; Fusariosis; Microsatellite; DNA Intergenic; France; Food Science; Biotechnology; Microsatellite Repeats; Applied and environmental microbiology

Inhabiting plant roots, nematodes, and truffles—polyphilus, a new helotialean genus with two globally distributed species

2018

Fungal root endophytes, including the common group of dark septate endophytes (DSEs), represent different taxonomic groups and potentially diverse life strategies. In this study, we investigated two unidentified helotialean lineages found previously in a study of DSE fungi of semiarid grasslands, from several other sites, and collected recently from a pezizalean truffle ascoma and eggs of the cereal cyst nematode Heterodera filipjevi. The taxonomic positions and phylogenetic relationships of 21 isolates with different hosts and geographic origins were studied in detail. Four loci, namely, nuc rDNA ITS1-5.8S-ITS2 (internal transcribed spacer [ITS]), partial 28S nuc rDNA (28S), partial 18S nu…

0301 basic medicine; Systematic; Zygote; Physiology; Leotiomycetes; Hyaloscyphaceae; DNA Ribosomal; Plant Roots; 03 medical and health sciences; Ascomycota; Phylogenetics; DNA Ribosomal Spacer; RNA Ribosomal 28S; Botany; RNA Ribosomal 18S; Genetics; Animals; Cluster Analysis; Tylenchoidea; Internal transcribed spacer; DNA Fungal; Molecular Biology; Ribosomal DNA; Phylogeny; Ecology Evolution Behavior and Systematics; ComputingMilieux_MISCELLANEOUS; Taxonomy; [SDV.EE]Life Sciences [q-bio]/Ecology environment; Heterodera filipjevi; Cereal cyst nematode; biology; Phylogenetic tree; 3 new taxa; Sequence Analysis DNA; Cell Biology; General Medicine; 15. Life on land; 030108 mycology & parasitology; biology.organism_classification; Endophyte; RNA Ribosomal 5.8S; 030104 developmental biology; Helotiales; RNA Polymerase II; Hyaloscyphaceae; Mycologia

Evolutionary History of the Nesophontidae, the Last Unplaced Recent Mammal Family

2016

The mammalian evolutionary tree has lost several major clades through recent human-caused extinctions. This process of historical biodiversity loss has particularly affected tropical island regions such as the Caribbean, an area of great evolutionary diversification but poor molecular preservation. The most enigmatic of the recently extinct endemic Caribbean mammals are the Nesophontidae, a family of morphologically plesiomorphic lipotyphlan insectivores with no consensus on their evolutionary affinities, and which constitute the only major recent mammal clade to lack any molecular information on their phylogenetic placement. Here, we use a palaeogenomic approach to place Nesophontidae with…

0301 basic medicine; Systematics; West Indies; Lineage (evolution); Zoology; Biology; Nesophontes; DNA Mitochondrial; 03 medical and health sciences; Phylogenetics; Genetics; Animals; DNA Ancient; Clade; Molecular Biology; Phylogeny; Ecology Evolution Behavior and Systematics; Phylogenetic tree; Eulipotyphla; Biodiversity; Sequence Analysis DNA; biology.organism_classification; Biological Evolution; 030104 developmental biology; Ancient DNA; Genome Mitochondrial; Mammal; Molecular Biology and Evolution

Parallel and Space-Efficient Construction of Burrows-Wheeler Transform and Suffix Array for Big Genome Data

2016

Next-generation sequencing technologies have led to the sequencing of more and more genomes, propelling related research into the era of big data. In this paper, we present ParaBWT, a parallelized Burrows-Wheeler transform (BWT) and suffix array construction algorithm for big genome data. In ParaBWT, we have investigated a progressive construction approach to constructing the BWT of single genome sequences in linear space complexity, but with a small constant factor. This approach has been further parallelized using multi-threading based on a master-slave coprocessing model. After gaining the BWT, the suffix array is constructed in a memory-efficient manner. The performance of ParaBWT has b…

0301 basic medicine; Theoretical computer science; Burrows–Wheeler transform; Computer science; Genomics; Data_CODINGANDINFORMATIONTHEORY; Parallel computing; Genome; law.invention; 03 medical and health sciences; law; Genetics; Humans; Ensembl; Multi-core processor; Applied Mathematics; Linear space; Suffix array; Chromosome Mapping; High-Throughput Nucleotide Sequencing; Genomics; Sequence Analysis DNA; 030104 developmental biology; Algorithms; Biotechnology; Reference genome; IEEE/ACM Transactions on Computational Biology and Bioinformatics

Accelerating metagenomic read classification on CUDA-enabled GPUs.

2016

Metagenomic sequencing studies are becoming increasingly popular, with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification, i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes, software tools for fast and accurate metagenomic read classification are urgently needed. We present cuCLARK, a read-level classifier for CUDA-enabled GPUs, based on the fast and accurate classification of metagenomic sequences using reduced k-mers (…

0301 basic medicine; Theoretical computer science; Workstation; GPUs; Computer science; Context (language use); CUDA; Parallel computing; Biochemistry; Genome; law.invention; 03 medical and health sciences; CUDA; User-Computer Interface; 0302 clinical medicine; Structural Biology; law; Taxonomic assignment; Humans; Microbiome; Molecular Biology; Internet; Xeon; Applied Mathematics; High-Throughput Nucleotide Sequencing; Sequence Analysis DNA; Exact k-mer matching; Computer Science Applications; 030104 developmental biology; Titan (supercomputer); Metagenomics; 030220 oncology & carcinogenesis; Metagenomics; DNA microarray; Software; BMC bioinformatics