Search results for "sequence"

showing 10 items of 4987 documents

dAPE: a web server to detect homorepeats and follow their evolution.

2016

Summary: Homorepeats are low-complexity regions consisting of repetitions of a single amino acid residue. There is no current consensus on the minimum number of residues needed to define a functional homorepeat, nor even on whether mismatches are allowed. Here we present dAPE, a web server that helps follow the evolution of homorepeats based on orthology information, using a sensitive but tunable cutoff to help identify emerging homorepeats. Availability and Implementation: dAPE can be accessed from http://cbdm-01.zdv.uni-mainz.de/~munoz/polyx. Supplementary information: Supplementary data are available at Bioinformatics online.

0301 basic medicine; Statistics and Probability; Repetitive Sequences Amino Acid; Web server; Internet; Computer science; computer.software_genre; Biochemistry; Applications Notes; Computer Science Applications; World Wide Web; Evolution Molecular; 03 medical and health sciences; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; Animals; Humans; Data mining; Molecular Biology; computer; Sequence Alignment; Sequence Analysis; Software; Bioinformatics (Oxford, England)

researchProduct

AFS: identification and quantification of species composition by metagenomic sequencing

2017

Summary: DNA-based methods to detect and quantify taxon composition in biological materials are often based on species-specific polymerase chain reaction, limited to detecting species targeted by the assay. Next-generation sequencing overcomes this drawback by untargeted shotgun sequencing of whole metagenomes at affordable cost. Here we present AFS, a software pipeline for quantification of species composition in food. AFS uses metagenomic shotgun sequencing and sequence read counting to infer species proportions. Using Illumina data from a reference sausage comprising four species, we reveal that AFS is independent of the sequencing assay and library preparation protocol. Cost-sav…

0301 basic medicine; Statistics and Probability; Sequence analysis; Library preparation; Computational biology; Biology; Bioinformatics; Biochemistry; law.invention; 03 medical and health sciences; 0404 agricultural biotechnology; law; Molecular Biology; Polymerase chain reaction; Shotgun sequencing; High-Throughput Nucleotide Sequencing; Sequence Analysis DNA; 04 agricultural and veterinary sciences; Accession number (bioinformatics); 040401 food science; Biological materials; Computer Science Applications; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; Metagenomics; Food Microbiology; Identification (biology); Metagenomics; Software; Bioinformatics

researchProduct

MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems

2016

This is a pre-copyedited, author-produced version of an article accepted for publication in Bioinformatics following peer review. The version of record, Jorge González-Domínguez, Yongchao Liu, Juan Touriño, Bertil Schmidt; MSAProbs-MPI: parallel multiple sequence aligner for distributed-memory systems, Bioinformatics, Volume 32, Issue 24, 15 December 2016, Pages 3826–3828, https://doi.org/10.1093/bioinformatics/btw558, is available online at: https://doi.org/10.1093/bioinformatics/btw558 [Abstract] MSAProbs is a state-of-the-art protein multiple sequence alignment tool based on hidden Markov models. It can achieve high alignment accuracy at the expense of relatively long runtimes for large-sca…

0301 basic medicine; Statistics and Probability; Source code; Computer science; media_common.quotation_subject; 02 engineering and technology; Parallel computing; computer.software_genre; Biochemistry; Execution time; 03 medical and health sciences; 0202 electrical engineering electronic engineering information engineering; Cluster (physics); Point (geometry); Amino Acid Sequence; Molecular Biology; media_common; Sequence; Multiple sequence alignment; Protein multiple sequence; Computational Biology; Proteins; Markov Chains; Computer Science Applications; Computational Mathematics; 030104 developmental biology; Computational Theory and Mathematics; Distributed memory systems; MSAProbs; 020201 artificial intelligence & image processing; MPI; Data mining; Sequence Alignment; computer; Algorithms; Software

researchProduct

In vitro versus in vivo compositional landscapes of histone sequence preferences in eucaryotic genomes

2018

Motivation: Although the nucleosome occupancy along a genome can be in part predicted by in vitro experiments, it has been recently observed that the chromatin organization presents important differences in vitro with respect to in vivo. Such differences mainly regard the hierarchical and regular structures of the nucleosome fiber, whose existence has long been assumed, and in part also observed in vitro, but that does not apparently occur in vivo. It is also well known that the DNA sequence has a role in determining the nucleosome occupancy. Therefore, an important issue is to understand if, and to what extent, the structural differences in the chromatin organization between in vit…

0301 basic medicine; Statistics and Probability; ved/biology.organism_classification_rank.species; Computational biology; Saccharomyces cerevisiae; Genome; Biochemistry; DNA sequencing; Histones; 03 medical and health sciences; 0302 clinical medicine; In vivo; Computational Theory and Mathematic; Nucleosome; Animals; Model organism; Caenorhabditis elegans; Molecular Biology; Sequence (medicine); Genome; biology; Settore INF/01 - Informatica; ved/biology; Computer Science Application; Chromatin; Computer Science Applications; Chromatin; Nucleosomes; Computational Mathematics; 030104 developmental biology; Histone; Eukaryotic Cells; Computational Theory and Mathematics; biology.protein; Computer Vision and Pattern Recognition; Sequence Analysis; 030217 neurology & neurosurgery

researchProduct

A Clonal Lineage of Fusarium oxysporum Circulates in the Tap Water of Different French Hospitals.

2016

Fusarium oxysporum is typically a soilborne fungus but can also be found in aquatic environments. In hospitals, water distribution systems may be reservoirs for the fungi responsible for nosocomial infections. F. oxysporum was previously detected in the water distribution systems of five French hospitals. Sixty-eight isolates from water representative of all hospital units that were previously sampled and characterized by translation elongation factor 1α sequence typing were subjected to microsatellite analysis and full-length ribosomal intergenic spacer (IGS) sequence typing. All but three isolates shared common microsatellite loci and a common two-locus sequence type (ST). This S…

0301 basic medicine; System; Veterinary medicine; Lineage (genetic); Sequence analysis; 030106 microbiology; Biology; Infections; Applied Microbiology and Biotechnology; Microbiology; 03 medical and health sciences; Intergenic region; Origin; Peptide Elongation Factor 1; Fusarium; Phylogenetics; Fusarium oxysporum; [SDV.IDA]Life Sciences [q-bio]/Food engineering; Humans; Typing; Drinking-water; DNA Fungal; Phylogeny; Vegetative compatibility groups; Diversity; Ecology; Public and Environmental Health Microbiology; Drinking Water; [ SDV.IDA ] Life Sciences [q-bio]/Food engineering; Fungi; Australia; food and beverages; Sequence Analysis DNA; Ribosomal RNA; biology.organism_classification; Hospitals; 030104 developmental biology; Fusariosis; Microsatellite; DNA Intergenic; France; Food Science; Biotechnology; Microsatellite Repeats; Applied and environmental microbiology

researchProduct

Inhabiting plant roots, nematodes, and truffles—polyphilus, a new helotialean genus with two globally distributed species

2018

Fungal root endophytes, including the common group of dark septate endophytes (DSEs), represent different taxonomic groups and potentially diverse life strategies. In this study, we investigated two unidentified helotialean lineages found previously in a study of DSE fungi of semiarid grasslands, from several other sites, and collected recently from a pezizalean truffle ascoma and eggs of the cereal cyst nematode Heterodera filipjevi. The taxonomic positions and phylogenetic relationships of 21 isolates with different hosts and geographic origins were studied in detail. Four loci, namely, nuc rDNA ITS1-5.8S-ITS2 (internal transcribed spacer [ITS]), partial 28S nuc rDNA (28S), partial 18S nu…

0301 basic medicine; Systematic; Zygote; Physiology; Leotiomycetes; Hyaloscyphaceae; DNA Ribosomal; Plant Roots; 03 medical and health sciences; Ascomycota; Phylogenetics; DNA Ribosomal Spacer; RNA Ribosomal 28S; Botany; RNA Ribosomal 18S; Genetics; Animals; Cluster Analysis; Tylenchoidea; Internal transcribed spacer; DNA Fungal; Molecular Biology; Ribosomal DNA; Phylogeny; Ecology Evolution Behavior and Systematics; ComputingMilieux_MISCELLANEOUS; Taxonomy; [SDV.EE]Life Sciences [q-bio]/Ecology environment; Heterodera filipjevi; Cereal cyst nematode; biology; Phylogenetic tree; 3 new taxa; Sequence Analysis DNA; Cell Biology; General Medicine; 15. Life on land; 030108 mycology & parasitology; biology.organism_classification; Endophyte; RNA Ribosomal 5.8S; 030104 developmental biology; Helotiales; RNA Polymerase II; Hyaloscyphaceae; Mycologia

researchProduct

Evolutionary History of the Nesophontidae, the Last Unplaced Recent Mammal Family

2016

The mammalian evolutionary tree has lost several major clades through recent human-caused extinctions. This process of historical biodiversity loss has particularly affected tropical island regions such as the Caribbean, an area of great evolutionary diversification but poor molecular preservation. The most enigmatic of the recently extinct endemic Caribbean mammals are the Nesophontidae, a family of morphologically plesiomorphic lipotyphlan insectivores with no consensus on their evolutionary affinities, and which constitute the only major recent mammal clade to lack any molecular information on their phylogenetic placement. Here, we use a palaeogenomic approach to place Nesophontidae with…

0301 basic medicine; Systematics; West Indies; Lineage (evolution); Zoology; Biology; Nesophontes; DNA Mitochondrial; 03 medical and health sciences; Phylogenetics; Genetics; Animals; DNA Ancient; Clade; Molecular Biology; Phylogeny; Ecology Evolution Behavior and Systematics; Phylogenetic tree; Eulipotyphla; Biodiversity; Sequence Analysis DNA; biology.organism_classification; Biological Evolution; 030104 developmental biology; Ancient DNA; Genome Mitochondrial; Mammal; Molecular Biology and Evolution

researchProduct

An effective extension of the applicability of alignment-free biological sequence comparison algorithms with Hadoop

2016

Alignment-free methods are one of the mainstays of biological sequence comparison, i.e., the assessment of how similar two biological sequences are to each other, a fundamental and routine task in computational biology and bioinformatics. They have gained popularity since, even on standard desktop machines, they are faster than methods based on alignments. However, with the advent of Next-Generation Sequencing Technologies, datasets whose size, i.e., number of sequences and their total length, is a challenge to the execution of alignment-free methods on those standard machines are quite common. Here, we propose the first paradigm for the computation of k-mer-based alignment-free methods for…

0301 basic medicine; Theoretical computer science; 030102 biochemistry & molecular biology; Settore INF/01 - Informatica; Computer science; Computation; Extension (predicate logic); Information System; Hash table; Distributed computing; Task (project management); Theoretical Computer Science; 03 medical and health sciences; 030104 developmental biology; Alignment-free sequence comparison and analysis; Hadoop; Hardware and Architecture; alignment-free sequence comparison and analysis; distributed computing; Hadoop; MapReduce; software; theoretical computer science; information systems; hardware and architecture; Sequence comparison; MapReduce; Alignment-free sequence comparison and analysi; Alignment-free sequence comparison and analysis; Distributed computing; Hadoop; MapReduce; Theoretical Computer Science; Software; Information Systems; Hardware and Architecture; Software; Information Systems

researchProduct

Parallel and Space-Efficient Construction of Burrows-Wheeler Transform and Suffix Array for Big Genome Data

2016

Next-generation sequencing technologies have led to the sequencing of more and more genomes, propelling related research into the era of big data. In this paper, we present ParaBWT, a parallelized Burrows-Wheeler transform (BWT) and suffix array construction algorithm for big genome data. In ParaBWT, we have investigated a progressive construction approach to constructing the BWT of single genome sequences in linear space complexity, but with a small constant factor. This approach has been further parallelized using multi-threading based on a master-slave coprocessing model. After gaining the BWT, the suffix array is constructed in a memory-efficient manner. The performance of ParaBWT has b…

0301 basic medicine; Theoretical computer science; Burrows–Wheeler transform; Computer science; Genomics; Data_CODINGANDINFORMATIONTHEORY; Parallel computing; Genome; law.invention; 03 medical and health sciences; law; Genetics; Humans; Ensembl; Multi-core processor; Applied Mathematics; Linear space; Suffix array; Chromosome Mapping; High-Throughput Nucleotide Sequencing; Genomics; Sequence Analysis DNA; 030104 developmental biology; Algorithms; Biotechnology; Reference genome; IEEE/ACM Transactions on Computational Biology and Bioinformatics

researchProduct

A detailed experimental study of a DNA computer with two endonucleases

2017

Great advances in biotechnology have allowed the construction of a computer from DNA. One of the proposed solutions is a biomolecular finite automaton, a simple two-state DNA computer without memory, which was presented by Ehud Shapiro’s group at the Weizmann Institute of Science. The main problem with this computer, in which biomolecules carry out logical operations, is its complexity – increasing the number of states of biomolecular automata. In this study, we constructed (in laboratory conditions) a six-state DNA computer that uses two endonucleases (e.g. AcuI and BbvI) and a ligase. We have presented a detailed experimental verification of its feasibility. We described the effe…

0301 basic medicine; Theoretical computer science; DNA Ligases; Computer science; Carry (arithmetic); Oligonucleotides; 0102 computer and information sciences; Bioinformatics; 01 natural sciences; General Biochemistry Genetics and Molecular Biology; law.invention; Automation; Computers Molecular; 03 medical and health sciences; DNA computing; law; A-DNA; Deoxyribonucleases Type II Site-Specific; chemistry.chemical_classification; DNA ligase; Finite-state machine; Base Sequence; biomolecular computers; DNA computing; finite automata; Process (computing); DNA; Models Theoretical; Endonucleases; Automaton; 030104 developmental biology; chemistry; 010201 computation theory & mathematics; Word (computer architecture); Zeitschrift für Naturforschung C

researchProduct