Search results for "genomic"

showing 10 items of 1737 documents

MetaCache: context-aware classification of metagenomic reads using minhashing.

2017

Abstract Motivation Metagenomic shotgun sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification, i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes corresponding software tools suffer from either long runtimes, large memory requirements or low accuracy. Results We introduce MetaCache—a novel software for read classification using the big data technique minhashing. Our…

0301 basic medicineStatistics and ProbabilityComputer scienceSequence analysisContext (language use)BiochemistryGenome03 medical and health scienceschemistry.chemical_compound0302 clinical medicineRefSeqHumansMolecular BiologyInformation retrievalShotgun sequencingHigh-Throughput Nucleotide SequencingSequence Analysis DNAComputer Science ApplicationsComputational Mathematics030104 developmental biologyComputational Theory and MathematicschemistryMetagenomicsMetagenomics030217 neurology & neurosurgeryDNAAlgorithmsSoftwareReference genomeBioinformatics (Oxford, England)
researchProduct

Small RNA-seq analysis of circulating miRNAs to identify phenotypic variability in Friedreich's ataxia patients.

2018

AbstractFriedreich’s ataxia (FRDA; OMIM 229300), an autosomal recessive neurodegenerative mitochondrial disease, is the most prevalent hereditary ataxia. In addition, FRDA patients have shown additional non-neurological features such as scoliosis, diabetes, and cardiac complications. Hypertrophic cardiomyopathy, which is found in two thirds of patients at the time of diagnosis, is the primary cause of death in these patients. Here, we used small RNA-seq of microRNAs (miRNAs) purified from plasma samples of FRDA patients and controls. Furthermore, we present the rationale, experimental methodology, and analytical procedures for dataset analysis. This dataset will facilitate the identificatio…

0301 basic medicineStatistics and ProbabilityEpigenomicsSmall RNAData DescriptorAtaxiaMitochondrial diseaseLibrary and Information SciencesBioinformaticsEducation03 medical and health sciences0302 clinical medicinemicroRNAMedicineHumansCirculating MicroRNAPathologicalCause of deathbusiness.industrySequence Analysis RNAHypertrophic cardiomyopathyNeuromuscular diseasemedicine.diseasePhenotypeComputer Science Applications030104 developmental biologyFriedreich AtaxiaNext-generation sequencingmedicine.symptomStatistics Probability and Uncertaintybusiness030217 neurology & neurosurgeryInformation SystemsScientific data
researchProduct

A generalization of Kingman's model of selection and mutation and the Lenski experiment.

2017

Kingman’s model of selection and mutation studies the limit type value distribution in an asexual population of discrete generations and infinite size undergoing selection and mutation. This paper generalizes the model to analyze the long-term evolution of Escherichia. coli in Lenski experiment. Weak assumptions for fitness functions are proposed and the mutation mechanism is the same as in Kingman’s model. General macroscopic epistasis are designable through fitness functions. Convergence to the unique limit type distribution is obtained.

0301 basic medicineStatistics and ProbabilityGeneralizationPopulationBiology01 natural sciencesModels BiologicalGeneral Biochemistry Genetics and Molecular Biology010104 statistics & probability03 medical and health sciencesStatisticsEscherichia coliApplied mathematicsQuantitative Biology::Populations and EvolutionLimit (mathematics)0101 mathematicsSelection GeneticeducationSelection (genetic algorithm)education.field_of_studyFitness functionGeneral Immunology and MicrobiologyApplied MathematicsGeneral MedicineQuantitative Biology::GenomicsBiological Evolution030104 developmental biologyDistribution (mathematics)Modeling and SimulationMutation (genetic algorithm)MutationEpistasisGeneral Agricultural and Biological SciencesMathematical biosciences
researchProduct

panISa: ab initio detection of insertion sequences in bacterial genomes from short read sequence data.

2018

Abstract Motivation The advent of next-generation sequencing has boosted the analysis of bacterial genome evolution. Insertion sequence (IS) elements play a key role in prokaryotic genome organization and evolution, but their repetitions in genomes complicate their detection from short-read data. Results PanISa is a software pipeline that identifies IS insertions ab initio in bacterial genomes from short-read data. It is a highly sensitive and precise tool based on the detection of read-mapping patterns at the insertion site. PanISa performs better than existing IS detection systems as it is based on a database-free approach. We applied it to a high-risk clone lineage of the pathogenic spec…

0301 basic medicineStatistics and ProbabilityLineage (genetic)Computer scienceAb initioComputational biologyBacterial genome size[INFO.INFO-SE]Computer Science [cs]/Software Engineering [cs.SE]BiochemistryGenome[INFO.INFO-IU]Computer Science [cs]/Ubiquitous Computing03 medical and health sciences[INFO.INFO-CR]Computer Science [cs]/Cryptography and Security [cs.CR][SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry Molecular Biology/Genomics [q-bio.GN]Insertion sequenceMolecular BiologyGenomic organizationHigh-Throughput Nucleotide SequencingSequence Analysis DNA[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM][SDV.MP.BAC]Life Sciences [q-bio]/Microbiology and Parasitology/BacteriologyPipeline (software)[INFO.INFO-MO]Computer Science [cs]/Modeling and SimulationComputer Science ApplicationsComputational Mathematics030104 developmental biologyComputational Theory and Mathematics[INFO.INFO-MA]Computer Science [cs]/Multiagent Systems [cs.MA]DNA Transposable Elements[INFO.INFO-ET]Computer Science [cs]/Emerging Technologies [cs.ET][INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC]Genome BacterialSoftwareBioinformatics (Oxford, England)
researchProduct

Variance component analysis to assess protein quantification in biomarker discovery. Application to MALDI-TOF mass spectrometry.

2017

International audience; Controlling the technological variability on an analytical chain is critical for biomarker discovery. The sources of technological variability should be modeled, which calls for specific experimental design, signal processing, and statistical analysis. Furthermore, with unbalanced data, the various components of variability cannot be estimated with the sequential or adjusted sums of squares of usual software programs. We propose a novel approach to variance component analysis with application to the matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) technology and use this approach for protein quantification by a classical signal processing algori…

0301 basic medicineStatistics and ProbabilityMALDI-TOFexperimental designBiometryprotein quantificationQuantitative proteomicsVariance component analysis[ CHIM ] Chemical Sciences01 natural sciencesSignaltechnological variability010104 statistics & probability03 medical and health sciencesstatistical analysis[INFO.INFO-TS]Computer Science [cs]/Signal and Image Processing[CHIM.ANAL]Chemical Sciences/Analytical chemistryComponent (UML)[SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry Molecular Biology/Genomics [q-bio.GN]biomarker discoverysum of squares type0101 mathematicsBiomarker discoverysignal processingMathematicsSignal processingAnalysis of Variance[ PHYS ] Physics [physics]Noise (signal processing)ProteinsGeneral MedicineVariance (accounting)[SDV.BIBS]Life Sciences [q-bio]/Quantitative Methods [q-bio.QM]030104 developmental biologySpectrometry Mass Matrix-Assisted Laser Desorption-IonizationLinear Modelsvariance components[ CHIM.ANAL ] Chemical Sciences/Analytical chemistryStatistics Probability and UncertaintyBiological systemAlgorithmsBiomarkersBiometrical journal. Biometrische Zeitschrift
researchProduct

Reference genome assessment from a population scale perspective: an accurate profile of variability and noise.

2017

Abstract Motivation Current plant and animal genomic studies are often based on newly assembled genomes that have not been properly consolidated. In this scenario, misassembled regions can easily lead to false-positive findings. Despite quality control scores are included within genotyping protocols, they are usually employed to evaluate individual sample quality rather than reference sequence reliability. We propose a statistical model that combines quality control scores across samples in order to detect incongruent patterns at every genomic region. Our model is inherently robust since common artifact signals are expected to be shared between independent samples over misassembled regions …

0301 basic medicineStatistics and ProbabilityQuality ControlGenotypeComputer sciencemedia_common.quotation_subjectPopulationGenomicsBioinformaticscomputer.software_genreBiochemistryGenome03 medical and health sciencesGenetic variationAnimalsHumansQuality (business)AlleleeducationMolecular BiologyGenotypingReliability (statistics)media_commonProtocol (science)education.field_of_studyGenomeModels StatisticalGenetic VariationReproducibility of ResultsGenomicsGenome AnalysisOriginal PapersComputer Science ApplicationsComputational Mathematics030104 developmental biologyComputational Theory and MathematicsData miningcomputerSoftwareReference genome
researchProduct

AFS: identification and quantification of species composition by metagenomic sequencing

2017

Abstract Summary DNA-based methods to detect and quantify taxon composition in biological materials are often based on species-specific polymerase chain reaction, limited to detecting species targeted by the assay. Next-generation sequencing overcomes this drawback by untargeted shotgun sequencing of whole metagenomes at affordable cost. Here we present AFS, a software pipeline for quantification of species composition in food. AFS uses metagenomic shotgun sequencing and sequence read counting to infer species proportions. Using Illumina data from a reference sausage comprising four species, we reveal that AFS is independent of the sequencing assay and library preparation protocol. Cost-sav…

0301 basic medicineStatistics and ProbabilitySequence analysisLibrary preparationComputational biologyBiologyBioinformaticsBiochemistrylaw.invention03 medical and health sciences0404 agricultural biotechnologylawMolecular BiologyPolymerase chain reactionShotgun sequencingHigh-Throughput Nucleotide SequencingSequence Analysis DNA04 agricultural and veterinary sciencesAccession number (bioinformatics)040401 food scienceBiological materialsComputer Science ApplicationsComputational Mathematics030104 developmental biologyComputational Theory and MathematicsMetagenomicsFood MicrobiologyIdentification (biology)MetagenomicsSoftwareBioinformatics
researchProduct

Evolutionary distances corrected for purifying selection and ancestral polymorphisms.

2019

Abstract Evolutionary distance formulas that take into account effects due to ancestral polymorphisms and purifying selection are obtained on the basis of the full solution of Jukes–Cantor and Kimura DNA substitution models. In the case of purifying selection two different methods are developed. It is shown that avoiding the dimensional reduction implicitly carried out in the conventional model solving is instrumental to incorporate the quoted effects into the formalism. The problem of estimating the numerical values of the model parameters, as well as those of the correction terms, is not addressed.

0301 basic medicineStatistics and ProbabilityTime FactorsADNModel parametersGeneral Biochemistry Genetics and Molecular Biology03 medical and health sciencesNegative selection0302 clinical medicineQuantitative Biology::Populations and EvolutionStatistical physicsSelection GeneticMolecular clockPhylogenyMathematicsPolymorphism GeneticGeneral Immunology and MicrobiologyApplied MathematicsGeneral MedicineModels biològicsQuantitative Biology::GenomicsBiological EvolutionFormalism (philosophy of mathematics)030104 developmental biologyDimensional reductionModeling and SimulationMutationGeneral Agricultural and Biological Sciences030217 neurology & neurosurgeryEvolució (Biologia)Journal of theoretical biology
researchProduct

The Neolithic Transition in the Baltic Was Not Driven by Admixture with Early European Farmers

2017

Summary The Neolithic transition was a dynamic time in European prehistory of cultural, social, and technological change. Although this period has been well explored in central Europe using ancient nuclear DNA [1, 2], its genetic impact on northern and eastern parts of this continent has not been as extensively studied. To broaden our understanding of the Neolithic transition across Europe, we analyzed eight ancient genomes: six samples (four to ∼1- to 4-fold coverage) from a 3,500 year temporal transect (∼8,300–4,800 calibrated years before present) through the Baltic region dating from the Mesolithic to the Late Neolithic and two samples spanning the Mesolithic-Neolithic boundary from the…

0301 basic medicineSteppeHuman MigrationPopulation geneticsBalticBiologyGeneral Biochemistry Genetics and Molecular BiologyWhite PeoplePrehistory03 medical and health sciences0302 clinical medicineCultural EvolutionReportgenomicsHumansDNA Ancientancient DNAMesolithicHistory Ancient2. Zero hungergeographygeography.geographical_feature_categoryFarmersAgricultural and Biological Sciences(all)Human migrationbusiness.industryGenome HumanBiochemistry Genetics and Molecular Biology(all)population geneticsAgricultureBefore PresentArchaeologyLatviaNeolithic transition030104 developmental biologyAncient DNAArchaeologyPeriod (geology)General Agricultural and Biological SciencesbusinessUkraine030217 neurology & neurosurgery
researchProduct

A Vastly Increased Chemical Variety of RNA Modifications Containing a Thioacetal Structure

2018

International audience; Recently discovered new chemical entities in RNA modifications have involved surprising functional groups that enlarge the chemical space of RNA. Using LC-MS, we found over 100 signals of RNA constituents that contained a ribose moiety in tRNAs from E. coli. Feeding experiments with variegated stable isotope labeled compounds identified 37 compounds that are new structures of RNA modifications. One structure was elucidated by deuterium exchange and high-resolution mass spectrometry. The structure of msms2 i6 A (2-methylthiomethylenethio-N6-isopentenyl-adenosine) was confirmed by methione-D3 feeding experiments and by synthesis of the nucleobase. The msms2 i6 A contai…

0301 basic medicineStereochemistryThioacetal010402 general chemistry01 natural sciencesCatalysisNucleobaseisotope labelling03 medical and health scienceschemistry.chemical_compoundAcetalsRNA modificationsTandem Mass Spectrometry[SDV.BBM.GTP]Life Sciences [q-bio]/Biochemistry Molecular Biology/Genomics [q-bio.GN]RiboseEscherichia coliMoietySulfhydryl Compoundschemistry.chemical_classificationChemistrythioacetalsRNA[SDV.BBM.BM]Life Sciences [q-bio]/Biochemistry Molecular Biology/Molecular biologyGeneral Chemistryradical-SAM enzymesChemical space0104 chemical sciencesLC-MSRNA Bacterial030104 developmental biologyEnzymeNucleic Acid ConformationHydrogen–deuterium exchangeChromatography Liquid
researchProduct