Search results for "genomic"
showing 10 items of 1737 documents
MetaCache: context-aware classification of metagenomic reads using minhashing.
2017
Abstract Motivation Metagenomic shotgun sequencing studies are becoming increasingly popular with prominent examples including the sequencing of human microbiomes and diverse environments. A fundamental computational problem in this context is read classification, i.e. the assignment of each read to a taxonomic label. Due to the large number of reads produced by modern high-throughput sequencing technologies and the rapidly increasing number of available reference genomes corresponding software tools suffer from either long runtimes, large memory requirements or low accuracy. Results We introduce MetaCache—a novel software for read classification using the big data technique minhashing. Our…
Small RNA-seq analysis of circulating miRNAs to identify phenotypic variability in Friedreich's ataxia patients.
2018
AbstractFriedreich’s ataxia (FRDA; OMIM 229300), an autosomal recessive neurodegenerative mitochondrial disease, is the most prevalent hereditary ataxia. In addition, FRDA patients have shown additional non-neurological features such as scoliosis, diabetes, and cardiac complications. Hypertrophic cardiomyopathy, which is found in two thirds of patients at the time of diagnosis, is the primary cause of death in these patients. Here, we used small RNA-seq of microRNAs (miRNAs) purified from plasma samples of FRDA patients and controls. Furthermore, we present the rationale, experimental methodology, and analytical procedures for dataset analysis. This dataset will facilitate the identificatio…
A generalization of Kingman's model of selection and mutation and the Lenski experiment.
2017
Kingman’s model of selection and mutation studies the limit type value distribution in an asexual population of discrete generations and infinite size undergoing selection and mutation. This paper generalizes the model to analyze the long-term evolution of Escherichia. coli in Lenski experiment. Weak assumptions for fitness functions are proposed and the mutation mechanism is the same as in Kingman’s model. General macroscopic epistasis are designable through fitness functions. Convergence to the unique limit type distribution is obtained.
panISa: ab initio detection of insertion sequences in bacterial genomes from short read sequence data.
2018
Abstract Motivation The advent of next-generation sequencing has boosted the analysis of bacterial genome evolution. Insertion sequence (IS) elements play a key role in prokaryotic genome organization and evolution, but their repetitions in genomes complicate their detection from short-read data. Results PanISa is a software pipeline that identifies IS insertions ab initio in bacterial genomes from short-read data. It is a highly sensitive and precise tool based on the detection of read-mapping patterns at the insertion site. PanISa performs better than existing IS detection systems as it is based on a database-free approach. We applied it to a high-risk clone lineage of the pathogenic spec…
Variance component analysis to assess protein quantification in biomarker discovery. Application to MALDI-TOF mass spectrometry.
2017
International audience; Controlling the technological variability on an analytical chain is critical for biomarker discovery. The sources of technological variability should be modeled, which calls for specific experimental design, signal processing, and statistical analysis. Furthermore, with unbalanced data, the various components of variability cannot be estimated with the sequential or adjusted sums of squares of usual software programs. We propose a novel approach to variance component analysis with application to the matrix-assisted laser desorption/ionization time-of-flight (MALDI-TOF) technology and use this approach for protein quantification by a classical signal processing algori…
Reference genome assessment from a population scale perspective: an accurate profile of variability and noise.
2017
Abstract Motivation Current plant and animal genomic studies are often based on newly assembled genomes that have not been properly consolidated. In this scenario, misassembled regions can easily lead to false-positive findings. Despite quality control scores are included within genotyping protocols, they are usually employed to evaluate individual sample quality rather than reference sequence reliability. We propose a statistical model that combines quality control scores across samples in order to detect incongruent patterns at every genomic region. Our model is inherently robust since common artifact signals are expected to be shared between independent samples over misassembled regions …
AFS: identification and quantification of species composition by metagenomic sequencing
2017
Abstract Summary DNA-based methods to detect and quantify taxon composition in biological materials are often based on species-specific polymerase chain reaction, limited to detecting species targeted by the assay. Next-generation sequencing overcomes this drawback by untargeted shotgun sequencing of whole metagenomes at affordable cost. Here we present AFS, a software pipeline for quantification of species composition in food. AFS uses metagenomic shotgun sequencing and sequence read counting to infer species proportions. Using Illumina data from a reference sausage comprising four species, we reveal that AFS is independent of the sequencing assay and library preparation protocol. Cost-sav…
Evolutionary distances corrected for purifying selection and ancestral polymorphisms.
2019
Abstract Evolutionary distance formulas that take into account effects due to ancestral polymorphisms and purifying selection are obtained on the basis of the full solution of Jukes–Cantor and Kimura DNA substitution models. In the case of purifying selection two different methods are developed. It is shown that avoiding the dimensional reduction implicitly carried out in the conventional model solving is instrumental to incorporate the quoted effects into the formalism. The problem of estimating the numerical values of the model parameters, as well as those of the correction terms, is not addressed.
The Neolithic Transition in the Baltic Was Not Driven by Admixture with Early European Farmers
2017
Summary The Neolithic transition was a dynamic time in European prehistory of cultural, social, and technological change. Although this period has been well explored in central Europe using ancient nuclear DNA [1, 2], its genetic impact on northern and eastern parts of this continent has not been as extensively studied. To broaden our understanding of the Neolithic transition across Europe, we analyzed eight ancient genomes: six samples (four to ∼1- to 4-fold coverage) from a 3,500 year temporal transect (∼8,300–4,800 calibrated years before present) through the Baltic region dating from the Mesolithic to the Late Neolithic and two samples spanning the Mesolithic-Neolithic boundary from the…
A Vastly Increased Chemical Variety of RNA Modifications Containing a Thioacetal Structure
2018
International audience; Recently discovered new chemical entities in RNA modifications have involved surprising functional groups that enlarge the chemical space of RNA. Using LC-MS, we found over 100 signals of RNA constituents that contained a ribose moiety in tRNAs from E. coli. Feeding experiments with variegated stable isotope labeled compounds identified 37 compounds that are new structures of RNA modifications. One structure was elucidated by deuterium exchange and high-resolution mass spectrometry. The structure of msms2 i6 A (2-methylthiomethylenethio-N6-isopentenyl-adenosine) was confirmed by methione-D3 feeding experiments and by synthesis of the nucleobase. The msms2 i6 A contai…