Search results for "algorithm."
showing 10 items of 4617 documents
The genomic footprint of climate adaptation in Chironomus riparius
2017
The gradual heterogeneity of climatic factors produces continuously varying selection pressures across geographic distances that leave signatures of clinal variation in the genome. Separating signatures of clinal adaptation from signatures of other evolutionary forces, such as demographic processes, genetic drift, and adaptation to specific non-clinal conditions of the immediate local environment is a major challenge. Here, we examine climate adaptation in five natural populations of the non-biting midge Chironomus riparius sampled along a climatic gradient across Europe. Our study integrates experimental data, individual genome resequencing, Pool-Seq data, and population genetic modelling.…
Genome-Wide Analysis Reveals Selection Signatures Involved in Meat Traits and Local Adaptation in Semi-Feral Maremmana Cattle
2021
The Maremmana cattle is an ancient Podolian-derived Italian breed raised in semi-wild conditions with distinctive morphological and adaptive traits. The aim of this study was to detect potential selection signatures in Maremmana using a medium-density single nucleotide polymorphism array. Putative selection signatures were investigated by combining three statistical approaches designed to quantify the excess of haplotype homozygosity either within a population (integrated haplotype score, iHS) or between pairs of populations (Rsb and XP-EHH), and by contrasting the Maremmana with a single reference population composed of a pool of seven Podolian-derived Italian breeds. Overall, the three haplotype-based analyses …
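The three statistics named in this abstract (iHS, Rsb, XP-EHH) are all built on extended haplotype homozygosity (EHH). The following is a minimal sketch of the basic EHH quantity only, assuming phased haplotypes encoded as equal-length strings that all carry the same core allele; the function name and the input encoding are illustrative assumptions, not the study's pipeline.

```python
from collections import Counter
from math import comb


def ehh(haplotypes, core, end):
    """Extended haplotype homozygosity between two marker indices.

    haplotypes: equal-length allele strings, assumed to share the core allele.
    core, end:  marker indices; the interval between them is inclusive.
    Returns the probability that two haplotypes drawn at random (without
    replacement) are identical over the whole interval.
    """
    n = len(haplotypes)
    if n < 2:
        raise ValueError("need at least two haplotypes")
    lo, hi = min(core, end), max(core, end)
    # Group haplotypes that are identical across the interval.
    groups = Counter(h[lo:hi + 1] for h in haplotypes)
    # Identical pairs divided by all possible pairs.
    return sum(comb(c, 2) for c in groups.values()) / comb(n, 2)
```

EHH starts at 1 at the core marker and decays with distance as recombination breaks haplotypes apart. Statistics such as iHS integrate that decay curve and compare it between alleles or between populations.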
Novel and known signals of selection for fat deposition in domestic sheep breeds from Africa and Eurasia
2018
Genomic regions subjected to selection frequently show signatures such as within-population reduced nucleotide diversity and outlier values of differentiation among differentially selected populations. In this study, we analyzed 50K SNP genotype data of 373 animals belonging to 23 sheep breeds of different geographic origins using the Rsb (extended haplotype homozygosity) and FST statistical approaches, to identify loci associated with the fat-tail phenotype. We also checked if these putative selection signatures overlapped with regions of high-homozygosity (ROH). The analyses identified novel signals and confirmed the presence of selection signature in genomic regio…
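The abstract does not say which FST estimator was used. As a hedged illustration of the idea of differentiation outliers, the sketch below computes the textbook Wright definition, FST = (H_T - H_S) / H_T, for a single biallelic SNP from per-population allele frequencies with equal population weights. The function name and the weighting are assumptions made for this example.

```python
def fst_biallelic(freqs):
    """Wright's F_ST for one biallelic SNP.

    freqs: allele frequency of the same allele in each population (0..1).
    Populations are weighted equally (an assumption of this sketch).
    """
    if not freqs:
        raise ValueError("need at least one population")
    p_bar = sum(freqs) / len(freqs)
    # Expected heterozygosity of the pooled population.
    h_t = 2.0 * p_bar * (1.0 - p_bar)
    if h_t == 0.0:
        return 0.0  # the SNP is monomorphic overall, so no differentiation
    # Mean expected heterozygosity within populations.
    h_s = sum(2.0 * p * (1.0 - p) for p in freqs) / len(freqs)
    return (h_t - h_s) / h_t
```

A value near 0 means the populations share similar allele frequencies at the SNP. Values near 1 indicate nearly fixed differences, which is the kind of outlier a selection scan looks for.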
Quantum clustering in non-spherical data distributions: Finding a suitable number of clusters
2017
Quantum Clustering (QC) provides an alternative approach to clustering algorithms, several of which are based on geometric relationships between data points. Instead, QC makes use of quantum mechanics concepts to find structures (clusters) in data sets by finding the minima of a quantum potential. The starting point of QC is a Parzen estimator with a fixed length scale, which significantly affects the final cluster allocation. This dependence on an adjustable parameter is common to other methods. We propose a framework to find suitable values of the length parameter σ by optimising twin measures of cluster separation and consistency for a given cluster number. This is an extension of the Se…
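To make the role of the length scale σ concrete, here is a minimal one-dimensional sketch under the standard quantum clustering formulation. It assumes the Parzen estimator ψ(x) = Σ exp(-(x - x_i)² / (2σ²)) is treated as the ground state of a Schrödinger equation, which gives the potential V(x) up to an additive constant. The function names and the grid search are illustrative assumptions, and the σ-selection framework proposed in the paper is not reproduced here.

```python
from math import exp


def quantum_potential(x, points, sigma):
    """Quantum potential at x for 1-D data, up to an additive constant.

    Derived from V = E + (sigma^2 / 2) * psi'' / psi with a Gaussian
    Parzen estimator psi; the constant E - 1/2 is omitted because it
    does not move the minima.
    """
    weights = [exp(-((x - p) ** 2) / (2.0 * sigma ** 2)) for p in points]
    psi = sum(weights)
    if psi == 0.0:
        raise ValueError("x is too far from the data for this sigma")
    weighted_sq = sum(w * (x - p) ** 2 for w, p in zip(weights, points))
    return weighted_sq / (2.0 * sigma ** 2 * psi)


def potential_minima(points, sigma, lo, hi, steps=2000):
    """Grid locations in (lo, hi) where the potential has a strict local minimum."""
    grid = [lo + (hi - lo) * i / steps for i in range(steps + 1)]
    values = [quantum_potential(x, points, sigma) for x in grid]
    return [
        grid[i]
        for i in range(1, steps)
        if values[i] < values[i - 1] and values[i] < values[i + 1]
    ]
```

Each minimum is read as a cluster centre. Rerunning `potential_minima` with a larger `sigma` tends to merge nearby minima, which is the dependence on the length parameter that the abstract describes.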
Efficient Algorithms for Sequence Analysis with Entropic Profiles
2017
Entropy, being closely related to repetitiveness and compressibility, is a widely used information-related measure to assess the degree of predictability of a sequence. Entropic profiles are based on information theory principles, and can be used to study the under-/over-representation of subwords, by also providing information about the scale of conserved DNA regions. Here, we focus on the algorithmic aspects related to entropic profiles. In particular, we propose linear time algorithms for their computation that rely on suffix-based data structures, more specifically on the truncated suffix tree (TST) and on the enhanced suffix array (ESA). We performed an extensive experimental campaign …
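As a rough illustration of entropy as a measure of sequence predictability, the sketch below computes the plain empirical Shannon entropy of the k-mer distribution of a sequence. This is not the entropic profile defined in the paper, and it does not use the linear-time suffix-tree or suffix-array algorithms the paper proposes; the function name is an assumption made for this example.

```python
from collections import Counter
from math import log2


def kmer_entropy(seq, k):
    """Empirical Shannon entropy (in bits) of the overlapping k-mers of seq."""
    if k <= 0 or k > len(seq):
        raise ValueError("k must be between 1 and len(seq)")
    # Count every overlapping substring of length k.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = sum(counts.values())
    return -sum((c / total) * log2(c / total) for c in counts.values())
```

A highly repetitive sequence gives a value close to 0, while a sequence in which all k-mers are equally frequent gives the maximum, log2 of the number of distinct k-mers. This naive version costs time proportional to the sequence length times k, which is the kind of overhead suffix-based data structures are designed to avoid.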
Influence of pathway topology and functional class on the molecular evolution of human metabolic genes
2018
Metabolic networks comprise thousands of enzymatic reactions functioning in a controlled manner and have been shaped by natural selection. Thanks to the genome data, the footprints of adaptive (positive) selection are detectable, and the strength of purifying selection can be measured. This has made it possible to know where, in the metabolic network, adaptive selection has acted and where purifying selection is more or less strong and efficient. We have carried out a comprehensive molecular evolutionary study of all the genes involved in the human metabolism. We investigated the type and strength of the selective pressures that acted on the enzyme-coding genes belonging to metabolic pathways …
Differential binding cell-SELEX method to identify cell-specific aptamers using high-throughput sequencing
2018
Aptamers have in recent years emerged as a viable alternative to antibodies. High-throughput sequencing (HTS) has revolutionized aptamer research by increasing the number of reads from a few (using Sanger sequencing) to millions (using an HTS approach). Despite the availability and advantages of HTS compared to Sanger sequencing, there are only 50 aptamer HTS sequencing samples available on public databases. HTS data in aptamer research are primarily used to compare sequence enrichment between subsequent selection cycles. This approach does not take full advantage of HTS because the enrichment of sequences during selection can be due to inefficient negative selection when using live…
Next-generation sequencing: big data meets high performance computing
2017
The progress of next-generation sequencing has a major impact on medical and genomic research. This high-throughput technology can now produce billions of short DNA or RNA fragments in excess of a few terabytes of data in a single run. This leads to massive datasets used by a wide range of applications including personalized cancer treatment and precision medicine. In addition to the hugely increased throughput, the cost of using high-throughput technologies has been dramatically decreasing. A low sequencing cost of around US$1000 per genome has now rendered large population-scale projects feasible. However, to make effective use of the produced data, the design of big data algorithms and t…
Quantitatively characterizing drug-induced arrhythmic contractile motions of human stem cell-derived cardiomyocytes
2018
Quantification of abnormal contractile motions of cardiac tissue has been a noteworthy challenge and significant limitation in assessing and classifying the drug-induced arrhythmias (i.e. Torsades de pointes). To overcome these challenges, researchers have taken advantage of computational image processing tools to measure contractile motion from cardiomyocytes derived from human induced pluripotent stem cells (hiPSC-CMs). However, the amplitude and frequency analysis of contractile motion waveforms doesn't produce sufficient information to objectively classify the degree of variations between two or more sets of cardiac contractile motions. In this paper, we generated contractile motion dat…
miRToolsGallery: a tag-based and rankable microRNA bioinformatics resources database portal
2017
Hundreds of bioinformatics tools have been developed for microRNA (miRNA) investigations including those used for identification, target prediction, structure and expression profile analysis. However, finding the correct tool for a specific application requires the tedious and laborious process of locating, downloading, testing and validating the appropriate tool from a group of nearly a thousand. In order to facilitate this process, we developed a novel database portal named miRToolsGallery. We constructed the portal by manually curating > 950 miRNA analysis tools and resources. In the portal, a query to locate the appropriate tool is expedited by being searchable, filterable and …