Search results for "COMPUTATION"
Advances in Understanding the Molecular Basis of the Mediterranean Diet Effect
2018
Posted with permission from the Annual Review of Food Science and Technology, Volume 9 by Annual Reviews, http://www.annualreviews.org. Increasingly, studies showing the protective effects of the Mediterranean diet (MedDiet) on different diseases (cardiovascular, diabetes, some cancers, and even total mortality and aging indicators) are being published. The scientific evidence level for each outcome is variable, and new studies are needed to better understand the molecular mechanisms whereby the MedDiet may exercise its effects. Here, we present recent advances in understanding the molecular basis of MedDiet effects, mainly focusing on cardiovascular diseases but also discussing other relat…
Non-primate lentiviral vectors and their applications in gene therapy for ocular disorders
2018
Lentiviruses have a number of molecular features in common, starting with the ability to integrate their genetic material into the genome of non-dividing infected cells. A distinctive property of non-primate lentiviruses is their inability to infect and cause disease in humans, which provides the main rationale for deriving biologically safe lentiviral vectors for gene therapy applications. In this review, we first give an overview of non-primate lentiviruses, highlighting their common and distinctive molecular characteristics together with key concepts in the molecular biology of lentiviruses. We next examine the bioengineering strategies leading to the conversion of lentiviruse…
FASTdoop: A versatile and efficient library for the input of FASTA and FASTQ files for MapReduce Hadoop bioinformatics applications
2017
Summary: MapReduce Hadoop bioinformatics applications require the availability of special-purpose routines to manage the input of sequence files. Unfortunately, the Hadoop framework does not provide any built-in support for the most popular sequence file formats like FASTA or BAM. Moreover, the development of these routines is not easy, both because of the diversity of these formats and the need to efficiently manage sequence datasets that may count up to billions of characters. We present FASTdoop, a generic Hadoop library for the management of FASTA and FASTQ files. We show that, with respect to analogous input management routines that have appeared in the literature, it offers…
The colored longest common prefix array computed via sequential scans
2018
Due to the increased availability of large datasets of biological sequences, the tools for sequence comparison are now relying on efficient alignment-free approaches to a greater extent. Most of the alignment-free approaches require the computation of statistics of the sequences in the dataset. Such computations become impractical in internal memory when very large collections of long sequences are considered. In this paper, we present a new conceptual data structure, the colored longest common prefix array (cLCP), that allows several problems to be tackled efficiently with an alignment-free approach. In fact, we show that such a data structure can be computed via sequential scans in semi-exter…
Q-nexus: a comprehensive and efficient analysis pipeline designed for ChIP-nexus
2016
Background: ChIP-nexus, an extension of the ChIP-exo protocol, can be used to map the borders of protein-bound DNA sequences at nucleotide resolution, requires less input DNA and enables selective PCR duplicate removal using random barcodes. However, the use of random barcodes requires additional preprocessing of the mapping data, which complicates the computational analysis. To date, only a very limited number of software packages are available for the analysis of ChIP-exo data, which have not yet been systematically tested and compared on ChIP-nexus data. Results: Here, we present a comprehensive software package for ChIP-nexus data that exploits the random barcodes for selective removal …
Alignment-free sequence comparison using absent words
2018
Sequence comparison is a prerequisite to virtually all comparative genomic analyses. It is often realised by sequence alignment techniques, which are computationally expensive. This has led to increased research into alignment-free techniques, which are based on measures referring to the composition of sequences in terms of their constituent patterns. These measures, such as q-gram distance, are usually computed in time linear with respect to the length of the sequences. In this paper, we focus on the complementary idea: how two sequences can be efficiently compared based on information that does not occur in the sequences. A word is an absent word of some sequence if it does not oc…
Integrative analysis of structural variations using short-reads and linked-reads yields highly specific and sensitive predictions.
2020
Genetic diseases are driven by aberrations of the human genome. Identification of such aberrations including structural variations (SVs) is key to our understanding. Conventional short-reads whole genome sequencing (cWGS) can identify SVs to base-pair resolution, but utilizes only short-range information and suffers from high false discovery rate (FDR). Linked-reads sequencing (10XWGS) utilizes long-range information by linkage of short-reads originating from the same large DNA molecule. This can mitigate alignment-based artefacts especially in repetitive regions and should enable better prediction of SVs. However, an unbiased evaluation of this technology is not available. In this study, w…
Biophysics of high density nanometer regions extracted from super-resolution single particle trajectories: application to voltage-gated calcium chann…
2019
The cellular membrane is very heterogeneous and enriched with high-density regions forming microdomains, as revealed by single particle tracking experiments. However, the organization of these regions remains unexplained. We determine here the biophysical properties of these regions, when described as a basin of attraction. We develop two methods to recover the dynamics and local potential wells (field of force and boundary). The first method is based on the local density of points distribution of trajectories, which differs inside and outside the wells. The second method focuses on recovering the drift field that is convergent inside wells and uses the transient field to determine the…
Skeletal Dysplasia Mutations Effect on Human Filamins’ Structure and Mechanosensing
2016
Cells’ ability to sense mechanical cues in their environment is crucial for fundamental cellular processes, and defects in mechanosensing are linked to many diseases. The actin cross-linking protein Filamin has an important role in the conversion of mechanical forces into biochemical signals. Here, we reveal how mutations in Filamin genes known to cause Larsen syndrome and Frontometaphyseal dysplasia can affect the structure and therefore function of Filamin domains 16 and 17. Employing X-ray crystallography, the structure of these domains was first solved for the human Filamin B. The interaction seen between domains 16 and 17 is broken by shear force as revealed by steered mo…
2016
We determine knotting probabilities and typical sizes of knots in double-stranded DNA for chains of up to half a million base pairs with computer simulations of a coarse-grained bead-stick model: Single trefoil knots and composite knots which include at least one trefoil as a prime factor are shown to be common in DNA chains exceeding 250,000 base pairs, assuming physiologically relevant salt conditions. The analysis is motivated by the emergence of DNA nanopore sequencing technology, as knots are a potential cause of erroneous nucleotide reads in nanopore sequencing devices and may severely limit read lengths in the foreseeable future. Even though our coarse-grained model is only based on …