Search results for "Sequencing"
showing 10 items of 976 documents
FASTdoop: A versatile and efficient library for the input of FASTA and FASTQ files for MapReduce Hadoop bioinformatics applications
2017
Summary: MapReduce Hadoop bioinformatics applications require special-purpose routines to manage the input of sequence files. Unfortunately, the Hadoop framework does not provide any built-in support for the most popular sequence file formats like FASTA or BAM. Moreover, developing these routines is not easy, both because of the diversity of these formats and because of the need to efficiently manage sequence datasets that may contain up to billions of characters. We present FASTdoop, a generic Hadoop library for the management of FASTA and FASTQ files. We show that, with respect to analogous input management routines that have appeared in the literature, it offers…
Q-nexus: a comprehensive and efficient analysis pipeline designed for ChIP-nexus
2016
Background: ChIP-nexus, an extension of the ChIP-exo protocol, can be used to map the borders of protein-bound DNA sequences at nucleotide resolution, requires less input DNA and enables selective PCR duplicate removal using random barcodes. However, the use of random barcodes requires additional preprocessing of the mapping data, which complicates the computational analysis. To date, only a very limited number of software packages are available for the analysis of ChIP-exo data, which have not yet been systematically tested and compared on ChIP-nexus data. Results: Here, we present a comprehensive software package for ChIP-nexus data that exploits the random barcodes for selective removal …
Integrative analysis of structural variations using short-reads and linked-reads yields highly specific and sensitive predictions
2020
Genetic diseases are driven by aberrations of the human genome. Identification of such aberrations including structural variations (SVs) is key to our understanding. Conventional short-reads whole genome sequencing (cWGS) can identify SVs to base-pair resolution, but utilizes only short-range information and suffers from high false discovery rate (FDR). Linked-reads sequencing (10XWGS) utilizes long-range information by linkage of short-reads originating from the same large DNA molecule. This can mitigate alignment-based artefacts especially in repetitive regions and should enable better prediction of SVs. However, an unbiased evaluation of this technology is not available. In this study, w…
Feasibility of sample size calculation for RNA-seq studies
2017
Sample size calculation is a crucial step in study design but is not yet fully established for RNA sequencing (RNA-seq) analyses. To evaluate feasibility and provide guidance, we evaluated RNA-seq sample size tools identified from a systematic search. The focus was on whether real pilot data would be needed for reliable results and on identifying tools that would perform well in scenarios with different levels of biological heterogeneity and fold changes (FCs) between conditions. We used simulations based on real data for tool evaluation. In all settings, the six evaluated tools provided widely different answers, which were strongly affected by FC. Although all tools failed for small FCs, s…
2016
We determine knotting probabilities and typical sizes of knots in double-stranded DNA for chains of up to half a million base pairs with computer simulations of a coarse-grained bead-stick model: Single trefoil knots and composite knots which include at least one trefoil as a prime factor are shown to be common in DNA chains exceeding 250,000 base pairs, assuming physiologically relevant salt conditions. The analysis is motivated by the emergence of DNA nanopore sequencing technology, as knots are a potential cause of erroneous nucleotide reads in nanopore sequencing devices and may severely limit read lengths in the foreseeable future. Even though our coarse-grained model is only based on …
NOX2ko Mice Show Largely Increased Expression of a Mutated NOX2 mRNA Encoding an Inactive NOX2 Protein
2020
Background: The superoxide-generating enzyme nicotinamide adenine dinucleotide phosphate (NADPH) oxidase (NOX2 or gp91phox, the phagocytic isoform) was reported as a major source of oxidative stress in various human diseases. Genetic deletion is widely used to study the impact of NOX2-derived reactive oxygen species (ROS) on disease development and progression in various animal models. Here, we investigate why NOX2 knockout mice show no NOX2 activity but express NOX2 mRNA and protein. Methods and Results: Oxidative burst (NOX2-dependent formation of ROS) was measured by L-012-based chemiluminescence and was largely absent in whole blood of NOX2 knockout mice. Protein expression was still de…
2016
RNA 2'-O-methylation is one of the ubiquitous nucleotide modifications found in many RNA types from Bacteria, Archaea and Eukarya. RNAs bearing 2'-O-methylations show increased resistance to degradation and enhanced stability in helices. While the exact role of each 2'-O-Me residue remained elusive, the catalytic protein Fibrillarin (Nop1 in yeast) responsible for 2'-O-methylation in eukaryotes, is associated with human pathologies. Therefore, there is an urgent need to precisely map and quantify hundreds of 2'-O-Me residues in RNA using high-throughput technologies. Here, we develop a reliable protocol using alkaline fragmentation of total RNA coupled to a commonly used ligation approach, …
STAG1 mutations cause a novel cohesinopathy characterised by unspecific syndromic intellectual disability
2017
BACKGROUND: Cohesinopathies are rare neurodevelopmental disorders arising from a dysfunction in the cohesin pathway, which enables chromosome segregation and regulates gene transcription. So far, eight genes from this pathway have been reported in human disease. STAG1 belongs to the STAG subunit of the core cohesin complex, along with five other subunits. This work aimed to identify the phenotype ascribed to STAG1 mutations. METHODS: Among patients referred for intellectual disability (ID) in genetics departments worldwide, array-comparative genomic hybridisation (CGH), gene panel, whole-exome sequencing or whole-genome sequencing were performed following the …
Diagnostic odyssey in severe neurodevelopmental disorders: toward clinical whole-exome sequencing as a first-line diagnostic test
2016
The current standard of care for diagnosis of severe intellectual disability (ID) and epileptic encephalopathy (EE) results in a diagnostic yield of ∼50%. Affected individuals nonetheless undergo multiple clinical evaluations and low-yield laboratory tests often referred to as a 'diagnostic odyssey'. This study was aimed at assessing the utility of clinical whole-exome sequencing (WES) in individuals with undiagnosed and severe forms of ID and EE, and the feasibility of its implementation in routine practice by a small regional genetic center. We performed WES in a cohort of 43 unrelated individuals with undiagnosed ID and/or EE. All individuals had undergone multiple clinical evaluations a…
A Novel Role for CSRP1 in a Lebanese Family with Congenital Cardiac Defects
2017
Despite an obvious role for consanguinity in congenital heart disease (CHD), most studies fail to document a monogenic model of inheritance except for a few cases. We hereby describe a first-degree cousins consanguineous Lebanese family with 7 conceived children: 2 died in utero of unknown causes, 3 have CHD, and 4 have polydactyly. The aim of the study is to unveil the genetic variant(s) causing these phenotypes using next generation sequencing (NGS) technology. Targeted exome sequencing identified a heterozygous duplication in CSRP1 which leads to a potential frameshift mutation at position 154 of the protein. This mutation is inherited from the father, and segregates only with the CHD phen…