Search results for "genomics"
Showing 10 of 1,255 documents
FASTdoop: A versatile and efficient library for the input of FASTA and FASTQ files for MapReduce Hadoop bioinformatics applications
2017
MapReduce Hadoop bioinformatics applications require the availability of special-purpose routines to manage the input of sequence files. Unfortunately, the Hadoop framework does not provide any built-in support for the most popular sequence file formats like FASTA or BAM. Moreover, the development of these routines is not easy, both because of the diversity of these formats and the need to efficiently manage sequence datasets that may count up to billions of characters. We present FASTdoop, a generic Hadoop library for the management of FASTA and FASTQ files. We show that, with respect to analogous input management routines that have appeared in the literature, it offers…
Integrative analysis of structural variations using short-reads and linked-reads yields highly specific and sensitive predictions.
2020
Genetic diseases are driven by aberrations of the human genome. Identification of such aberrations, including structural variations (SVs), is key to our understanding. Conventional short-reads whole genome sequencing (cWGS) can identify SVs to base-pair resolution, but utilizes only short-range information and suffers from a high false discovery rate (FDR). Linked-reads sequencing (10XWGS) utilizes long-range information by linkage of short-reads originating from the same large DNA molecule. This can mitigate alignment-based artefacts, especially in repetitive regions, and should enable better prediction of SVs. However, an unbiased evaluation of this technology is not available. In this study, w…
Shell palaeoproteomics: first application of peptide mass fingerprinting for the rapid identification of mollusc shells in archaeology.
2020
Molluscs were one of the most widely-used natural resources in the past, and their shells are abundant among archaeological findings. However, our knowledge of the variety of shells that were circulating in prehistoric times (and thus their socio-economic and cultural value) is scarce due to the difficulty of achieving taxonomic determination of fragmented and/or worked remains. This study aims to obtain molecular barcodes based on peptide mass fingerprints (PMFs) of intracrystalline proteins, in order to obtain shell identification. Palaeoproteomic applications on shells are challenging, due to low concentration of molluscan proteins and an incomplete unde…
Use of deep learning methods to translate drug-induced gene expression changes from rat to human primary hepatocytes
2020
In clinical trials, animal and cell line models are often used to evaluate the potential toxic effects of a novel compound or candidate drug before progressing to human trials. However, relating the results of animal and in vitro model exposures to relevant clinical outcomes in the human in vivo system still proves challenging, relying on often putative orthologs. In recent years, multiple studies have demonstrated that the repeated dose rodent bioassay, the current gold standard in the field, lacks sufficient sensitivity and specificity in predicting toxic effects of pharmaceuticals in humans. In this study, we evaluate the potential of deep learning techniques to translate the pattern of …
"Islands of divergence" in the Atlantic cod genome represent polymorphic chromosomal rearrangements
2016
In several species genetic differentiation across environmental gradients or between geographically separate populations has been reported to center at “genomic islands of divergence,” resulting in heterogeneous differentiation patterns across genomes. Here, genomic regions of elevated divergence were observed on three chromosomes of the highly mobile fish Atlantic cod (Gadus morhua) within geographically fine-scaled coastal areas. The “genomic islands” extended at least 5, 9.5, and 13 megabases on linkage groups 2, 7, and 12, respectively, and coincided with large blocks of linkage disequilibrium. For each of these three chromosomes, pairs of segregating, highly divergent alleles were id…
MiasDB: A Database of Molecular Interactions Associated with Alternative Splicing of Human Pre-mRNAs.
2016
Alternative splicing (AS) is pervasive in human multi-exon genes and is a major contributor to expansion of the transcriptome and proteome diversity. The accurate recognition of alternative splice sites is regulated by information contained in networks of protein-protein and protein-RNA interactions. However, the mechanisms leading to splice site selection are not fully understood. Although numerous databases have been built to describe AS, molecular interaction databases associated with AS have only recently emerged. In this study, we present a new database, MiasDB, that provides a description of molecular interactions associated with human AS events. This database covers 938 interactions …
Measuring the clustering effect of BWT via RLE
2017
The Burrows–Wheeler Transform (BWT) is a reversible transformation on which several text compressors and many other tools used in Bioinformatics and Computational Biology are based. The BWT is not actually a compressor, but a transformation that performs a context-dependent permutation of the letters of the input text that often creates runs of equal letters (clusters) longer than the ones in the original text, usually referred to as the “clustering effect” of BWT. In particular, from a combinatorial point of view, great attention has been given to the case in which the BWT produces the fewest number of clusters (cf. [5], [16], [21], [23]). In this paper we are concerned about t…
Genomics of speciation and introgression in Princess cichlid fishes from Lake Tanganyika.
2016
How variation in the genome translates into biological diversity and how new species originate has endured as the mystery of mysteries in evolutionary biology. African cichlid fishes are prime model systems to address speciation-related questions for their remarkable taxonomic and phenotypic diversity, and the possible role of gene flow in this process. Here, we capitalize on genome sequencing and phylogenomic analyses to address the relative impacts of incomplete lineage sorting, introgression and hybrid speciation in the Neolamprologus savoryi-complex (the 'Princess cichlids') from Lake Tanganyika. We present a time-calibrated species tree based on whole-genome sequences and provide strong ev…
Epigenetic regulation of DNA repair genes and implications for tumor therapy
2017
DNA repair represents the first barrier against genotoxic stress causing metabolic changes, inflammation and cancer. Besides its role in preventing cancer, DNA repair also needs to be considered during cancer treatment with radiation and DNA-damaging drugs, as it impacts therapy outcome. The DNA repair capacity is mainly governed by the expression level of repair genes. Alterations in the expression of repair genes can occur due to mutations in their coding or promoter region, changes in the expression of transcription factors activating or repressing these genes, and/or epigenetic factors changing histone modifications and CpG promoter methylation or demethylation levels. In this review we …
Evaluation of the RYR1 gene genetic diversity in the Latvian White pig breed
2016
The ryanodine receptor 1 (RYR1) is a calcium ion channel in the sarcoplasmic reticulum of skeletal muscle. Multiple polymorphic loci have been identified in the RYR1 gene in humans and animals, and some of them are associated with certain phenotypes. However, there are still few data on the RYR1 genetic variability in pigs, and only the missense mutation Arg615Cys, associated with malignant hyperthermia, porcine stress syndrome and meat quality, has been studied in several commercial and local breeds. Aim. To genotype the rs344435545 (C1972T, Arg615Cys), rs196953058 (T8434C, Phe2769Leu) and rs323041392 (G12484A, Asp4119Asn) in the Latvian local pig breed Latvian White and to evaluate the ev…