Search results for "format"
showing 10 items of 24643 documents
Efficient Algorithms for Sequence Analysis with Entropic Profiles
2017
Entropy, being closely related to repetitiveness and compressibility, is a widely used information-related measure to assess the degree of predictability of a sequence. Entropic profiles are based on information theory principles, and can be used to study the under-/over-representation of subwords, by also providing information about the scale of conserved DNA regions. Here, we focus on the algorithmic aspects related to entropic profiles. In particular, we propose linear time algorithms for their computation that rely on suffix-based data structures, more specifically on the truncated suffix tree (TST) and on the enhanced suffix array (ESA). We performed an extensive experimental campaign …
Enabling openness of valuable information resources: Curbing data subtractability and exclusion
2019
In this paper we investigate how data openness can be made possible in communal settings. We adopt a utility perspective that foregrounds the use value of data, conceptualizing them as “goods.” On the basis of this conceptualization we explore 2 key goods' attributes: subtractability and exclusion. Our theoretical basis is built upon concepts from the theory of the commons, power theorizing, and notions related to data and information. Empirically, we investigate openness in the genetics domain through a longitudinal study of the evolving communal infrastructure for data related to 2 genes influencing women's susceptibility to breast and ovarian cancer (BRCA1 and BRCA2). We follow the conti…
Influence of pathway topology and functional class on the molecular evolution of human metabolic genes
2018
Metabolic networks comprise thousands of enzymatic reactions functioning in a controlled manner and have been shaped by natural selection. Thanks to the genome data, the footprints of adaptive (positive) selection are detectable, and the strength of purifying selection can be measured. This has made possible to know where, in the metabolic network, adaptive selection has acted and where purifying selection is more or less strong and efficient. We have carried out a comprehensive molecular evolutionary study of all the genes involved in the human metabolism. We investigated the type and strength of the selective pressures that acted on the enzyme-coding genes belonging to metabolic pathways …
Discriminating graph pattern mining from gene expression data
2016
We consider the problem of mining gene expression data in order to single out interesting features that characterize healthy/unhealthy samples of an input dataset. We present and approach based on a network model of the input gene expression data, where there is a labelled graph for each sample. To the best of our knowledge, this is the first attempt to build a different graph for each sample and, then, to have a database of graphs for representing a sample set. Out main goal is that of singling out interesting differences between healthy and unhealthy samples, through the extraction of "discriminating patterns" among graphs belonging to the two different sample sets. Differently from the …
Applying Conceptual Modeling to Better Understand the Human Genome
2016
The objective of the work is to present the benefits of the application of Conceptual Modeling (CM) in complex domains, such as genomics. This paper explains the evolution of a Conceptual Schema of the Human Genome (CSHG), which seeks to provide a clear and precise understanding of the human genome. We want to highlighting all the advantages of the application of CM in a complex domain such as Genomic Information Systems (GeIS). We show how over time this model has evolved, thus we have discovered better forms of representation. As we advanced in exploring the domain, we understood that we should be extending and incorporating the new concepts detected into our model. Here we present and di…
All-atom simulations to studying metallodrugs/target interactions.
2021
Abstract Metallodrugs are extensively used to treat and diagnose distinct disease types. The unique physical–chemical properties of metal ions offer tantalizing opportunities to tailor effective scaffolds for selectively targeting specific biomolecules. Modern experimental techniques have collected a large body of structural data concerning the interactions of metallodrugs with their biomolecular targets, although being unable to exhaustively assess the molecular basis of their mechanism of action. In this scenario, the complementary use of accurate computational methods allows uncovering the minutiae of metallodrugs/targets interactions and their underlying mechanism of action at an atomic…
Differential binding cell-SELEX method to identify cell-specific aptamers using high-throughput sequencing
2018
AbstractAptamers have in recent years emerged as a viable alternative to antibodies. High-throughput sequencing (HTS) has revolutionized aptamer research by increasing the number of reads from a few (using Sanger sequencing) to millions (using an HTS approach). Despite the availability and advantages of HTS compared to Sanger sequencing, there are only 50 aptamer HTS sequencing samples available on public databases. HTS data in aptamer research are primarily used to compare sequence enrichment between subsequent selection cycles. This approach does not take full advantage of HTS because the enrichment of sequences during selection can be due to inefficient negative selection when using live…
Deep learning architectures for prediction of nucleosome positioning from sequences data
2018
Abstract Background Nucleosomes are DNA-histone complex, each wrapping about 150 pairs of double-stranded DNA. Their function is fundamental for one of the primary functions of Chromatin i.e. packing the DNA into the nucleus of the Eukaryote cells. Several biological studies have shown that the nucleosome positioning influences the regulation of cell type-specific gene activities. Moreover, computational studies have shown evidence of sequence specificity concerning the DNA fragment wrapped into nucleosomes, clearly underlined by the organization of particular DNA substrings. As the main consequence, the identification of nucleosomes on a genomic scale has been successfully performed by com…
Application of Graph Clustering and Visualisation Methods to Analysis of Biomolecular Data
2018
In this paper we present an approach based on integrated use of graph clustering and visualisation methods for semi-supervised discovery of biologically significant features in biomolecular data sets. We describe several clustering algorithms that have been custom designed for analysis of biomolecular data and feature an iterated two step approach involving initial computation of thresholds and other parameters used in clustering algorithms, which is followed by identification of connected graph components, and, if needed, by adjustment of clustering parameters for processing of individual subgraphs.
Quantitatively characterizing drug-induced arrhythmic contractile motions of human stem cell-derived cardiomyocytes.
2018
Quantification of abnormal contractile motions of cardiac tissue has been a noteworthy challenge and significant limitation in assessing and classifying the drug-induced arrhythmias (i.e. Torsades de pointes). To overcome these challenges, researchers have taken advantage of computational image processing tools to measure contractile motion from cardiomyocytes derived from human induced pluripotent stem cells (hiPSC-CMs). However, the amplitude and frequency analysis of contractile motion waveforms doesn't produce sufficient information to objectively classify the degree of variations between two or more sets of cardiac contractile motions. In this paper, we generated contractile motion dat…