Search results for " Informatica"

showing 10 items of 978 documents

Clustering of low-correlated spatial gene expression patterns in the mouse brain in the Allen Brain Atlas

2018

In this paper, clustering techniques are applied to spatial gene expression patterns with a low genomic correlation between the sagittal and coronal projections. The data analysed here are hosted on an available public DB named ABA (Allen Brain Atlas). The results are compared to those obtained by Bohland et al. on the complementary dataset (high correlation values). We prove that, by analysing a reduced dataset,hence reducing the computational burden, we get the same accuracy in highlighting different neuroanatomical region.

0301 basic medicineSettore INF/01 - InformaticaComputer scienceBrain atlasComputer Science ApplicationGenomicsComputational biologySagittal planeCorrelation03 medical and health sciences030104 developmental biology0302 clinical medicinemedicine.anatomical_structureComputer Networks and CommunicationHardware and ArchitectureCoronal planeGene expressionmedicineComputer Vision and Pattern RecognitionElectrical and Electronic EngineeringCluster analysis030217 neurology & neurosurgery

researchProduct

Block Sorting-Based Transformations on Words: Beyond the Magic BWT

2018

The Burrows-Wheeler Transform (BWT) is a word transformation introduced in 1994 for Data Compression and later results have contributed to make it a fundamental tool for the design of self-indexing compressed data structures. The Alternating Burrows-Wheeler Transform (ABWT) is a more recent transformation, studied in the context of Combinatorics on Words, that works in a similar way, using an alternating lexicographical order instead of the usual one. In this paper we study a more general class of block sorting-based transformations. The transformations in this new class prove to be interesting combinatorial tools that offer new research perspectives. In particular, we show that all the tra…

0301 basic medicineSettore INF/01 - InformaticaComputer scienceData_CODINGANDINFORMATIONTHEORY0102 computer and information sciencesBlock sortingData structureLexicographical order01 natural sciencesUpper and lower bounds03 medical and health sciencesCombinatorics on words030104 developmental biology010201 computation theory & mathematicsArithmeticCompressed Data Structures Block Sorting Combinatorics on Words AlgorithmsData compression

researchProduct

A Deep Learning Model for Epigenomic Studies

2016

Epigenetics is the study of heritable changes in gene expression that does not involve changes to the underlying DNA sequence, i.e. a change in phenotype not involved by a change in genotype. At least three main factor seems responsible for epigenetic change including DNA methylation, histone modification and non-coding RNA, each one sharing having the same property to affect the dynamic of the chromatin structure by acting on Nucleosomes posi- tion. A nucleosome is a DNA-histone complex, where around 150 base pairs of double-stranded DNA is wrapped. The role of nucleosomes is to pack the DNA into the nucleus of the Eukaryote cells, to form the Chromatin. Nucleosome positioning plays an imp…

0301 basic medicineSettore INF/01 - InformaticabiologyBase pairdeep learningGenomicsComputational biologyBioinformaticsChromatin03 medical and health sciences030104 developmental biologyHistoneclassificationDNA methylationbiology.proteinNucleosomeEpigeneticsnucleosome positioningEpigenomics2016 12th International Conference on Signal-Image Technology & Internet-Based Systems (SITIS)

researchProduct

Discovering discriminative graph patterns from gene expression data

2016

We consider the problem of mining gene expression data in order to single out interesting features characterizing healthy/unhealthy samples of an input dataset. We present an approach based on a network model of the input gene expression data, where there is a labelled graph for each sample. To the best of our knowledge, this is the first attempt to build a different graph for each sample and, then, to have a database of graphs for representing a sample set. Our main goal is that of singling out interesting differences between healthy and unhealthy samples, through the extraction of "discriminative patterns" among graphs belonging to the two different sample sets. Differently from the other…

0301 basic medicineSettore INF/01 - Informaticabusiness.industryComputer science0206 medical engineeringpattern discovery subgraph extraction biological networksPattern recognition02 engineering and technologyGraph03 medical and health sciencesComputingMethodologies_PATTERNRECOGNITION030104 developmental biologyDiscriminative modelGraph patternsArtificial intelligencebusiness020602 bioinformaticsBiological networkNetwork modelProceedings of the 31st Annual ACM Symposium on Applied Computing

researchProduct

Recurrent Deep Neural Networks for Nucleosome Classification

2020

Nucleosomes are the fundamental repeating unit of chromatin. A nucleosome is an 8 histone proteins complex, in which approximately 147–150 pairs of DNA bases bind. Several biological studies have clearly stated that the regulation of cell type-specific gene activities are influenced by nucleosome positioning. Bioinformatic studies have improved those results showing proof of sequence specificity in nucleosomes’ DNA fragment. In this work, we present a recurrent neural network that uses nucleosome sequence features representation for their classification. In particular, we implement an architecture which stacks convolutional and long short-term memory layers, with the main purpose to avoid t…

0301 basic medicineSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazionibiologySettore INF/01 - InformaticaComputer scienceComputational biologyChromatin03 medical and health scienceschemistry.chemical_compound030104 developmental biologyHistoneRecurrent neural networkchemistryFragment (logic)biology.proteinNucleosomeNucleosome classification Epigenetic Deep learning networks Recurrent Neural NetworksGeneDNASequence (medicine)

researchProduct

The intrinsic combinatorial organization and information theoretic content of a sequence are correlated to the DNA encoded nucleosome organization of…

2015

Abstract Motivation: Thanks to research spanning nearly 30 years, two major models have emerged that account for nucleosome organization in chromatin: statistical and sequence specific. The first is based on elegant, easy to compute, closed-form mathematical formulas that make no assumptions of the physical and chemical properties of the underlying DNA sequence. Moreover, they need no training on the data for their computation. The latter is based on some sequence regularities but, as opposed to the statistical model, it lacks the same type of closed-form formulas that, in this case, should be based on the DNA sequence only. Results: We contribute to close this important methodological gap …

0301 basic medicineStatistics and ProbabilityNucleosome organizationComputational biologyBiologyType (model theory)BiochemistryGenomeDNA sequencing03 medical and health sciencesComputational Theory and MathematicNucleosomeMolecular BiologySequence (medicine)GeneticsGenomeSettore INF/01 - InformaticaEukaryotaComputer Science Applications1707 Computer Vision and Pattern RecognitionStatistical modelDNAChromatinNucleosomesComputer Science ApplicationsChromatinSettore BIO/18 - GeneticaComputational Mathematics030104 developmental biologyComputational Theory and MathematicsComputational MathematicBioinformatics

researchProduct

In vitro versus in vivo compositional landscapes of histone sequence preferences in eucaryotic genomes

2018

Abstract Motivation Although the nucleosome occupancy along a genome can be in part predicted by in vitro experiments, it has been recently observed that the chromatin organization presents important differences in vitro with respect to in vivo. Such differences mainly regard the hierarchical and regular structures of the nucleosome fiber, whose existence has long been assumed, and in part also observed in vitro, but that does not apparently occur in vivo. It is also well known that the DNA sequence has a role in determining the nucleosome occupancy. Therefore, an important issue is to understand if, and to what extent, the structural differences in the chromatin organization between in vit…

0301 basic medicineStatistics and Probabilityved/biology.organism_classification_rank.speciesComputational biologySaccharomyces cerevisiaeGenomeBiochemistryDNA sequencingHistones03 medical and health sciences0302 clinical medicineIn vivoComputational Theory and MathematicNucleosomeAnimalsModel organismCaenorhabditis elegansMolecular BiologySequence (medicine)GenomebiologySettore INF/01 - Informaticaved/biologyComputer Science ApplicationChromatinComputer Science ApplicationsChromatinNucleosomesComputational Mathematics030104 developmental biologyHistoneEukaryotic CellsComputational Theory and Mathematicsbiology.proteinComputer Vision and Pattern RecognitionSequence Analysis030217 neurology & neurosurgery

researchProduct

An effective extension of the applicability of alignment-free biological sequence comparison algorithms with Hadoop

2016

Alignment-free methods are one of the mainstays of biological sequence comparison, i.e., the assessment of how similar two biological sequences are to each other, a fundamental and routine task in computational biology and bioinformatics. They have gained popularity since, even on standard desktop machines, they are faster than methods based on alignments. However, with the advent of Next-Generation Sequencing Technologies, datasets whose size, i.e., number of sequences and their total length, is a challenge to the execution of alignment-free methods on those standard machines are quite common. Here, we propose the first paradigm for the computation of k-mer-based alignment-free methods for…

0301 basic medicineTheoretical computer science030102 biochemistry & molecular biologySettore INF/01 - InformaticaComputer scienceComputationExtension (predicate logic)Information SystemHash tableDistributed computingTask (project management)Theoretical Computer Science03 medical and health sciences030104 developmental biologyAlignment-free sequence comparison and analysisHadoopHardware and Architecturealignment-free sequence comparison and analysis; distributed computing; Hadoop; MapReduce; software; theoretical computer science; information systems; hardware and architectureSequence comparisonMapReduceAlignment-free sequence comparison and analysiAlignment-free sequence comparison and analysis; Distributed computing; Hadoop; MapReduce; Theoretical Computer Science; Software; Information Systems; Hardware and ArchitectureSoftwareInformation Systems

researchProduct

Deep learning models for bacteria taxonomic classification of metagenomic data.

2018

Background An open challenge in translational bioinformatics is the analysis of sequenced metagenomes from various environmental samples. Of course, several studies demonstrated the 16S ribosomal RNA could be considered as a barcode for bacteria classification at the genus level, but till now it is hard to identify the correct composition of metagenomic data from RNA-seq short-read data. 16S short-read data are generated using two next generation sequencing technologies, i.e. whole genome shotgun (WGS) and amplicon (AMP); typically, the former is filtered to obtain short-reads belonging to a 16S shotgun (SG), whereas the latter take into account only some specific 16S hypervariable regions.…

0301 basic medicineTime FactorsDBNComputer scienceBiochemistryStructural BiologyRNA Ribosomal 16SDatabases Geneticlcsh:QH301-705.5Settore ING-INF/05 - Sistemi Di Elaborazione Delle InformazionibiologySettore INF/01 - InformaticaShotgun sequencingApplied MathematicsAmpliconClassificationComputer Science Applicationslcsh:R858-859.7DNA microarrayShotgunAlgorithmsCNN030106 microbiologyk-mer representationlcsh:Computer applications to medicine. Medical informaticsDNA sequencing03 medical and health sciencesMetagenomicDeep LearningMolecular BiologyBacteriaModels GeneticPhylumbusiness.industryDeep learningResearchReproducibility of ResultsPattern recognitionBiological classification16S ribosomal RNAbiology.organism_classificationAmpliconHypervariable region030104 developmental biologyTaxonlcsh:Biology (General)MetagenomicsMetagenomeArtificial intelligenceMetagenomicsNeural Networks ComputerbusinessClassifier (UML)BacteriaBMC bioinformatics

researchProduct

An Integrative Framework for the Construction of Big Functional Networks

2018

We present a methodology for biological data integration, aiming at building and analysing large functional networks which model complex genotype-phenotype associations. A functional network is a graph where nodes represent cellular components (e.g., genes, proteins, mRNA, etc.) and edges represent associations among such molecules. Different types of components may cohesist in the same network, and associations may be related to physical[biochemical interactions or functional/phenotipic relationships. Due to both the large amount of involved information and the computational complexity typical of the problems in this domain, the proposed framework is based on big data technologies (Spark a…

0301 basic medicinebiological networkBiological dataTheoretical computer scienceSettore INF/01 - InformaticaComputational complexity theoryComputer sciencebusiness.industryBig dataNoSQLcomputer.software_genreFunctional networks03 medical and health sciences030104 developmental biologyGraph (abstract data type)big data technologiesbig data technologiebusinesscomputerIntegrative approacheBiological network2018 IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

researchProduct