Search results for " Informatica"

showing 10 items of 978 documents

Deep learning network for exploiting positional information in nucleosome related sequences

2017

A nucleosome is a DNA-histone complex, wrapping about 150 pairs of double-stranded DNA. The role of nucleosomes is to pack the DNA into the nucleus of the Eukaryote cells to form the Chromatin. Nucleosome positioning genome wide play an important role in the regulation of cell type-specific gene activities. Several biological studies have shown sequence specificity of nucleosome presence, clearly underlined by the organization of precise nucleotides substrings. Taking into consideration such advances, the identification of nucleosomes on a genomic scale has been successfully performed by DNA sequence features representation and classical supervised classification methods such as Support Vec…

0301 basic medicineComputer scienceSpeech recognitionCell02 engineering and technologyComputational biologyGenomeDNA sequencing03 medical and health scienceschemistry.chemical_compoundDeep Learning0202 electrical engineering electronic engineering information engineeringmedicineNucleosomeNucleotideGeneSettore ING-INF/05 - Sistemi Di Elaborazione Delle Informazionichemistry.chemical_classificationSequenceSettore INF/01 - Informaticabiologybusiness.industryDeep learningnucleosomebiology.organism_classificationSubstringChromatinIdentification (information)030104 developmental biologymedicine.anatomical_structurechemistry020201 artificial intelligence & image processingEukaryoteNucleosome classification Epigenetic Deep learning networks Recurrent Neural NetworksArtificial intelligencebusinessDNA
researchProduct

A REST-based framework to support non-invasive and early coeliac disease diagnosis

2019

The health sector has traditionally been one of the early adopters of databases, from the most simple Electronic Health Record (formerly Computer-Based Patient Record) systems in use in general practice, hospitals and intensive care units to big data, multidata based systems used to support diagnosis and care decisions. In this paper we present a framework to support non-invasive and early diagnosis of coeliac disease. The proposed framework makes use of well-known technologies and techniques, both hardware and software, put together in a novel way. The main goals of our framework are: (1) providing users with a reliable and fast repository of a large amount of data; (2) to make such reposi…

0301 basic medicineDecision support systemData processingSIMPLE (military communications protocol)Settore INF/01 - Informaticabusiness.industryComputer scienceBig dataHealth care systemData scienceDatabase03 medical and health sciencesEarly adopter030104 developmental biology0302 clinical medicineSoftwareIntensive careRest (finance)030211 gastroenterology & hepatologybusinessCoeliac disease diagnosis supportHealth care systems
researchProduct

Strategies for structuring interdisciplinary education in Systems Biology: an European perspective

2016

Systems Biology is an approach to biology and medicine that has the potential to lead to a better understanding of how biological properties emerge from the interaction of genes, proteins, molecules, cells and organisms. The approach aims at elucidating how these interactions govern biological function by employing experimental data, mathematical models and computational simulations. As Systems Biology is inherently multidisciplinary, education within this field meets numerous hurdles including departmental barriers, availability of all required expertise locally, appropriate teaching material and example curricula. As university education at the Bachelor’s level is traditionally built upon…

0301 basic medicineEngineeringSystems biologymedia_common.quotation_subjectStructuringGeneral Biochemistry Genetics and Molecular BiologyArticleEducation03 medical and health sciences0302 clinical medicineExcellenceMultidisciplinary approachDrug DiscoveryComputingMilieux_COMPUTERSANDEDUCATIONLife ScienceSystems and Synthetic BiologyInnovation/dk/atira/pure/sustainabledevelopmentgoals/industry_innovation_and_infrastructureCurriculummedia_commonVLAGFlexibility (engineering)Systeem en Synthetische BiologieScience & TechnologyManagement sciencebusiness.industry4. EducationApplied MathematicsINF/01 - INFORMATICAGAPGénéralitésSystems Biology Training and education3. Good healthComputer Science Applications030104 developmental biologyAction (philosophy)Modeling and Simulationand InfrastructureSDG 9 - Industry Innovation and InfrastructureMathematical & Computational BiologySystems biologybusinessDisciplineSDG 9 - IndustryLife Sciences & Biomedicine030217 neurology & neurosurgery
researchProduct

FASTdoop: A versatile and efficient library for the input of FASTA and FASTQ files for MapReduce Hadoop bioinformatics applications

2017

Abstract Summary MapReduce Hadoop bioinformatics applications require the availability of special-purpose routines to manage the input of sequence files. Unfortunately, the Hadoop framework does not provide any built-in support for the most popular sequence file formats like FASTA or BAM. Moreover, the development of these routines is not easy, both because of the diversity of these formats and the need for managing efficiently sequence datasets that may count up to billions of characters. We present FASTdoop, a generic Hadoop library for the management of FASTA and FASTQ files. We show that, with respect to analogous input management routines that have appeared in the Literature, it offers…

0301 basic medicineFASTQ formatStatistics and ProbabilityComputer scienceSequence analysismedia_common.quotation_subjectInformation Storage and RetrievalBioinformaticscomputer.software_genreGenomeBiochemistryDomain (software engineering)03 medical and health sciencesComputational Theory and MathematicHumansGenomic libraryQuality (business)DNA sequencingFASTQ; NGS; FASTQ; DNA sequencingMolecular Biologymedia_commonGene LibrarySequenceDatabaseSettore INF/01 - InformaticaGenome HumanComputer Science Applications1707 Computer Vision and Pattern RecognitionGenomicsSequence Analysis DNAFASTQFile formatComputer Science ApplicationsStatistics and Probability; Biochemistry; Molecular Biology; Computer Science Applications1707 Computer Vision and Pattern Recognition; Computational Theory and Mathematics; Computational MathematicsComputational Mathematics030104 developmental biologyComputational Theory and MathematicsNGSDatabase Management Systemscomputer
researchProduct

Detecting mutations by eBWT

2018

In this paper we develop a theory describing how the extended Burrows-Wheeler Transform (eBWT) of a collection of DNA fragments tends to cluster together the copies of nucleotides sequenced from a genome G. Our theory accurately predicts how many copies of any nucleotide are expected inside each such cluster, and how an elegant and precise LCP array based procedure can locate these clusters in the eBWT. Our findings are very general and can be applied to a wide range of different problems. In this paper, we consider the case of alignment-free and reference-free SNPs discovery in multiple collections of reads. We note that, in accordance with our theoretical results, SNPs are clustered in th…

0301 basic medicineFOS: Computer and information sciences000 Computer science knowledge general worksBWT LCP Array SNPs Reference-free Assembly-freeLCP ArraySettore INF/01 - Informatica[SDV]Life Sciences [q-bio]Reference-freeAssembly-freeSNP03 medical and health sciences030104 developmental biologyBWTBWT; LCP Array; SNPs; Reference-free; Assembly-freeComputer ScienceComputer Science - Data Structures and AlgorithmsData Structures and Algorithms (cs.DS)[INFO]Computer Science [cs]SoftwareSNPs
researchProduct

Alignment-free sequence comparison using absent words

2018

Sequence comparison is a prerequisite to virtually all comparative genomic analyses. It is often realised by sequence alignment techniques, which are computationally expensive. This has led to increased research into alignment-free techniques, which are based on measures referring to the composition of sequences in terms of their constituent patterns. These measures, such as $q$-gram distance, are usually computed in time linear with respect to the length of the sequences. In this paper, we focus on the complementary idea: how two sequences can be efficiently compared based on information that does not occur in the sequences. A word is an {\em absent word} of some sequence if it does not oc…

0301 basic medicineFOS: Computer and information sciencesFormal Languages and Automata Theory (cs.FL)Computer Science - Formal Languages and Automata TheorySequence alignmentInformation System0102 computer and information sciencesCircular wordAbsent words01 natural sciencesUpper and lower boundsSequence comparisonTheoretical Computer ScienceCombinatorics03 medical and health sciencesComputer Science - Data Structures and AlgorithmsData Structures and Algorithms (cs.DS)Absent wordCircular wordsMathematicsSequenceSettore INF/01 - InformaticaProcess (computing)q-gramComputer Science Applications1707 Computer Vision and Pattern Recognitionq-gramsComposition (combinatorics)Computer Science Applications030104 developmental biologyComputational Theory and MathematicsForbidden words010201 computation theory & mathematicsFocus (optics)Forbidden wordWord (computer architecture)Information SystemsInteger (computer science)
researchProduct

Analysis of low-correlated spatial gene expression patterns: A clustering approach in the mouse brain data hosted in the Allen Brain Atlas

2018

The Allen Brain Atlas (ABA) provides a similar gene expression dataset by genome-scale mapping of the C57BL/6J mouse brain. In this study, the authors describe a method to extract the spatial information of gene expression patterns across a set of 1047 genes. The genes were chosen from among the 4104 genes having the lowest Pearson correlation coefficient used to compare the expression patterns across voxels in a single hemisphere for available coronal and sagittal volumes. The set of genes analysed in this study is the one discarded in the article by Bohland et al. , which was considered to be of a lower consistency, not a reliable dataset. Following a normalisation task with a global and …

0301 basic medicineImage registrationGenomicsBiologycomputer.software_genre03 medical and health sciencessymbols.namesake0302 clinical medicineVoxelmedicineCluster analysisSpatial analysisSettore INF/01 - Informaticabusiness.industryBrain atlasPattern recognitionSagittal planePearson product-moment correlation coefficient030104 developmental biologymedicine.anatomical_structuresymbolsMorphometric similarity cluster analysis gene expression patternsComputer Vision and Pattern RecognitionArtificial intelligencebusinesscomputer030217 neurology & neurosurgerySoftware
researchProduct

Alignment Free Dissimilarities for Nucleosome Classification

2016

Epigenetic mechanisms such as nucleosome positioning, histone modifications and DNA methylation play an important role in the regulation of cell type-specific gene activities, yet how epigenetic patterns are established and maintained remains poorly understood. Recent studies have shown a role of DNA sequences in recruitment of epigenetic regulators. For this reason, the use of more suitable similarities or dissimilarity between DNA sequences could help in the context of epigenetic studies. In particular, alignment-free dissimilarities have already been successfully applied to identify distinct sequence features that are associated with epigenetic patterns and to predict epigenomic profiles…

0301 basic medicineNearest neighbour classifiersKnn classifierSettore INF/01 - Informatica030102 biochemistry & molecular biologybiologyComputer scienceSpeech recognitionEpigeneticContext (language use)Computational biologyL-tuples03 medical and health sciences030104 developmental biologyHistoneSimilarity (network science)DNA methylationbiology.proteinNucleosomeEpigeneticsAlignment free DNA sequence dissimilaritiesk-mersNucleosome classificationEpigenomics
researchProduct

On the structural connectivity of large-scale models of brain networks at cellular level

2021

AbstractThe brain’s structural connectivity plays a fundamental role in determining how neuron networks generate, process, and transfer information within and between brain regions. The underlying mechanisms are extremely difficult to study experimentally and, in many cases, large-scale model networks are of great help. However, the implementation of these models relies on experimental findings that are often sparse and limited. Their predicting power ultimately depends on how closely a model’s connectivity represents the real system. Here we argue that the data-driven probabilistic rules, widely used to build neuronal network models, may not be appropriate to represent the dynamics of the …

0301 basic medicineProcess (engineering)Computer scienceScienceModels NeurologicalCellular levelMachine learningcomputer.software_genreArticle03 medical and health sciencesComputational biophysics0302 clinical medicineSettore MAT/05 - Analisi MatematicamedicineBiological neural networkHumansSettore MAT/07 - Fisica MatematicaOn the structural connectivity of large-scale models of brain networks at cellular levelSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniNeuronsMultidisciplinaryNetwork modelsSettore INF/01 - Informaticabusiness.industryQRProbabilistic logicBrain030104 developmental biologymedicine.anatomical_structureMathematical framework Neuron networks Large‑scale model Data‑driven probabilistic rules Modeling cellular-level brain networksMedicineNeuronArtificial intelligencebusinessScale modelcomputer030217 neurology & neurosurgeryScientific Reports
researchProduct

Variable Ranking Feature Selection for the Identification of Nucleosome Related Sequences

2018

Several recent works have shown that K-mer sequence representation of a DNA sequence can be used for classification or identification of nucleosome positioning related sequences. This representation can be computationally expensive when k grows, making the complexity in spaces of exponential dimension. This issue effects significantly the classification task computed by a general machine learning algorithm used for the purpose of sequence classification. In this paper, we investigate the advantage offered by the so-called Variable Ranking Feature Selection method to select the most informative k − mers associated to a set of DNA sequences, for the final purpose of nucleosome/linker classifi…

0301 basic medicineSequenceSettore INF/01 - InformaticaEpigenomic030102 biochemistry & molecular biologybusiness.industryComputer scienceDeep learningPattern recognitionFeature selectionDNA sequencesNucleosomesRanking (information retrieval)Set (abstract data type)03 medical and health sciencesVariable (computer science)030104 developmental biologyDimension (vector space)Feature selectionDeep learning modelsArtificial intelligenceDeep learning models Feature selection DNA sequences Epigenomic NucleosomesRepresentation (mathematics)business
researchProduct