Search results for "DATA MINING"
showing 10 items of 907 documents
Computational cluster validation for microarray data analysis: experimental assessment of Clest, Consensus Clustering, Figure of Merit, Gap Statistic…
2008
Abstract Background Inferring cluster structure in microarray datasets is a fundamental task for the so-called -omic sciences. It is also a fundamental question in Statistics, Data Analysis and Classification, in particular with regard to the prediction of the number of clusters in a dataset, usually established via internal validation measures. Despite the wealth of internal measures available in the literature, new ones have been recently proposed, some of them specifically for microarray data. Results We consider five such measures: Clest, Consensus (Consensus Clustering), FOM (Figure of Merit), Gap (Gap Statistics) and ME (Model Explorer), in addition to the classic WCSS (Within Cluster…
OntoSTEP: Enriching product model data using ontologies
2012
The representation and management of product lifecycle information is critical to any manufacturing organization. Different modeling languages are used at different lifecycle stages, for example STEP's EXPRESS may be used at a detailed design stage, while UML may be used for initial design stages. It is necessary to consolidate product information created using these different languages to build a coherent knowledge base. In this paper, we present an approach to enable the translation of STEP schema and its instances to Ontology Web Language (OWL). This gives a model-which we call OntoSTEP-that can easily be integrated with any OWL ontologies to create a semantically rich model. As an examp…
Applying fully tensorial ICA to fMRI data
2016
There are two aspects in functional magnetic resonance imaging (fMRI) data that make them awkward to analyse with traditional multivariate methods - high order and high dimension. The first of these refers to the tensorial nature of observations as array-valued elements instead of vectors. Although this can be circumvented by vectorizing the array, doing so simultaneously loses all the structural information in the original observations. The second aspect refers to the high dimensionality along each dimension making the concept of dimension reduction a valuable tool in the processing of fMRI data. Different methods of tensor dimension reduction are currently gaining popUlarity in literature…
Information Transfer in Linear Multivariate Processes Assessed through Penalized Regression Techniques: Validation and Application to Physiological N…
2020
The framework of information dynamics allows the dissection of the information processed in a network of multiple interacting dynamical systems into meaningful elements of computation that quantify the information generated in a target system, stored in it, transferred to it from one or more source systems, and modified in a synergistic or redundant way. The concepts of information transfer and modification have been recently formulated in the context of linear parametric modeling of vector stochastic processes, linking them to the notion of Granger causality and providing efficient tools for their computation based on the state&ndash
Scienze sociali computazionali e fenomeni criminali: una ricognizione
2016
L’espressione “scienze sociali computazionali” sta diventando sempre più comune nel lessico delle scienze della società. Si tratta di un campo di studi che, originando da settori della sociologia più orientati alla ricerca quantitativa, si ibrida con contributi provenienti dall’informatica e dalle cosiddette scienze della complessità. Nella prima parte del capitolo, dopo un primo paragrafo riguardante aspetti definitori ed un tentativo di classificazione delle scienze sociali computazionali, vengono presentate le tre famiglie di tecniche più importanti che caratterizzano questo approccio: il data mining, l’analisi di rete, e la simulazione al computer; con una maggiore attenzione prestata a…
Crystal structure of (E)-pent-2-enoic acid
2015
The molecule of the title compound, C5H8O2, a low-melting α,β-unsaturated carboxylic acid, is essentially planar [maximum displacement = 0.0239 (13) Å]. In the crystal, molecules are linked into centrosymmetric dimersviapairs of O—H...O hydrogen bonds.
Web mining e Application Programming Interfaces: caratteristiche, strumenti, prospettive e limiti
2014
Beyond Tandem Analysis: Joint Dimension Reduction and Clustering in R
2019
We present the R package clustrd which implements a class of methods that combine dimension reduction and clustering of continuous or categorical data. In particular, for continuous data, the package contains implementations of factorial K-means and reduced K-means; both methods combine principal component analysis with K-means clustering. For categorical data, the package provides MCA K-means, i-FCB and cluster correspondence analysis, which combine multiple correspondence analysis with K-means. Two examples on real data sets are provided to illustrate the usage of the main functions.
CoproID predicts the source of coprolites and paleofeces using microbiome composition and host DNA content
2020
Shotgun metagenomics applied to archaeological feces (paleofeces) can bring new insights into the composition and functions of human and animal gut microbiota from the past. However, paleofeces often undergo physical distortions in archaeological sediments, making their source species difficult to identify on the basis of fecal morphology or microscopic features alone. Here we present a reproducible and scalable pipeline using both host and microbial DNA to infer the host source of fecal material. We apply this pipeline to newly sequenced archaeological specimens and show that we are able to distinguish morphologically similar human and canine paleofeces, as well as non-fecal sediments, fro…
Atlas construction and image analysis using statistical cardiac models
2010
International audience; This paper presents a brief overview of current trends in the construction of population and multi-modal heart atlases in our group and their application to atlas-based cardiac image analysis. The technical challenges around the construction of these atlases are organized around two main axes: groupwise image registration of anatomical, motion and fiber images and construction of statistical shape models. Application-wise, this paper focuses on the extraction of atlas-based biomarkers for the detection of local shape or motion abnormalities, addressing several cardiac applications where the extracted information is used to study and grade different pathologies. The p…