Search results for "DATA MINING"

showing 10 items of 907 documents

Computational cluster validation for microarray data analysis: experimental assessment of Clest, Consensus Clustering, Figure of Merit, Gap Statistic…

2008

Abstract Background Inferring cluster structure in microarray datasets is a fundamental task for the so-called -omic sciences. It is also a fundamental question in Statistics, Data Analysis and Classification, in particular with regard to the prediction of the number of clusters in a dataset, usually established via internal validation measures. Despite the wealth of internal measures available in the literature, new ones have been recently proposed, some of them specifically for microarray data. Results We consider five such measures: Clest, Consensus (Consensus Clustering), FOM (Figure of Merit), Gap (Gap Statistics) and ME (Model Explorer), in addition to the classic WCSS (Within Cluster…

clustering microarray dataMicroarrayComputer scienceStatistics as Topiccomputer.software_genrelcsh:Computer applications to medicine. Medical informaticsBiochemistryStructural BiologyDatabases GeneticConsensus clusteringStatisticsCluster (physics)AnimalsCluster AnalysisHumansCluster analysislcsh:QH301-705.5Molecular BiologyOligonucleotide Array Sequence AnalysisStructure (mathematical logic)Microarray analysis techniquesApplied MathematicsComputational BiologyComputer Science ApplicationsBenchmarkingComputingMethodologies_PATTERNRECOGNITIONlcsh:Biology (General)Gene chip analysislcsh:R858-859.7Data miningDNA microarraycomputerAlgorithmsSoftwareResearch ArticleBMC Bioinformatics
researchProduct

OntoSTEP: Enriching product model data using ontologies

2012

The representation and management of product lifecycle information is critical to any manufacturing organization. Different modeling languages are used at different lifecycle stages, for example STEP's EXPRESS may be used at a detailed design stage, while UML may be used for initial design stages. It is necessary to consolidate product information created using these different languages to build a coherent knowledge base. In this paper, we present an approach to enable the translation of STEP schema and its instances to Ontology Web Language (OWL). This gives a model-which we call OntoSTEP-that can easily be integrated with any OWL ontologies to create a semantically rich model. As an examp…

computer.internet_protocolbusiness.industryModeling languageComputer scienceProgramming languageProtégéOntology (information science)computer.software_genreComputer Graphics and Computer-Aided DesignIndustrial and Manufacturing EngineeringOWL-SComputer Science ApplicationsMetamodelingProduct lifecycleKnowledge baseUnified Modeling LanguageData miningbusinesscomputercomputer.programming_languageComputer-Aided Design
researchProduct

Applying fully tensorial ICA to fMRI data

2016

There are two aspects in functional magnetic resonance imaging (fMRI) data that make them awkward to analyse with traditional multivariate methods - high order and high dimension. The first of these refers to the tensorial nature of observations as array-valued elements instead of vectors. Although this can be circumvented by vectorizing the array, doing so simultaneously loses all the structural information in the original observations. The second aspect refers to the high dimensionality along each dimension making the concept of dimension reduction a valuable tool in the processing of fMRI data. Different methods of tensor dimension reduction are currently gaining popUlarity in literature…

computer.software_genre01 natural sciencesTask (project management)010104 statistics & probability03 medical and health sciences0302 clinical medicineDimension (vector space)medicinePreprocessorTensor0101 mathematicsMathematicsta112medicine.diagnostic_testbusiness.industryDimensionality reductionfMRIPattern recognitionIndependent component analysisdataPrincipal component analysisData miningArtificial intelligencefunctional magnetic resonance imaging databusinessFunctional magnetic resonance imagingcomputer030217 neurology & neurosurgery2016 IEEE Signal Processing in Medicine and Biology Symposium (SPMB)
researchProduct

Information Transfer in Linear Multivariate Processes Assessed through Penalized Regression Techniques: Validation and Application to Physiological N…

2020

The framework of information dynamics allows the dissection of the information processed in a network of multiple interacting dynamical systems into meaningful elements of computation that quantify the information generated in a target system, stored in it, transferred to it from one or more source systems, and modified in a synergistic or redundant way. The concepts of information transfer and modification have been recently formulated in the context of linear parametric modeling of vector stochastic processes, linking them to the notion of Granger causality and providing efficient tools for their computation based on the state&ndash

conditional transfer entropyInformation transferlinear predictionDynamical systems theoryComputer scienceState–space modelsGeneral Physics and Astronomylcsh:AstrophysicsNetwork topologycomputer.software_genrenetwork physiology01 natural sciencesArticle03 medical and health sciences0302 clinical medicinepenalized regression techniquelcsh:QB460-4660103 physical sciencesEntropy (information theory)Statistics::Methodologylcsh:Science010306 general physicspartial information decompositionmultivariate time series analysisinformation dynamics; partial information decomposition; entropy; conditional transfer entropy; network physiology; multivariate time series analysis; State–space models; vector autoregressive model; penalized regression techniques; linear predictionState–space modellcsh:QC1-999multivariate time series analysiInformation dynamicData pointpenalized regression techniquesAutoregressive modelSettore ING-INF/06 - Bioingegneria Elettronica E InformaticaParametric modelOrdinary least squaresvector autoregressive modellcsh:QData mininginformation dynamicsentropycomputerlcsh:Physics030217 neurology & neurosurgery
researchProduct

Scienze sociali computazionali e fenomeni criminali: una ricognizione

2016

L’espressione “scienze sociali computazionali” sta diventando sempre più comune nel lessico delle scienze della società. Si tratta di un campo di studi che, originando da settori della sociologia più orientati alla ricerca quantitativa, si ibrida con contributi provenienti dall’informatica e dalle cosiddette scienze della complessità. Nella prima parte del capitolo, dopo un primo paragrafo riguardante aspetti definitori ed un tentativo di classificazione delle scienze sociali computazionali, vengono presentate le tre famiglie di tecniche più importanti che caratterizzano questo approccio: il data mining, l’analisi di rete, e la simulazione al computer; con una maggiore attenzione prestata a…

criminalitàbig datanetwork analysisimulazione ad agentidata miningscienze sociali computazionali
researchProduct

Crystal structure of (E)-pent-2-enoic acid

2015

The molecule of the title compound, C5H8O2, a low-melting α,β-unsaturated carboxylic acid, is essentially planar [maximum displacement = 0.0239 (13) Å]. In the crystal, molecules are linked into centrosymmetric dimersviapairs of O—H...O hydrogen bonds.

crystal structurehydrogen bondunsaturated carb­oxy­lic acidHydrogen bondDimerGeneral ChemistryCrystal structureCondensed Matter Physicscomputer.software_genredimerData ReportsCrystallcsh:ChemistryCrystallographychemistry.chemical_compoundPlanarchemistrylcsh:QD1-999General Materials ScienceData miningMaximum displacementcomputerunsaturated carboxylic acidActa Crystallographica Section E: Crystallographic Communications
researchProduct

Web mining e Application Programming Interfaces: caratteristiche, strumenti, prospettive e limiti

2014

data mining web mining big data API computational social sciences web semanticoSettore SPS/07 - Sociologia Generale
researchProduct

Beyond Tandem Analysis: Joint Dimension Reduction and Clustering in R

2019

We present the R package clustrd which implements a class of methods that combine dimension reduction and clustering of continuous or categorical data. In particular, for continuous data, the package contains implementations of factorial K-means and reduced K-means; both methods combine principal component analysis with K-means clustering. For categorical data, the package provides MCA K-means, i-FCB and cluster correspondence analysis, which combine multiple correspondence analysis with K-means. Two examples on real data sets are provided to illustrate the usage of the main functions.

dimension reduction; clustering; principal component analysis; multiple correspondence analysis; K-meansStatistics and Probabilitydimension reduction clustering principal component analysis multiple correspon-dence analysis K-meansFactorialmultiple correspon-dence analysisMultiple correspondence analysiComputer sciencedimension reductionprincipal component analysisk-meansmultiple correspondence analysisPrincipal component analysicomputer.software_genre01 natural sciencesCorrespondence analysis010104 statistics & probabilityMultiple correspondence analysis0101 mathematicsCluster analysisCategorical variablelcsh:Statisticslcsh:HA1-4737Dimensionality reductionk-means clusteringK-meanPrincipal component analysisData miningHA29-32Statistics Probability and UncertaintycomputerSoftwareclusteringJournal of Statistical Software
researchProduct

CoproID predicts the source of coprolites and paleofeces using microbiome composition and host DNA content

2020

Shotgun metagenomics applied to archaeological feces (paleofeces) can bring new insights into the composition and functions of human and animal gut microbiota from the past. However, paleofeces often undergo physical distortions in archaeological sediments, making their source species difficult to identify on the basis of fecal morphology or microscopic features alone. Here we present a reproducible and scalable pipeline using both host and microbial DNA to infer the host source of fecal material. We apply this pipeline to newly sequenced archaeological specimens and show that we are able to distinguish morphologically similar human and canine paleofeces, as well as non-fecal sediments, fro…

dogsArcheologyMicrobial DNAData Mining and Machine LearningCoprolitemicrobiomeendogenous DNAlcsh:MedicineMorphology (biology)Genomechemistry.chemical_compoundPaleofecesDog0601 history and archaeologyGutArqueologia Metodologia0303 health sciences060102 archaeologyGeneral NeuroscienceGeneral Medicine06 humanities and the artsGenomicsNextflowmachine learningnextflowgutGeneral Agricultural and Biological SciencesShotgun metagenomicsPaleofecesHumanpaleofecesBioinformaticsBiologyMicrobiologyGeneral Biochemistry Genetics and Molecular Biologydiversity03 medical and health sciencesEndogenous DNAMachine learningcoprolitedog molecular analysishumanMicrobiomeancient DNAgenome030304 developmental biology030306 microbiologyHost (biology)lcsh:RcultureAncient DNAarcheologychemistryEvolutionary biologyAnthropologyCoproliteMicrobiomedietDNAPeerJ
researchProduct

Atlas construction and image analysis using statistical cardiac models

2010

International audience; This paper presents a brief overview of current trends in the construction of population and multi-modal heart atlases in our group and their application to atlas-based cardiac image analysis. The technical challenges around the construction of these atlases are organized around two main axes: groupwise image registration of anatomical, motion and fiber images and construction of statistical shape models. Application-wise, this paper focuses on the extraction of atlas-based biomarkers for the detection of local shape or motion abnormalities, addressing several cardiac applications where the extracted information is used to study and grade different pathologies. The p…

education.field_of_studyAtlas (topology)Computer sciencebusiness.industryPopulationImage registration02 engineering and technologycomputer.software_genreIndependent component analysisMotion (physics)030218 nuclear medicine & medical imagingImage (mathematics)03 medical and health sciences0302 clinical medicine0202 electrical engineering electronic engineering information engineeringMyocardial motion[INFO.INFO-IM]Computer Science [cs]/Medical Imaging020201 artificial intelligence & image processingComputer visionData miningArtificial intelligenceeducationbusinesscomputer
researchProduct