Search results for "Data mining"

showing 10 items of 907 documents

Power estimation for non-standardized multisite studies

2016

A concern for researchers planning multisite studies is that scanner and T1-weighted sequence-related biases on regional volumes could overshadow true effects, especially for studies with a heterogeneous set of scanners and sequences. Current approaches attempt to harmonize data by standardizing hardware, pulse sequences, and protocols, or by calibrating across sites using phantom-based corrections to ensure the same raw image intensities. We propose to avoid harmonization and phantom-based correction entirely. We hypothesized that the bias of estimated regional volumes is scaled between sites due to the contrast and gradient distortion differences between scanners and sequences. Given this…

Computer science; Cognitive Neuroscience; computer.software_genre; Sensitivity and Specificity; 050105 experimental psychology; Imaging phantom; Article; Set (abstract data type); 03 medical and health sciences; 0302 clinical medicine; Distortion; Image Interpretation Computer-Assisted; Calibration; medicine; [INFO.INFO-IM]Computer Science [cs]/Medical Imaging; Humans; 0501 psychology and cognitive sciences; Segmentation; Computer Simulation; 10. No inequality; Scaling; Models Statistical; medicine.diagnostic_test; 05 social sciences; Contrast (statistics); Brain; Reproducibility of Results; Magnetic resonance imaging; Equipment Design; Scale factor; Image Enhancement; Magnetic Resonance Imaging; United States; Equipment Failure Analysis; Europe; Neurology; Ordinary least squares; Data mining; Function and Dysfunction of the Nervous System; Artifacts; computer; 030217 neurology & neurosurgery; Algorithms

researchProduct

Computation of Psycho-Acoustic Annoyance Using Deep Neural Networks

2019

Psycho-acoustic parameters have been extensively used to evaluate the discomfort or pleasure produced by the sounds in our environment. In this context, wireless acoustic sensor networks (WASNs) can be an interesting solution for monitoring subjective annoyance in certain soundscapes, since they can be used to register the evolution of such parameters in time and space. Unfortunately, calculating the psycho-acoustic parameters involved in common annoyance models carries a significant computational cost, which makes acquiring and transmitting these parameters at the nodes difficult. As a result, monitoring psycho-acoustic annoyance becomes an expensive and inefficient task. Thi…

Computer science; Computation; subjective annoyance; Context (language use); Annoyance; 02 engineering and technology; computer.software_genre; 01 natural sciences; Convolutional neural network; lcsh:Technology; Reduction (complexity); lcsh:Chemistry; convolutional neural networks; 0202 electrical engineering electronic engineering information engineering; Wireless; General Materials Science; wireless acoustic sensor networks; Instrumentation; lcsh:QH301-705.5; Fluid Flow and Transfer Processes; business.industry; lcsh:T; Process Chemistry and Technology; 010401 analytical chemistry; General Engineering; Regression analysis; lcsh:QC1-999; 0104 chemical sciences; Computer Science Applications; psycho-acoustic parameters; Transmission (telecommunications); lcsh:Biology (General); lcsh:QD1-999; lcsh:TA1-2040; 020201 artificial intelligence & image processing; Data mining; business; lcsh:Engineering (General). Civil engineering (General); Zwicker model; computer; lcsh:Physics; Applied Sciences

researchProduct

Classification of reference models: a methodology and its application

2003

Classification is an important tool for perception and can be found in numerous scientific disciplines. Several application areas of classification are described in the context of information modeling. The usefulness of classification for the reuse or selection of reference models is emphasized. A methodology to systematically create classification systems will be introduced. Furthermore, a classification system for reference models will be developed with the aid of the proposed methodology. This classification system gives a comprehensive, but abstract survey of 26 reference models found in the literature.

Computer science; Context (language use); Reuse; computer.software_genre; ComputingMethodologies_PATTERNRECOGNITION; Application areas; Information model; Taxonomy (general); Selection (linguistics); Data mining; Reference model; computer; Scientific disciplines; Information Systems; Information Systems and e-Business Management

researchProduct

Querying and reasoning over large scale building data sets

2016

The architectural design and construction domains work on a daily basis with massive amounts of data. Properly managing, exchanging and exploiting these data is an ongoing challenge in this domain. This has resulted in large semantic RDF graphs that are to be combined with a significant number of other data sets (building product catalogues, regulation data, geometric point cloud data, simulation data, sensor data), thus making an already huge dataset even larger. Making these big data available at high performance rates and speeds and in the correct (intuitive) formats is therefore a major challenge in this domain. Yet, hardly any benchmark is avai…

Computer science; Data management; Big data; [ INFO.INFO-WB ] Computer Science [cs]/Web; 0211 other engineering and technologies; ifcOWL; 02 engineering and technology; Semantic data model; computer.software_genre; Domain (software engineering); [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]; Set (abstract data type); benchmark; semantic web; big data; 021105 building & construction; 0202 electrical engineering electronic engineering information engineering; [ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI]; Semantic Web; [INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB]; business.industry; [INFO.INFO-WB]Computer Science [cs]/Web; Data set; [ INFO.INFO-DB ] Computer Science [cs]/Databases [cs.DB]; Building information modeling; Benchmark (computing); reasoning; 020201 artificial intelligence & image processing; Data mining; business; computer

researchProduct

Basic Sampling Techniques

2004

Computer science; Data mining; computer.software_genre; computer

researchProduct

Data mining and information retrieval

2007

Computer science; Data mining; computer.software_genre; computer

researchProduct

Executable Data Quality Models

2017

The paper discusses an external solution for data quality management in information systems. In contrast to traditional data quality assurance methods, the proposed approach uses a domain-specific language (DSL) to describe data quality models. Data quality models consist of graphical diagrams whose elements contain requirements for data object values and procedures for data object analysis. The DSL interpreter makes the data quality model executable, thereby ensuring the measurement and improvement of data quality. The described approach can be applied: (1) to check the completeness, accuracy and consistency of accumulated data; (2) to support data migration in c…

Computer science; Data transformation; 02 engineering and technology; computer.software_genre; Data modeling; 0203 mechanical engineering; 0202 electrical engineering electronic engineering information engineering; Information system; Logical data model; General Environmental Science; Data element; Database; Information quality; Data warehouse; Data mapping; 020303 mechanical engineering & transports; Data model; Data quality; General Earth and Planetary Sciences; 020201 artificial intelligence & image processing; Data pre-processing; Data architecture; Data mining; Software architecture; computer; Data migration; Data virtualization; Procedia Computer Science

researchProduct

Editing prototypes in the finite sample size case using alternative neighborhoods

1998

The recently introduced concept of Nearest Centroid Neighborhood is applied to discard outliers and prototypes in class overlapping regions in order to improve the performance of the Nearest Neighbor rule through an editing procedure. This approach is related to graph-based editing algorithms, which also define alternative neighborhoods in terms of geometric relations. Classical editing algorithms are compared to these alternative editing schemes using several synthetic and real data problems. The empirical results show that the proposed editing algorithm constitutes a good trade-off between performance and computational burden.

Computer science; Delaunay triangulation; business.industry; Centroid; Machine learning; computer.software_genre; Class (biology); k-nearest neighbors algorithm; Sample size determination; Pattern recognition (psychology); Outlier; Artificial intelligence; Data mining; business; computer

researchProduct

Entropy-Based Classifier Enhancement to Handle Imbalanced Class Problem

2017

The paper presents a possible enhancement of entropy-based classifiers to handle problems caused by class imbalance in the original dataset. The proposed method was tested on synthetic data in order to analyse its robustness in a controlled environment with different class proportions. The proposed method was also tested on real medical data with imbalanced classes and compared to the results of the original classification algorithm. The medical field was chosen for testing because uneven class ratios occur frequently there.

Computer science; Entropy (statistical thermodynamics); business.industry; Decision tree; Pattern recognition; 02 engineering and technology; computer.software_genre; 01 natural sciences; Synthetic data; 010305 fluids & plasmas; Entropy (classical thermodynamics); 0103 physical sciences; 0202 electrical engineering electronic engineering information engineering; General Earth and Planetary Sciences; Entropy (information theory); 020201 artificial intelligence & image processing; Artificial intelligence; Data mining; Entropy (energy dispersal); business; Entropy (arrow of time); computer; General Environmental Science; Entropy (order and disorder); Procedia Computer Science

researchProduct

MetNet: A two-level approach to reconstructing and comparing metabolic networks

2021

Metabolic pathway comparison and interaction between different species can reveal important information for drug engineering and medical science. In the literature, proposals for reconstructing and comparing metabolic networks present two main problems: network reconstruction usually requires human intervention to integrate information from different sources and, in metabolic comparison, the size of the networks leads to a challenging computational problem. We propose to automatically reconstruct a metabolic network on the basis of KEGG database information. Our proposal relies on a two-level representation of the huge metabolic network: the first level is graph-based and depicts pathways a…

Computer science; Enzyme Metabolism; Metabolic network; computer.software_genre; Biochemistry; Infographics; 0302 clinical medicine; Cluster Analysis; Enzyme Chemistry; Data Management; Mammals; 0303 health sciences; Multidisciplinary; Basis (linear algebra); Settore INF/01 - Informatica; Q; R; Chemical Reactions; Eukaryota; Graph; Chemistry; Vertebrates; Physical Sciences; Medicine; Carbohydrate Metabolism; Data mining; Metabolic Pathways; Computational problem; Graphs; Network Analysis; Metabolic Networks and Pathways; Research Article; Computer and Information Sciences; ComputingMethodologies_SIMULATIONANDMODELING; Science; 03 medical and health sciences; Metabolic Networks; Similarity (psychology); Xenobiotic Metabolism; Animals; Humans; Metabolomics; KEGG; Representation (mathematics); Symbiosis; 030304 developmental biology; Data Visualization; Organisms; Biology and Life Sciences; Metabolism; Metabolic pathway; ComputingMethodologies_PATTERNRECOGNITION; Metabolism; Amniotes; Enzymology; computer; Zoology; 030217 neurology & neurosurgery; Software; PLoS ONE

researchProduct