Search results for "Data mining"

showing 10 items of 907 documents

No-Reference 3D Mesh Quality Assessment Based on Dihedral Angles Model and Support Vector Regression

2016

International audience; 3D meshes are subject to various visual distortions during their transmission and geometrical processing. Several works have tried to evaluate the visual quality using either full reference or reduced reference approaches. However, these approaches require the presence of the reference mesh which is not available in such practical situations. In this paper, the main contribution lies in the design of a computational method to automatically predict the perceived mesh quality without reference and without knowing beforehand the distortion type. Following the no-reference (NR) quality assessment principle, the proposed method focuses only on the distorted mesh. Specific…

Gamma distribution[ INFO.INFO-TS ] Computer Science [cs]/Signal and Image Processing[ INFO ] Computer Science [cs]Computer science02 engineering and technologycomputer.software_genre[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]Quality (physics)[INFO.INFO-TS]Computer Science [cs]/Signal and Image ProcessingVisual maskingDistortion0202 electrical engineering electronic engineering information engineeringGamma distribution[INFO]Computer Science [cs]Polygon mesh[ INFO.INFO-AI ] Computer Science [cs]/Artificial Intelligence [cs.AI]No-reference mesh quality assessmentVisual masking effect020207 software engineeringSupport vector machineSupport vector regressionQuality ScoreHuman visual system modelDihedral angles020201 artificial intelligence & image processingData miningAlgorithmcomputer

researchProduct

Identification and visualisation of differential isoform expression in RNA-seq time series

2017

AbstractAs sequencing technologies improve their capacity to detect distinct transcripts of the same gene and to address complex experimental designs such as longitudinal studies, there is a need to develop statistical methods for the analysis of isoform expression changes in time series data. Iso-maSigPro is a new functionality of the R package maSigPro for transcriptomics time series data analysis. Iso-maSigPro identifies genes with a differential isoform usage across time. The package also includes new clustering and visualization functions that allow grouping of genes with similar expression patterns at the isoform level, as well as those genes with a shift in major expressed isoform. T…

Gene isoform0303 health sciencesComputer scienceRNA-SeqComputational biologycomputer.software_genreExpression (mathematics)VisualizationBioconductorTranscriptome03 medical and health sciences0302 clinical medicineData miningGenecomputer030217 neurology & neurosurgery030304 developmental biology

researchProduct

Data Analytics in Healthcare: A Tertiary Study

2022

AbstractThe field of healthcare has seen a rapid increase in the applications of data analytics during the last decades. By utilizing different data analytic solutions, healthcare areas such as medical image analysis, disease recognition, outbreak monitoring, and clinical decision support have been automated to various degrees. Consequently, the intersection of healthcare and data analytics has received scientific attention to the point of numerous secondary studies. We analyze studies on healthcare data analytics, and provide a wide overview of the subject. This is a tertiary study, i.e., a systematic review of systematic reviews. We identified 45 systematic secondary studies on data analy…

General Computer ScienceComputer Networks and Communicationsterveydenhuoltodata-analytiikkahealthcaredata miningtekoälyartificial intelligenceComputer Graphics and Computer-Aided DesignComputer Science Applicationsmachine learningkoneoppiminendataComputational Theory and Mathematicsbig dataArtificial Intelligencetiedonlouhintadata analyticsSN Computer Science

researchProduct

Exploring Multiobjective Optimization for Multiview Clustering

2018

We present a new multiview clustering approach based on multiobjective optimization. In contrast to existing clustering algorithms based on multiobjective optimization, it is generally applicable to data represented by two or more views and does not require specifying the number of clusters a priori . The approach builds upon the search capability of a multiobjective simulated annealing based technique, AMOSA, as the underlying optimization technique. In the first version of the proposed approach, an internal cluster validity index is used to assess the quality of different partitionings obtained using different views. A new way of checking the compatibility of these different partitioning…

General Computer ScienceComputer science02 engineering and technologycomputer.software_genreMulti-objective optimizationCluster validity index020204 information systemsSimulated annealingNew mutation0202 electrical engineering electronic engineering information engineeringA priori and a posteriori020201 artificial intelligence & image processingData miningCluster analysisMultiple viewcomputerACM Transactions on Knowledge Discovery from Data

researchProduct

Content quality assessment and acceptance testing in location‐based services

2006

In this paper, we develop and evaluate an approach to assessing the content quality in a location‐based service (LBS). The proposed approach, instead of assessing the quality in absolute terms such as completeness or accuracy, measures the effect that the imperfection of the content is having on the reliability of that specific LBS. We apply the basic ideas from Software Reliability Engineering (SRE), but develop a modification of SRE, 2‐Branch, in order to separate content quality from other factors, such as positioning imprecision, and to reduce the measurement error. In our experimental study, we first compare 2‐Branch to the standard SRE, after which we experimentally analyze some prope…

General Computer ScienceComputer sciencemedia_common.quotation_subjectContext (language use)computer.software_genreSoftware qualityOracleTheoretical Computer ScienceReliability engineeringAcceptance testingLocation-based serviceQuality (business)Data miningcomputerReliability (statistics)media_commonStatistical hypothesis testingInternational Journal of Pervasive Computing and Communications

researchProduct

HyperLabelMe : A Web Platform for Benchmarking Remote-Sensing Image Classifiers

2017

HyperLabelMe is a web platform that allows the automatic benchmarking of remote-sensing image classifiers. To demonstrate this platform's attributes, we collected and harmonized a large data set of labeled multispectral and hyperspectral images with different numbers of classes, dimensionality, noise sources, and levels. The registered user can download training data pairs (spectra and land cover/use labels) and submit the predictions for unseen testing spectra. The system then evaluates the accuracy and robustness of the classifier, and it reports different scores as well as a ranked list of the best methods and users. The system is modular, scalable, and ever-growing in data sets and clas…

General Computer ScienceContextual image classificationComputer scienceMultispectral imageRegistered user020206 networking & telecommunications02 engineering and technologyBenchmarkingcomputer.software_genreData setStatistical classificationComputingMethodologies_PATTERNRECOGNITIONRobustness (computer science)ITC-ISI-JOURNAL-ARTICLE0202 electrical engineering electronic engineering information engineeringGeneral Earth and Planetary Sciences020201 artificial intelligence & image processingData miningElectrical and Electronic EngineeringInstrumentationcomputerClassifier (UML)IEEE Geoscience and Remote Sensing Magazine

researchProduct

Adapted Transfer of Distance Measures for Quantitative Structure-Activity Relationships and Data-Driven Selection of Source Datasets

2012

Quantitative structure–activity relationships are regression models relating chemical structure to biological activity. Such models allow to make predictions for toxicologically relevant endpoints, which constitute the target outcomes of experiments. The task is often tackled by instance-based methods, which are all based on the notion of chemical (dis-)similarity. Our starting point is the observation by Raymond and Willett that the two families of chemical distance measures, fingerprint-based and maximum common subgraph-based measures, provide orthogonal information about chemical similarity. This paper presents a novel method for finding suitable combinations of them, called adapted tran…

General Computer Sciencebusiness.industryComputer scienceFingerprint (computing)Chemical similaritycomputer.software_genreMachine learningDistance measuresData-drivenTask (project management)Similarity (network science)Learning curveData miningArtificial intelligencebusinessTransfer of learningcomputerThe Computer Journal

researchProduct

2014

This paper investigates the proficiency of support vector machine (SVM) using datasets generated by Tennessee Eastman process simulation for fault detection. Due to its excellent performance in generalization, the classification performance of SVM is satisfactory. SVM algorithm combined with kernel function has the nonlinear attribute and can better handle the case where samples and attributes are massive. In addition, with forehand optimizing the parameters using the cross-validation technique, SVM can produce high accuracy in fault detection. Therefore, there is no need to deal with original data or refer to other algorithms, making the classification problem simple to handle. In order to…

GeneralizationApplied MathematicsProcess (computing)computer.software_genreFault detection and isolationSupport vector machineNonlinear systemComputingMethodologies_PATTERNRECOGNITIONRanking SVMBenchmark (computing)Data miningProcess simulationcomputerAnalysisMathematicsAbstract and Applied Analysis

researchProduct

MetaCache-GPU: Ultra-Fast Metagenomic Classification

2021

The cost of DNA sequencing has dropped exponentially over the past decade, making genomic data accessible to a growing number of scientists. In bioinformatics, localization of short DNA sequences (reads) within large genomic sequences is commonly facilitated by constructing index data structures which allow for efficient querying of substrings. Recent metagenomic classification pipelines annotate reads with taxonomic labels by analyzing their $k$-mer histograms with respect to a reference genome database. CPU-based index construction is often performed in a preprocessing phase due to the relatively high cost of building irregular data structures such as hash maps. However, the rapidly growi…

Genomics (q-bio.GN)FOS: Computer and information sciencesSource codeComputer sciencemedia_common.quotation_subjectHash functionContext (language use)MinHashcomputer.software_genreData structureHash tableComputer Science - Distributed Parallel and Cluster ComputingFOS: Biological sciencesPreprocessorQuantitative Biology - GenomicsDistributed Parallel and Cluster Computing (cs.DC)Data miningcomputermedia_commonReference genome50th International Conference on Parallel Processing

researchProduct

Analysing the presence of school-shooting related communities at social media sites

2010

Surprisingly cruel mass murders and attacks have been witnessed in the educational institutions of the Western world since the 1970s. These are often referred to as 'school shootings'. There have been over 300 known incidents around the world and the number is growing. Social network sites (SNSs) have enabled the perpetrators to express their views and intentions. Our result is that since about 2005, all major school shooters have had a presence in SNS and some have left traces that would have made possible to evaluate their intentions to carry out a rampage. A further hypothesis is that future school shooters will behave in a similar manner and would thus be traceable in the digital sphere…

GeographySocial media miningSocial networkbusiness.industryOntologyWestern worldSocial mediaCriminologyMultimedia data miningRelation (history of concept)businessSocial psychologyInternational Journal of Multimedia Intelligence and Security

researchProduct