Search results for "Mining"

showing 10 items of 1730 documents

A Windowing strategy for Distributed Data Mining optimized through GPUs

2017

Abstract This paper introduces an optimized Windowing based strategy for inducing decision trees in Distributed Data Mining scenarios. Windowing consists in selecting a sample of the available training examples (the window) to induce a decision tree with an usual algorithm, e.g., J48; finding instances not covered by this tree (counter examples) in the remaining training examples, adding them to the window to induce a new tree; and repeating until a termination criterion is met. In this way, the number of training examples required to induce the tree is reduced considerably, while maintaining the expected accuracy levels; which is paid in terms of time performance. Our proposed enhancements…

Computer sciencebusiness.industryMulti-agent systemDecision treeProcess (computing)Window (computing)02 engineering and technologyMachine learningcomputer.software_genreRandom forestTree (data structure)C4.5 algorithmArtificial Intelligence020204 information systemsSignal Processing0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingComputer Vision and Pattern RecognitionArtificial intelligenceData miningbusinesscomputerSoftwarePattern Recognition Letters
researchProduct

Sectors on sectors (SonS): A new hierarchical clustering visualization tool

2011

Clustering techniques have been widely applied to extract information from high-dimensional data structures in the last few years. Graphs are especially relevant for clustering, but many graphs associated with hierarchical clustering do not give any information about the values of the centroids' attributes and the relationships among them. In this paper, we propose a new visualization approach for hierarchical cluster analysis in which the above-mentioned information is available. The method is based on pie charts. The pie charts are divided into several pie segments or sectors corresponding to each cluster. The radius of each pie segment is proportional to the number of patterns included i…

Computer sciencebusiness.industryPie chartcomputer.software_genreSynthetic datalaw.inventionHierarchical clusteringVisualizationSet (abstract data type)Information extractionData visualizationlawData miningbusinessCluster analysiscomputer2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)
researchProduct

JABB: Moving Towards The Future.

2012

Computer sciencebusiness.industryPrimary stabilityBiomedical EngineeringBiophysicsBioengineeringBiocompatible MaterialsGeneral MedicineData scienceBiomechanical PhenomenaBiomaterialsText miningTotal knee arthroplastyCruciate retainingOriginal ArticlePeriodicals as TopicbusinessTransversal support tibial plateauForecastingJournal of applied biomaterialsfunctional materials
researchProduct

sar: Automatic generation of statistical reports using Stata and Microsoft Word for Windows

2013

The output provided by most Stata commands is plain text not suitable to be presented or published. After the numerical and graphical outputs are obtained, the user has to copy them into a word processor to complete the editing process. Some Stata commands help you to obtain well-formatted output, especially tabulated results in LATEX or other formats, but they are not a complete solution nor are they friendly tools. Stata automatic report (Sar) is an easy-to-use macro for Microsoft Word for Windows that allows a powerful integration between Stata and Word. With Sar, the user can retrieve numerical results and graphs from Stata and automatically insert them into a well-formatted Word docum…

Computer sciencebusiness.industryProgramming languagePlain textWord processingProcess (computing)computer.file_formatcomputer.software_genreAutomationMathematics (miscellaneous)WorkflowSar Stata Automation object report automation Microsoft Word reproducible research Automation OLEData miningMacrobusinesscomputerWord (computer architecture)
researchProduct

Clustering categorical data: A stability analysis framework

2011

Clustering to identify inherent structure is an important first step in data exploration. The k-means algorithm is a popular choice, but K-means is not generally appropriate for categorical data. A specific extension of k-means for categorical data is the k-modes algorithm. Both of these partition clustering methods are sensitive to the initialization of prototypes, which creates the difficulty of selecting the best solution for a given problem. In addition, selecting the number of clusters can be an issue. Further, the k-modes method is especially prone to instability when presented with ‘noisy’ data, since the calculation of the mode lacks the smoothing effect inherent in the calculation …

Computer sciencebusiness.industrySingle-linkage clusteringCorrelation clusteringConstrained clusteringcomputer.software_genreMachine learningDetermining the number of clusters in a data setData stream clusteringCURE data clustering algorithmConsensus clusteringData miningArtificial intelligenceCluster analysisbusinesscomputer2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM)
researchProduct

Adding Synthetic Detail to Natural Terrain Using a Wavelet Approach

2002

Terrain representation is a basic topic in the field of interactive graphics. The amount of data required for good quality terrain representation offers an important challenge to developers of such systems. For users of these applications the accuracy of geographical data is less important than their natural visual appearance. This makes it possible to mantain a limited geographical data base for the system and to extend it generating synthetic data.In this paper we combine fractal and wavelet theories to provide extra data which keeps the natural essence of actual information available. The new levels of detail(LOD) for the terrain are obtained applying an inverse Wavelet Transform (WT) to…

Computer sciencebusiness.industryWavelet transformImage processingTerraincomputer.software_genreFractal dimensionField (computer science)Set (abstract data type)Computer graphicsWaveletFractalComputer visionData miningArtificial intelligenceRepresentation (mathematics)businesscomputer
researchProduct

<title>Dynamic integration of multiple data mining techniques in a knowledge discovery management system</title>

1999

One of the most important directions in improvement of data mining and knowledge discovery, is the integration of multiple classification techniques of an ensemble of classifiers. An integration technique should be able to estimate and select the most appropriate component classifiers from the ensemble. We present two variations of an advanced dynamic integration technique with two distance metrics. The technique is one variation of the stacked generalization method, with an assumption that each of the component classifiers is the best one, inside a certain sub area of the entire domain area. Our technique includes two phases: the learning phase and the application phase. During the learnin…

Computer sciencebusiness.industryWeighted votingcomputer.software_genreMachine learningExpert systemMultiple dataMatrix (mathematics)Information extractionComputingMethodologies_PATTERNRECOGNITIONKnowledge extractionManagement systemData miningArtificial intelligencebusinesscomputerClassifier (UML)Data Mining and Knowledge Discovery: Theory, Tools, and Technology
researchProduct

Multivariate statistical technique over QoS variables to analyze video quality metrics on IEEE 802.11ac networks

2017

[EN] We present the results from a measurementbasedperformance evaluation of wireless networks basedon IEEE 802.11ac standard in an indoor environment, withthe aim to analyze their performance under high definitionstreaming video applications. We focus our study on analyzingthe highest performance of these standards using off-theshelfequipment as well as the behavior of Quality of Servicevariables and how they affect to the video quality. Thus, wehave analyzed and measured these variables and have applieda multivariate statistical technique, called Factor Analysis,and finally discuss their behavior.

Computer sciencebusiness.industryWireless networkQuality of servicemedia_common.quotation_subjectVideo quality metricFactor analysiVideo qualitycomputer.software_genreQuality of experienceMultivariate statistical techniqueQuality of serviceIEEE 802.11acQuality (business)TelematicsQuality of experienceTelematicsData miningbusinessFocus (optics)computerComputer networkmedia_commonProceedings XIII Jornadas de Ingenieria Telematica - JITEL2017
researchProduct

Photonic non-contact estimation of blood lactate level

2015

The ability to measure the blood lactate level in a non-invasive, non-contact manner is very appealing to the sports industry as well as the home care field. That is mainly because this substance level is an imperative parameter in the course of devolving a personal workout programs. Moreover, the blood lactate level is also a pivotal means in estimation of muscles' performance capability. In this manuscript we propose an optical non-contact approach to estimate the concentration level of this parameter. Firstly, we introduce the connection between the physiological muscle tremor and the lactate blood levels. Secondly, we suggest a photonic optical method to estimate the physiological tremo…

Computer sciencebusiness.industrycomputer.software_genreAtomic and Molecular Physics and OpticsArticlePhysiological tremorElectromagnetic opticsProof of conceptControl theoryBlood lactateData miningPhotonicsbusinesscomputerLaser beamsBiotechnology
researchProduct

Mass Spectrometry in Food Quality and Safety

2015

Abstract In recent years, mass spectrometry has gained a wide recognition as a selective and fast technique for the analysis and assessment of a wide range of food products. The state of the art in the determination of safety and quality of food is presented to illustrate the capability of this technique for classification and grading, defect and disease detection, distribution and visualization of chemical attributes, and evaluations of overall quality of meat, fish, fruits, vegetables, and other food products. The features of mass spectrometry for each category were summarized in the aspects of the investigated quality and safety attributes, the used systems (triple quadrupole, quadrupole…

Computer sciencebusiness.industrymedia_common.quotation_subjectMass spectrometrycomputer.software_genreFood safetyOrbitrapTriple quadrupole mass spectrometerlaw.inventionChemometricslawData analysisQuality (business)Data miningFood qualitybusinesscomputermedia_common
researchProduct