Search results for "Dimensionality reduction"

showing 10 items of 120 documents

Semisupervised kernel orthonormalized partial least squares

2012

This paper presents a semisupervised kernel orthonormalized partial least squares (SS-KOPLS) algorithm for non-linear feature extraction. The proposed method finds projections that minimize the least squares regression error in Hilbert spaces and incorporates the wealth of unlabeled information to deal with small labeled datasets. The method relies on combining a standard RBF kernel that uses labeled information with a generative kernel learned by clustering all available data. The positive definiteness of the kernels is proven, and the structure and information content of the derived kernels are studied. The effectiveness of the proposed method is successfully illustrated in standard UCI d…

business.industry; Feature extraction; Nonlinear dimensionality reduction; Pattern recognition; ComputingMethodologies_PATTERNRECOGNITION; Kernel method; Variable kernel density estimation; Kernel (statistics); Radial basis function kernel; Partial least squares regression; Artificial intelligence; Cluster analysis; business; Mathematics
2012 IEEE International Workshop on Machine Learning for Signal Processing
researchProduct
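
The kernel combination mentioned in the SS-KOPLS abstract above can be illustrated with a minimal sketch. The abstract does not say how the two kernels are combined or how the generative kernel is built, so everything here is an assumption: the cluster kernel is a simple co-membership indicator, the cluster labels are hypothetical inputs rather than the output of a real clustering step, and the combination is a plain sum. It only shows why such a combination can remain positive semidefinite.

```python
import math

def rbf_kernel(X, gamma=0.5):
    """Gram matrix of the Gaussian RBF kernel exp(-gamma * ||x - y||^2)."""
    return [[math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, y)))
             for y in X] for x in X]

def cluster_kernel(labels):
    """K[i][j] = 1 if samples i and j share a cluster, else 0.

    This equals Z Z^T for the one-hot membership matrix Z,
    so it is positive semidefinite by construction.
    """
    return [[1.0 if a == b else 0.0 for b in labels] for a in labels]

def combine(K1, K2):
    """Entrywise sum (an assumed combination rule); a sum of PSD kernels is PSD."""
    return [[p + q for p, q in zip(r1, r2)] for r1, r2 in zip(K1, K2)]

# Toy data: two well-separated pairs of points.
X = [[0.0, 0.0], [0.1, 0.0], [3.0, 3.0], [3.1, 3.0]]
labels = [0, 0, 1, 1]  # hypothetical cluster assignments, not computed here

K = combine(rbf_kernel(X), cluster_kernel(labels))
```

In this toy case, samples in the same cluster receive a similarity boost of 1, while pairs from different clusters keep only their (small) RBF similarity.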

Applying fully tensorial ICA to fMRI data

2016

There are two aspects of functional magnetic resonance imaging (fMRI) data that make them awkward to analyse with traditional multivariate methods: high order and high dimension. The first of these refers to the tensorial nature of observations as array-valued elements instead of vectors. Although this can be circumvented by vectorizing the array, doing so simultaneously loses all the structural information in the original observations. The second aspect refers to the high dimensionality along each dimension making the concept of dimension reduction a valuable tool in the processing of fMRI data. Different methods of tensor dimension reduction are currently gaining popularity in the literature…

computer.software_genre; 01 natural sciences; Task (project management); 010104 statistics & probability; 03 medical and health sciences; 0302 clinical medicine; Dimension (vector space); medicine; Preprocessor; Tensor; 0101 mathematics; Mathematics; ta112; medicine.diagnostic_test; business.industry; Dimensionality reduction; fMRI; Pattern recognition; Independent component analysis; data; Principal component analysis; Data mining; Artificial intelligence; functional magnetic resonance imaging data; business; Functional magnetic resonance imaging; computer; 030217 neurology & neurosurgery
2016 IEEE Signal Processing in Medicine and Biology Symposium (SPMB)
researchProduct

Using affinity perturbations to detect web traffic anomalies

2013

The initial training phase of machine learning algorithms is usually computationally expensive as it involves the processing of huge matrices. Evolving datasets are challenging from this point of view because changing behavior requires updating the training. We propose a method for updating the training profile efficiently and a sliding window algorithm for online processing of the data in smaller fractions. This assumes the data is modeled by a kernel method that includes spectral decomposition. We demonstrate the algorithm with a web server request log where an actual intrusion attack is known to have happened. Updating the kernel dynamically using a sliding window technique prevents the proble…

diffusion maps; dimensionality reduction; web traffic; network traffic; eigenvalue problem; perturbation theory; anomaly detection
researchProduct
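
The sliding-window idea in the web traffic entry above can be sketched as follows. This is a deliberately naive illustration: it recomputes a Gaussian affinity matrix from scratch for every window, whereas the abstract describes an efficient update of the training profile that is not reproduced here. The function names, the `eps` bandwidth parameter, and the toy stream are all assumptions.

```python
import math
from collections import deque

def gaussian_affinity(points, eps=1.0):
    """Affinity matrix W[i][j] = exp(-||p_i - p_j||^2 / eps)."""
    return [[math.exp(-sum((a - b) ** 2 for a, b in zip(p, q)) / eps)
             for q in points] for p in points]

def sliding_affinities(stream, window_size, eps=1.0):
    """Yield one affinity matrix per full window as new samples arrive."""
    window = deque(maxlen=window_size)  # the oldest sample drops out automatically
    for x in stream:
        window.append(x)
        if len(window) == window_size:
            yield gaussian_affinity(list(window), eps)

# Toy stream of one-dimensional feature vectors (hypothetical data).
stream = [[0.0], [1.0], [2.0], [3.0], [4.0]]
mats = list(sliding_affinities(stream, window_size=3))
```

Each yielded matrix would then feed a spectral decomposition step; the cost of redoing that step for every window is what motivates an incremental update.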

Beyond Tandem Analysis: Joint Dimension Reduction and Clustering in R

2019

We present the R package clustrd which implements a class of methods that combine dimension reduction and clustering of continuous or categorical data. In particular, for continuous data, the package contains implementations of factorial K-means and reduced K-means; both methods combine principal component analysis with K-means clustering. For categorical data, the package provides MCA K-means, i-FCB and cluster correspondence analysis, which combine multiple correspondence analysis with K-means. Two examples on real data sets are provided to illustrate the usage of the main functions.

dimension reduction; clustering; principal component analysis; multiple correspondence analysis; K-means; Statistics and Probability; Factorial; Computer science; computer.software_genre; 01 natural sciences; Correspondence analysis; 010104 statistics & probability; 0101 mathematics; Cluster analysis; Categorical variable; lcsh:Statistics; lcsh:HA1-4737; Dimensionality reduction; k-means clustering; Data mining; HA29-32; Statistics, Probability and Uncertainty; computer; Software
Journal of Statistical Software
researchProduct
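
For context on the clustrd entry above, "tandem analysis" is the sequential baseline: reduce the dimension first, then cluster the reduced scores. The sketch below shows only that baseline, in Python rather than R; it is not reduced K-means or factorial K-means, and it does not use the clustrd API. The helper names and toy data are assumptions, and the power iteration assumes the start vector is not orthogonal to the leading principal axis.

```python
import math

def center(X):
    """Subtract the column means from every row."""
    n, d = len(X), len(X[0])
    means = [sum(row[j] for row in X) / n for j in range(d)]
    return [[row[j] - means[j] for j in range(d)] for row in X]

def first_principal_axis(Xc, iters=200):
    """Leading eigenvector of the scatter matrix Xc^T Xc via power iteration."""
    d = len(Xc[0])
    S = [[sum(r[a] * r[b] for r in Xc) for b in range(d)] for a in range(d)]
    v = [1.0 / math.sqrt(d)] * d
    for _ in range(iters):
        w = [sum(S[a][b] * v[b] for b in range(d)) for a in range(d)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    return v

def kmeans_1d(scores, k=2, iters=50):
    """Lloyd's algorithm on scalars; centroids start evenly spaced (needs k >= 2)."""
    lo, hi = min(scores), max(scores)
    centroids = [lo + (hi - lo) * i / (k - 1) for i in range(k)]
    for _ in range(iters):
        labels = [min(range(k), key=lambda c: abs(s - centroids[c])) for s in scores]
        for c in range(k):
            members = [s for s, l in zip(scores, labels) if l == c]
            if members:
                centroids[c] = sum(members) / len(members)
    return labels

# Toy data: two groups separated along the first two coordinates.
X = [[0.0, 0.1, 0.0], [0.2, 0.0, 0.1], [0.1, 0.2, 0.0],
     [5.0, 5.1, 0.1], [5.2, 4.9, 0.0], [4.9, 5.0, 0.1]]

Xc = center(X)
axis = first_principal_axis(Xc)                            # step 1: reduce
scores = [sum(a * b for a, b in zip(row, axis)) for row in Xc]
labels = kmeans_1d(scores, k=2)                            # step 2: cluster
```

Because the two steps optimize different criteria, the retained component need not be the one that best separates the clusters, which is the motivation for joint methods.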

Application of Electronic Nose for Evaluation of Wastewater Treatment Process Effects at Full-Scale WWTP

2019

This paper presents the results of studies aiming at the assessment and classification of wastewater using an electronic nose. During the experiment, an attempt was made to classify the medium based on an analysis of signals from a gas sensor array, the intensity of which depended on the levels of volatile compounds in the headspace gas mixture above the wastewater table. The research involved samples collected from the mechanical and biological treatment devices of a full-scale wastewater treatment plant (WWTP), as well as wastewater analysis. The measurements were carried out with a metal-oxide-semiconductor (MOS) gas sensor array, when coupled with a computing unit (e.g., a computer with…

electronic nose; Bioengineering; 010501 environmental sciences; lcsh:Chemical technology; 01 natural sciences; lcsh:Chemistry; Sensor array; wastewater treatment processes; Chemical Engineering (miscellaneous); lcsh:TP1-1185; multidimensional data analysis; Process engineering; 0105 earth and related environmental sciences; Multidimensional analysis; Electronic nose; business.industry; Process Chemistry and Technology; Dimensionality reduction; 010401 analytical chemistry; Supervised learning; odor nuisances; 0104 chemical sciences; gas sensor array; lcsh:QD1-999; Wastewater; Principal component analysis; wastewater quality; Environmental science; Sewage treatment; business
Processes
researchProduct

Large-scale nonlinear dimensionality reduction for network intrusion detection

2017

Network intrusion detection (NID) is a complex classification problem. In this paper, we combine classification with recent and scalable nonlinear dimensionality reduction (NLDR) methods. Classification and DR are not necessarily adversarial, provided that adequate cluster magnification occurs, as in NLDR methods like $t$-SNE: DR mitigates the curse of dimensionality, while cluster magnification can maintain class separability. We demonstrate experimentally the effectiveness of the approach by analyzing and comparing results on the big KDD99 dataset, using both NLDR quality assessment and classification rate for SVMs and random forests. Since data involves features of mixe…

intrusion detection; [INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]; [SPI.SIGNAL] Engineering Sciences [physics]/Signal and Image processing; [INFO.INFO-LG] Computer Science [cs]/Machine Learning [cs.LG]; [STAT.ML] Statistics [stat]/Machine Learning [stat.ML]; ComputingMethodologies_PATTERNRECOGNITION; Gower; dimensionality reduction
researchProduct

Knowledge discovery using diffusion maps

2013

knowledge discovery; scientometrics; analysis methods; data mining; monitoring systems; anomaly detection; machine learning; functional magnetic resonance imaging; data; big data; manifold learning; algorithms; diffusion maps; industry; cybersecurity; clustering; dimensionality reduction
researchProduct

Information-theoretic assessment of cardiovascular-brain networks during sleep

2015

This study was aimed at detecting the structure of the physiological network underlying the regulation of the cardiovascular and brain systems during normal sleep. To this end, we measured from the polysomnographic recordings of 10 healthy subjects the normalized spectral power of heart rate variability in the high frequency band (HF) and the EEG power in the δ, θ, α, σ, and β bands. Then, the causal statistical dependencies within and between these six time series were assessed in terms of internal information (conditional self entropy, CSE) and information transfer (transfer entropy, TE) computed via a linear method exploiting multiple regression models and a nonlinear method combining ne…

medicine.diagnostic_test; Computer science; business.industry; Speech recognition; Dimensionality reduction; Pattern recognition; Electroencephalography; Entropy estimation; Nonlinear system; Linear regression; Computer Science; Settore ING-INF/06 - Bioingegneria Elettronica E Informatica; medicine; Heart rate variability; Entropy (information theory); Transfer entropy; Artificial intelligence; business; Cardiology and Cardiovascular Medicine
researchProduct

A visualization technique for accessing solution pool in interactive methods of multiobjective optimization

2015

Interactive methods of multiobjective optimization repeatedly derive Pareto optimal solutions based on the decision maker's preference information and present the obtained solutions for his/her consideration. Some interactive methods save the obtained solutions into a solution pool and, at each iteration, allow the decision maker to consider any of the solutions obtained earlier. This feature contributes to the flexibility of exploring the Pareto optimal set and learning about the optimization problem. However, in the case of many objective functions, the accumulation of derived solutions makes accessing the sol…

multidimensional scaling; Mathematical optimization; Optimization problem; Computer Networks and Communications; Computer science; visualization; Pareto front visualization; computer.software_genre; Multi-objective optimization; Set (abstract data type); methods; dimensionality reduction; Flexibility (engineering); Pareto efficiency; interactive methods; NIMBUS; Computer Science Applications; Computational Theory and Mathematics; Feature (computer vision); interactivity; Data mining; computer
researchProduct

Intrusion detection applications using knowledge discovery and data mining

2014

access control; intrusion detection; knowledge discovery; data mining; monitoring systems; anomaly detection; big data; algorithms; cluster analysis; information security; cybersecurity; network attacks; dimensionality reduction; clustering
researchProduct