Search results for "Component analysis"

showing 10 items of 562 documents

Blind Source Separation Based on Joint Diagonalization in R: The Packages JADE and BSSasymp

2017

Blind source separation (BSS) is a well-known signal processing tool which is used to solve practical data analysis problems in various fields of science. In BSS, we assume that the observed data consists of linear mixtures of latent variables. The mixing system and the distributions of the latent variables are unknown. The aim is to find an estimate of an unmixing matrix which then transforms the observed data back to latent sources. In this paper we present the R packages JADE and BSSasymp. The package JADE offers several BSS methods which are based on joint diagonalization. Package BSSasymp contains functions for computing the asymptotic covariance matrices as well as their data-based es…

Statistics and ProbabilityComputer scienceJADE (programming language)02 engineering and technologyLatent variableMachine learningcomputer.software_genre01 natural sciencesBlind signal separation010104 statistics & probabilityMatrix (mathematics)nonstationary source separationMixing (mathematics)0202 electrical engineering electronic engineering information engineeringsecond order source separation0101 mathematicslcsh:Statisticslcsh:HA1-4737computer.programming_languageta113Signal processingta112matematiikkamultivariate time seriesmathematicsbusiness.industryEstimator020206 networking & telecommunicationsriippumattomien komponenttien analyysiindependent component analysis; multivariate time series; nonstationary source separation; performance indices; second order source separationIndependent component analysisperformance indicesstatisticsindependent component analysisArtificial intelligenceStatistics Probability and UncertaintybusinesscomputerAlgorithmSoftwareJournal of Statistical Software

researchProduct

Fast Estimation of the Median Covariation Matrix with Application to Online Robust Principal Components Analysis

2017

International audience; The geometric median covariation matrix is a robust multivariate indicator of dispersion which can be extended without any difficulty to functional data. We define estimators, based on recursive algorithms, that can be simply updated at each new observation and are able to deal rapidly with large samples of high dimensional data without being obliged to store all the data in memory. Asymptotic convergence properties of the recursive algorithms are studied under weak conditions. The computation of the principal components can also be performed online and this approach can be useful for online outlier detection. A simulation study clearly shows that this robust indicat…

Statistics and ProbabilityComputer scienceMathematics - Statistics TheoryStatistics Theory (math.ST)01 natural sciences010104 statistics & probabilityMatrix (mathematics)Dimension (vector space)Geometric medianStochastic gradientFOS: Mathematics0101 mathematicsL1-median010102 general mathematicsEstimator[STAT.TH]Statistics [stat]/Statistics Theory [stat.TH]Geometric medianCovariance[ STAT.TH ] Statistics [stat]/Statistics Theory [stat.TH]Functional dataMSC: 62G05 62L20Principal component analysisProjection pursuitAnomaly detectionRecursive robust estimationStatistics Probability and UncertaintyAlgorithm

researchProduct

Weighted samples, kernel density estimators and convergence

2003

This note extends the standard kernel density estimator to the case of weighted samples in several ways. In the first place I consider the obvious extension by substituting the simple sum in the definition of the estimator by a weighted sum, but I also consider other alternatives of introducing weights, based on adaptive kernel density estimators, and consider the weights as indicators of the informational content of the observations and in this sense as signals of the local density of the data. All these ideas are shown using the Penn World Table in the context of the macroeconomic convergence issue.

Statistics and ProbabilityEconomics and EconometricsMathematical optimizationKernel density estimationEstimatorMultivariate kernel density estimationKernel principal component analysisMathematics (miscellaneous)Penn World TableKernel embedding of distributionsVariable kernel density estimationKernel (statistics)Applied mathematicsSocial Sciences (miscellaneous)MathematicsEmpirical Economics

researchProduct

Symmetrised M-estimators of multivariate scatter

2007

AbstractIn this paper we introduce a family of symmetrised M-estimators of multivariate scatter. These are defined to be M-estimators only computed on pairwise differences of the observed multivariate data. Symmetrised Huber's M-estimator and Dümbgen's estimator serve as our examples. The influence functions of the symmetrised M-functionals are derived and the limiting distributions of the estimators are discussed in the multivariate elliptical case to consider the robustness and efficiency properties of estimators. The symmetrised M-estimators have the important independence property; they can therefore be used to find the independent components in the independent component analysis (ICA).

Statistics and ProbabilityElliptical distributionInfluence functionMultivariate statisticsNumerical AnalysisEstimatorEfficiencyM-estimatorM-estimatorIndependent component analysisEfficient estimatorScatter matrixScatter matrixMathematics::Category TheoryStatisticsApplied mathematicsStatistics Probability and UncertaintyRobustnessElliptical distributionIndependence (probability theory)MathematicsJournal of Multivariate Analysis

researchProduct

Robust estimation and inference for bivariate line-fitting in allometry.

2011

In allometry, bivariate techniques related to principal component analysis are often used in place of linear regression, and primary interest is in making inferences about the slope. We demonstrate that the current inferential methods are not robust to bivariate contamination, and consider four robust alternatives to the current methods -- a novel sandwich estimator approach, using robust covariance matrices derived via an influence function approach, Huber's M-estimator and the fast-and-robust bootstrap. Simulations demonstrate that Huber's M-estimators are highly efficient and robust against bivariate contamination, and when combined with the fast-and-robust bootstrap, we can make accurat…

Statistics and ProbabilityHeteroscedasticityAnalysis of VarianceCovariance matrixRobust statisticsEstimatorGeneral MedicineBivariate analysisCovarianceBiostatisticsStatistics::ComputationEfficient estimatorPrincipal component analysisStatisticsEconometricsStatistics::MethodologyBody SizeStatistics Probability and UncertaintyMathematicsProbabilityBiometrical journal. Biometrische Zeitschrift

researchProduct

2019

In the independent component model, the multivariate data are assumed to be a mixture of mutually independent latent components. The independent component analysis (ICA) then aims at estimating these latent components. In this article, we study an ICA method which combines the use of linear and quadratic autocorrelations to enable efficient estimation of various kinds of stationary time series. Statistical properties of the estimator are studied by finding its limiting distribution under general conditions, and the asymptotic variances are derived in the case of ARMA-GARCH model. We use the asymptotic results and a finite sample simulation study to compare different choices of a weight coef…

Statistics and ProbabilityHeteroscedasticityStochastic volatilityApplied Mathematics05 social sciencesAutocorrelationAsymptotic distributionEstimator01 natural sciencesIndependent component analysis010104 statistics & probabilityComponent analysis0502 economics and businessTest statisticApplied mathematics0101 mathematicsStatistics Probability and Uncertainty050205 econometrics MathematicsJournal of Time Series Analysis

researchProduct

STATIS and DISTATIS: optimum multitable principal component analysis and three way metric multidimensional scaling

2012

STATIS is an extension of principal component analysis PCA tailored to handle multiple data tables that measure sets of variables collected on the same observations, or, alternatively, as in a variant called dual-STATIS, multiple data tables where the same variables are measured on different sets of observations. STATIS proceeds in two steps: First it analyzes the between data table similarity structure and derives from this analysis an optimal set of weights that are used to compute a linear combination of the data tables called the compromise that best represents the information common to the different data tables; Second, the PCA of this compromise gives an optimal map of the observation…

Statistics and ProbabilityMathematical optimizationSimilarity (geometry)[STAT.TH]Statistics [stat]/Statistics Theory [stat.TH]Linear discriminant analysiscomputer.software_genre01 natural sciences[ STAT.TH ] Statistics [stat]/Statistics Theory [stat.TH]Correspondence analysisSet (abstract data type)010104 statistics & probability03 medical and health sciences0302 clinical medicine[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]Multiple factor analysisPrincipal component analysisMetric (mathematics)Data miningMultidimensional scaling[ MATH.MATH-ST ] Mathematics [math]/Statistics [math.ST]0101 mathematicscomputer030217 neurology & neurosurgeryComputingMilieux_MISCELLANEOUSMathematics

researchProduct

Multiple factor analysis: principal component analysis for multitable and multiblock data sets

2013

Multiple factor analysis MFA, also called multiple factorial analysis is an extension of principal component analysis PCA tailored to handle multiple data tables that measure sets of variables coll...

Statistics and ProbabilityMeasure (data warehouse)business.industryPattern recognitionMultiple dataMultiple correspondence analysisRelationship squareMultiple factor analysisPrincipal component analysisArtificial intelligenceFactorial analysisGeneralized singular value decompositionbusinessMathematicsWiley Interdisciplinary Reviews: Computational Statistics

researchProduct

Affine-invariant rank tests for multivariate independence in independent component models

2016

We consider the problem of testing for multivariate independence in independent component (IC) models. Under a symmetry assumption, we develop parametric and nonparametric (signed-rank) tests. Unlike in independent component analysis (ICA), we allow for the singular cases involving more than one Gaussian independent component. The proposed rank tests are based on componentwise signed ranks, à la Puri and Sen. Unlike the Puri and Sen tests, however, our tests (i) are affine-invariant and (ii) are, for adequately chosen scores, locally and asymptotically optimal (in the Le Cam sense) at prespecified densities. Asymptotic local powers and asymptotic relative efficiencies with respect to Wilks’…

Statistics and ProbabilityMultivariate statisticssingular information matricesRank (linear algebra)Gaussianuniform local asymptotic02 engineering and technology01 natural sciencesdistribution-free testsCombinatoricstests for multivariate independence010104 statistics & probabilitysymbols.namesakenormaalius0202 electrical engineering electronic engineering information engineeringApplied mathematics0101 mathematicsStatistique mathématiqueIndependence (probability theory)Parametric statisticsMathematicsDistribution-free testsuniform local asymptotic normalityNonparametric statistics020206 networking & telecommunicationsIndependent component analysisrank testsAsymptotically optimal algorithmsymbolsindependent component models62H1562G35Statistics Probability and UncertaintyUniform local asymptotic normality62G10

researchProduct

Gamma Kernel Intensity Estimation in Temporal Point Processes

2011

In this article, we propose a nonparametric approach for estimating the intensity function of temporal point processes based on kernel estimators. In particular, we use asymmetric kernel estimators characterized by the gamma distribution, in order to describe features of observed point patterns adequately. Some characteristics of these estimators are analyzed and discussed both through simulated results and applications to real data from different seismic catalogs.

Statistics and ProbabilityNonparametric statisticsEstimatorKernel principal component analysisPoint processVariable kernel density estimationKernel embedding of distributionsModeling and SimulationKernel (statistics)Bounded domainStatisticsGamma distributionGamma kernel estimatorIntensity functionTemporal point processes.Settore SECS-S/01 - StatisticaMathematicsCommunications in Statistics - Simulation and Computation

researchProduct