Search results for "selection"

showing 10 items of 1940 documents

Optimal band selection for future satellite sensor dedicated to soil science

2009

Hyperspectral imaging systems could be used for identifying the different soil types from the satellites. However, detecting the reflectance of the soils in all the wavelengths involves the use of a large number of sensors with high accuracy and also creates a problem in transmitting the data to earth stations for processing. The current sensors can reach a bandwidth of 20 nm and hence, the reflectance obtained using the sensors are the integration of reflectance obtained in each of the wavelength present in the spectral band. Moreover, not all spectral bands contribute equally to classification and hence, identifying the bands necessary to have a good classification is necessary to reduce …

Statistical classificationContextual image classificationComputer scienceBandwidth (signal processing)Hyperspectral imagingSatelliteFeature selectionSpectral bandsData transmissionRemote sensing2009 First Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing

researchProduct

Conflict and segregation in networks: An experiment on the interplay between individual preferences and social influence

2016

We examine the interplay between a person's individual preference and the social influence others exert. We provide a model of network relationships with conflicting preferences, where individuals are better off coordinating with those around them, but where not all have a preference for the same action. We test our model in an experiment, varying the level of conflicting preferences between individuals. Our findings suggest that preferences are more salient than social influence, under conflicting preferences: subjects relate mainly with others who have the same preferences. This leads to two undesirable outcomes: network segregation and social inefficiency. The same force that helps peopl…

Statistics and Probability0209 industrial biotechnology021103 operations researchApplied Mathematicsjel:D85jel:C72jel:D820211 other engineering and technologiesjel:C6202 engineering and technologyEconomiaHeterogeneity Social Networks Formation Equilibrium selectionPreferenceTest (assessment)020901 industrial engineering & automationAction (philosophy)SalientEquilibrium selectionModeling and SimulationEconomicsInefficiencySocial psychologySocial influenceJournal of Dynamics and Games

researchProduct

A penalized approach to covariate selection through quantile regression coefficient models

2019

The coefficients of a quantile regression model are one-to-one functions of the order of the quantile. In standard quantile regression (QR), different quantiles are estimated one at a time. Another possibility is to model the coefficient functions parametrically, an approach that is referred to as quantile regression coefficients modeling (QRCM). Compared with standard QR, the QRCM approach facilitates estimation, inference and interpretation of the results, and generates more efficient estimators. We designed a penalized method that can address the selection of covariates in this particular modelling framework. Unlike standard penalized quantile regression estimators, in which model selec…

Statistics and Probability05 social sciencesQuantile regression model01 natural sciencesQuantile regressionInspiratory capacity010104 statistics & probabilitypenalized quantile regression coefficients modelling (QRCM p )Lasso penalty0502 economics and businessCovariateStatisticsPenalized integrated loss minimization (PILM)tuning parameter selection0101 mathematicsStatistics Probability and UncertaintySelection (genetic algorithm)050205 econometrics MathematicsQuantile

researchProduct

Asymptotic optimality of myopic information-based strategies for Bayesian adaptive estimation

2016

This paper presents a general asymptotic theory of sequential Bayesian estimation giving results for the strongest, almost sure convergence. We show that under certain smoothness conditions on the probability model, the greedy information gain maximization algorithm for adaptive Bayesian estimation is asymptotically optimal in the sense that the determinant of the posterior covariance in a certain neighborhood of the true parameter value is asymptotically minimal. Using this result, we also obtain an asymptotic expression for the posterior entropy based on a novel definition of almost sure convergence on "most trials" (meaning that the convergence holds on a fraction of trials that converge…

Statistics and ProbabilityAsymptotic analysisMathematical optimizationPosterior probabilityBayesian probabilityMathematics - Statistics TheoryStatistics Theory (math.ST)050105 experimental psychologydifferential entropyDifferential entropyactive data selection03 medical and health sciences0302 clinical medicineactive learningFOS: Mathematics0501 psychology and cognitive sciencescost of observationdecision theoryMathematicsD-optimalityBayes estimatorSequential estimation05 social sciencesBayesian adaptive estimationAsymptotically optimal algorithmConvergence of random variablesasymptotic optimalitysequential estimation030217 neurology & neurosurgery

researchProduct

Breaking the curse of dimensionality in quadratic discriminant analysis models with a novel variant of a Bayes classifier enhances automated taxa ide…

2013

Macroinvertebrate samples are commonly used in biomonitoring to study changes on aquatic ecosystems. Traditionally, specimens are identified manually to taxa by human experts being time-consuming and cost intensive. Using the image data of 35 taxa and 64 features, we propose a novel variant of the quadratic discriminant analysis for breaking the curse of dimensionality in quadratic discriminant analysis models. Our variant, called a random Bayes array (RBA), uses bagging and random feature selection similar to random forest. We explore several variations of RBA. We consider three classification (i.e taxa identification) decisions: majority vote, averaged posterior probabilities, and a novel…

Statistics and ProbabilityBayes' theoremEcological ModelingBayesian probabilityStatisticsPosterior probabilityFeature selectionContext (language use)Bayes classifierQuadratic classifierMathematicsRandom forestEnvironmetrics

researchProduct

Automatic variable selection for exposure-driven propensity score matching with unmeasured confounders.

2020

Multivariable model building for propensity score modeling approaches is challenging. A common propensity score approach is exposure-driven propensity score matching, where the best model selection strategy is still unclear. In particular, the situation may require variable selection, while it is still unclear if variables included in the propensity score should be associated with the exposure and the outcome, with either the exposure or the outcome, with at least the exposure or with at least the outcome. Unmeasured confounders, complex correlation structures, and non-normal covariate distributions further complicate matters. We consider the performance of different modeling strategies in …

Statistics and ProbabilityBiometryModels StatisticalComputer scienceModel selectionFeature selectionGeneral MedicineVariance (accounting)01 natural sciencesOutcome (game theory)Correlation010104 statistics & probability03 medical and health sciencesAutomation0302 clinical medicineCovariatePropensity score matchingStatisticsMultivariate Analysis030212 general & internal medicine0101 mathematicsStatistics Probability and UncertaintyPropensity ScoreCounterexampleBiometrical journal. Biometrische ZeitschriftREFERENCES

researchProduct

Cluster-Localized Sparse Logistic Regression for SNP Data

2012

The task of analyzing high-dimensional single nucleotide polymorphism (SNP) data in a case-control design using multivariable techniques has only recently been tackled. While many available approaches investigate only main effects in a high-dimensional setting, we propose a more flexible technique, cluster-localized regression (CLR), based on localized logistic regression models, that allows different SNPs to have an effect for different groups of individuals. Separate multivariable regression models are fitted for the different groups of individuals by incorporating weights into componentwise boosting, which provides simultaneous variable selection, hence sparse fits. For model fitting, th…

Statistics and ProbabilityBoosting (machine learning)Computer scienceMultivariable calculusComputational BiologyHigh-Throughput Nucleotide SequencingFeature selectionRegression analysisModels TheoreticalLogistic regressioncomputer.software_genrePolymorphism Single NucleotideRegressionComputational MathematicsLogistic ModelsData Interpretation StatisticalGeneticsCluster AnalysisHumansData miningCluster analysisMolecular BiologyUnit-weighted regressioncomputerGenome-Wide Association StudyStatistical Applications in Genetics and Molecular Biology

researchProduct

Sample size planning for survival prediction with focus on high-dimensional data

2011

Sample size planning should reflect the primary objective of a trial. If the primary objective is prediction, the sample size determination should focus on prediction accuracy instead of power. We present formulas for the determination of training set sample size for survival prediction. Sample size is chosen to control the difference between optimal and expected prediction error. Prediction is carried out by Cox proportional hazards models. The general approach considers censoring as well as low-dimensional and high-dimensional explanatory variables. For dimension reduction in the high-dimensional setting, a variable selection step is inserted. If not all informative variables are included…

Statistics and ProbabilityClustering high-dimensional dataClinical Trials as TopicLung NeoplasmsModels StatisticalKaplan-Meier EstimateEpidemiologyProportional hazards modelDimensionality reductionGene ExpressionFeature selectionKaplan-Meier EstimateBiostatisticsPrognosisBrier scoreSample size determinationCarcinoma Non-Small-Cell LungSample SizeCensoring (clinical trials)StatisticsHumansProportional Hazards ModelsMathematicsStatistics in Medicine

researchProduct

Bayesian subset selection for additive and linear loss function

1979

Given k independent samples of common size n from k populations πj,…,πk with distribution the problem is to select a non-empty subset form {πj,…,πk}, which is associated with "good" (large) θ-values. We consider this problem from a Bayesian approach. By choosing additive and especially linear loss functions we try to fill a gap lying in between the results of Deely and Gupta (1968) and more recent papers due to Goel and Rubin (1977), Gupta and Hsu (1978) and other authors. It is shown that under acertain "normal model" Seal's procedure turns out to be Bayes w.r.t. an unrealistic loss function where as Gupta's maximunl means procedure turns out to be ( for large n) asymptotically Bayes w.r. …

Statistics and ProbabilityCombinatoricsBayes' theoremDistribution (mathematics)Selection (relational algebra)Bayesian probabilityStatisticsGoelKalman filterFunction (mathematics)RegressionMathematicsCommunications in Statistics - Theory and Methods

researchProduct

A generalized predictive criterion for model selection

2002

Given a random sample from some unknown model belonging to a finite class of parametric models, assume that the estimate of the density of a future observation is of interest San Martini & Spezzaferri (1984) proposed for this problem a predictive criterion based on the logarithmic utility function. The present authors investigate a generalization of this criterion that uses as a loss function an element of the class of α-divergences discussed by Ali & Silvey (1966) and Csiszar (1967). They also discuss briefly the case in which the class of models considered is not exhaustive. Un critere de prevision generalise pour la selection de modeles Supposons que l'on cherche a estimer la densite d'u…

Statistics and ProbabilityCombinatoricsmodel selectionModel selectionCalculusloss function; model selection; α-divergencesStatistics Probability and Uncertaintyα-divergencesMathematicsloss function

researchProduct