Search results for "Resampling"
showing 10 items of 40 documents
Multiple testing in candidate gene situations: a comparison of classical, discrete, and resampling-based procedures.
2011
In candidate gene association studies, usually several elementary hypotheses are tested simultaneously using one particular set of data. The data normally consist of partly correlated SNP information. Every SNP can be tested for association with the disease, e.g., using the Cochran-Armitage test for trend. To account for the multiplicity of the test situation, different types of multiple testing procedures have been proposed. The question arises whether procedures taking into account the discreteness of the situation show a benefit especially in case of correlated data. We empirically evaluate several different multiple testing procedures via simulation studies using simulated correlated SN…
Graphical representation of some duality relations in stochastic population models
2007
We derive a unified stochastic picture for the duality of a resampling-selection model with a branching-coalescing particle process (cf. http://www.ams.org/mathscinet-getitem?mr=MR2123250) and for the self-duality of Feller's branching diffusion with logistic growth (cf. math/0509612). The two dual processes are approximated by particle processes which are forward and backward processes in a graphical representation. We identify duality relations between the basic building blocks of the particle processes which lead to the two dualities mentioned above.
Tailoring sparse multivariable regression techniques for prognostic single-nucleotide polymorphism signatures.
2011
When seeking prognostic information for patients, modern technologies provide a huge amount of genomic measurements as a starting point. For single-nucleotide polymorphisms (SNPs), there may be more than one million covariates that need to be simultaneously considered with respect to a clinical endpoint. Although the underlying biological problem cannot be solved on the basis of clinical cohorts of only modest size, some important SNPs might still be identified. Sparse multivariable regression techniques have recently become available for automatically identifying prognostic molecular signatures that comprise relatively few covariates and provide reasonable prediction performance. For illus…
A weighted combined effect measure for the analysis of a composite time-to-first-event endpoint with components of different clinical relevance
2018
Composite endpoints combine several events within a single variable, which increases the number of expected events and is thereby meant to increase the power. However, the interpretation of results can be difficult as the observed effect for the composite does not necessarily reflect the effects for the components, which may be of different magnitude or even point in adverse directions. Moreover, in clinical applications, the event types are often of different clinical relevance, which also complicates the interpretation of the composite effect. The common effect measure for composite endpoints is the all-cause hazard ratio, which gives equal weight to all events irrespective of their type …
Robustness of PET Radiomics Features: Impact of Co-Registration with MRI
2021
Radiomics holds great promise in the field of cancer management. However, the clinical application of radiomics has been hampered by uncertainty about the robustness of the features extracted from the images. Previous studies have reported that radiomics features are sensitive to changes in voxel size resampling and interpolation, image perturbation, or slice thickness. This study aims to observe the variability of positron emission tomography (PET) radiomics features under the impact of co-registration with magnetic resonance imaging (MRI) using the difference percentage coefficient, and the Spearman’s correlation coefficient for three groups of images: (i) original PET, (ii) PET after co-…
Validation procedures in radiological diagnostic models. Neural network and logistic regression
1999
The objective of this paper is to compare the performance of two predictive radiological models, logistic regression (LR) and neural network (NN), with five different resampling methods. One hundred and sixty-seven patients with proven calvarial lesions as the only known disease were enrolled. Clinical and CT data were used for LR and NN models. Both models were developed with cross validation, leave-one-out and three different bootstrap algorithms. The final results of each model were compared with error rate and the area under receiver operating characteristic curves (Az). The neural network obtained statistically higher Az than LR with cross validation. The remaining resampling validatio…
Evaluation of the effect of chance correlations on variable selection using Partial Least Squares -Discriminant Analysis
2013
Variable subset selection is often mandatory in high throughput metabolomics and proteomics. However, depending on the variable to sample ratio there is a significant susceptibility of variable selection towards chance correlations. The evaluation of the predictive capabilities of PLSDA models estimated by cross-validation after feature selection provides overly optimistic results if the selection is performed on the entire set and no external validation set is available. In this work, a simulation of the statistical null hypothesis is proposed to test whether the discrimination capability of a PLSDA model after variable selection estimated by cross-validation is statistically higher than t…
Extrapolation of the Species Accumulation Curve for Incomplete Species Samplings: A New Nonparametric Approach to Estimate the Degree of Sample Compl…
2015
9 pages; International audience; Incomplete species samplings are deemed to remain the common practice in those groups of animals, such as small or micro- invertebrates, with numerous species that often are more or less difficult to detect in the field. Thus, extrapolating the Species Accumulation Curve as far as possible beyond the actual sample size may thus serve as a useful (although imperfect) surrogate to the desired, but practically inaccessible, complete samplings. In this context, several kinds of theoretical or empirical models for the Species Accumulation Curve and also a lot of estimators of the asymptotic limit of the Curve (i.e. total species richness) have been proposed. The …
Difficulty of getting accurate and precise estimates of population size: The case of the Siberian flying squirrel in Finland
2008
Accurate estimates of population size and distribution are a prerequisite for effective management of populations, but for most species a reliable estimation of the absolute population size is very difficult. In 1998, the Finnish Ministry of Environment set up a working group to plan a national-level survey for estimation and monitoring of the population size of Siberian flying squirrel (Pteromys volans). In 2006, the population size was reported to be 143 000 females. However, evaluation of the magnitude of possible biases was not attempted. Our aim was to test the population size estimate by conducting a resampling study with the methods of the national survey on an eartagged population o…
Alder pollen in Finland ripens after a short exposure to warm days in early spring, showing biennial variation in the onset of pollen ripening
2017
We developed a temperature sum model to predict the daily pollen release of alder, based on pollen data collected with pollen traps at seven locations in Finland over the years 2000–2014. We estimated the model parameters by minimizing the sum of squared errors (SSE) of the model, with weights that put more weight on binary recognition of daily presence or absence of pollen. The model results suggest that alder pollen ripens after a couple of warm days in February, while the whole pollen release period typically takes up to 4 weeks. We tested the model residuals against air humidity, precipitation and wind speed, but adding these meteorological features did not improve the model prediction …