Search results for "algorithm"

showing 10 items of 4887 documents

Segmented mixed models with random changepoints: a maximum likelihood approach with application to treatment for depression study

2014

We present a simple and effective iterative procedure to estimate segmented mixed models in a likelihood based framework. Random effects and covariates are allowed for each model parameter, including the changepoint. The method is practical and avoids the computational burdens related to estimation of nonlinear mixed effects models. A conventional linear mixed model with proper covariates that account for the changepoints is the key to our estimating algorithm. We illustrate the method via simulations and using data from a randomized clinical trial focused on change in depressive symptoms over time which characteristically show two separate phases of change.

Statistics and ProbabilityMixed modelMaximum likelihoodrandom changepointRandom effects modelpsychiatric longitudinal dataGeneralized linear mixed modelNonlinear systemchangepointmixed segmented regressionStatisticsCovariateMixed effectsStatistics Probability and Uncertaintynonlinear mixed modelSettore SECS-S/01 - StatisticaAlgorithmDepressive symptomsMathematics
researchProduct

ballaxy: web services for structural bioinformatics.

2014

Abstract Motivation: Web-based workflow systems have gained considerable momentum in sequence-oriented bioinformatics. In structural bioinformatics, however, such systems are still relatively rare; while commercial stand-alone workflow applications are common in the pharmaceutical industry, academic researchers often still rely on command-line scripting to glue individual tools together. Results: In this work, we address the problem of building a web-based system for workflows in structural bioinformatics. For the underlying molecular modelling engine, we opted for the BALL framework because of its extensive and well-tested functionality in the field of structural bioinformatics. The large …

Statistics and ProbabilityModels MolecularComputer sciencecomputer.software_genreBiochemistryWorkflowStructural bioinformaticsUser-Computer InterfaceHumansMolecular Biologybusiness.industryComputational BiologySequence Analysis DNAData structureComputer Science ApplicationsVisualizationSystems IntegrationComputational MathematicsWorkflowComputational Theory and MathematicsScripting languageWeb serviceSoftware engineeringbusinesscomputerAlgorithmsSoftwareBioinformatics (Oxford, England)
researchProduct

Assessment of the probabilities for evolutionary structural changes in protein folds.

2007

Abstract Motivation: The evolution of protein sequences can be described by a stepwise process, where each step involves changes of a few amino acids. In a similar manner, the evolution of protein folds can be at least partially described by an analogous process, where each step involves comparatively simple changes affecting few secondary structure elements. A number of such evolution steps, justified by biologically confirmed examples, have previously been proposed by other researchers. However, unlike the situation with sequences, as far as we know there have been no attempts to estimate the comparative probabilities for different kinds of such structural changes. Results: We have tried …

Statistics and ProbabilityModels MolecularProtein FoldingProtein domainStructural alignmentBiologyBiochemistrySet (abstract data type)Evolution MolecularProtein structureSimilarity (network science)Sequence Analysis ProteinComputer SimulationMolecular BiologyProtein secondary structureConserved SequenceSequenceModels GeneticSequence Homology Amino AcidProteinsStructural Classification of Proteins databaseComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsModels ChemicalData Interpretation Statisticalsense organsAlgorithmSequence AlignmentBioinformatics (Oxford, England)
researchProduct

On a set of data for the membrane potential in a neuron

2006

We consider a set of data where the membrane potential in a pyramidal neuron is measured almost continuously in time, under varying experimental conditions. We use nonparametric estimates for the diffusion coefficient and the drift in view to contribute to the discussion which type of diffusion process is suitable to model the membrane potential in a neuron (more exactly: in a particular type of neuron under particular experimental conditions).

Statistics and ProbabilityModels NeurologicalNeural ConductionAction PotentialsTetrodotoxinType (model theory)Statistics NonparametricGeneral Biochemistry Genetics and Molecular BiologyMembrane PotentialsSet (abstract data type)MiceStatisticsAnimalsDiffusion (business)MathematicsCerebral CortexNeuronsMembrane potentialStochastic ProcessesQuantitative Biology::Neurons and CognitionGeneral Immunology and MicrobiologyStochastic processPyramidal CellsApplied MathematicsNonparametric statisticsGeneral MedicineElectrophysiologyElectrophysiologynervous systemDiffusion processModeling and SimulationPotassiumGeneral Agricultural and Biological SciencesBiological systemAlgorithmsMathematical Biosciences
researchProduct

Tests for Differentiation in Gene Expression Using a Data-Driven Order or Weights for Hypotheses

2005

In the analysis of gene expression by microarrays there are usually few subjects, but high-dimensional data. By means of techniques, such as the theory of spherical tests or with suitable permutation tests, it is possible to sort the endpoints or to give weights to them according to specific criteria determined by the data while controlling the multiple type I error rate. The procedures developed so far are based on a sequential analysis of weighted p-values (corresponding to the endpoints), including the most extreme situation of weighting leading to a complete order of p-values. When the data for the endpoints have approximately equal variances, these procedures show good power properties…

Statistics and ProbabilityModels StatisticalModels GeneticBiometricsGene Expression ProfilingWord error rateFamilywise error rateGeneral MedicineData-drivenWeightingData Interpretation StatisticalsortComputer Simulationp-valueStatistics Probability and UncertaintyAlgorithmAlgorithmsOligonucleotide Array Sequence AnalysisMathematicsType I and type II errorsBiometrical Journal
researchProduct

Multicanonical Monte Carlo simulations

1998

Canonical Monte Carlo simulations of disordered systems like spin glasses and systems undergoing first-order phase transitions are severely hampered by rare event states which lead to exponentially diverging autocorrelation times with increasing system size and hence to exponentially large statistical errors. One possibility to overcome this problem is the multicanonical reweighting method. Using standard local update algorithms it could be demonstrated that the dependence of autocorrelation times on the system size V is well described by a less divergent power law, τ∝Vα, with 1<α<3, depending on the system. After a brief review of the basic ideas, combinations of multicanonical reweighting…

Statistics and ProbabilityMultigrid methodMonte Carlo methodAutocorrelationExponentWang and Landau algorithmStatistical physicsCondensed Matter PhysicsRandom walkPower lawOrder of magnitudeMathematicsPhysica A: Statistical Mechanics and its Applications
researchProduct

CARE: context-aware sequencing read error correction.

2020

Abstract Motivation Error correction is a fundamental pre-processing step in many Next-Generation Sequencing (NGS) pipelines, in particular for de novo genome assembly. However, existing error correction methods either suffer from high false-positive rates since they break reads into independent k-mers or do not scale efficiently to large amounts of sequencing reads and complex genomes. Results We present CARE—an alignment-based scalable error correction algorithm for Illumina data using the concept of minhashing. Minhashing allows for efficient similarity search within large sequencing read collections which enables fast computation of high-quality multiple alignments. Sequencing errors ar…

Statistics and ProbabilityMultiple sequence alignmentComputer scienceSequence assemblyHigh-Throughput Nucleotide SequencingContext (language use)Sequence Analysis DNAcomputer.software_genreBiochemistryGenomeComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsHumansHuman genomeData miningError detection and correctionMolecular BiologycomputerSequence AlignmentAlgorithmsSoftwareBioinformatics (Oxford, England)
researchProduct

On stability issues in deriving multivariable regression models

2014

In many areas of science where empirical data are analyzed, a task is often to identify important variables with influence on an outcome. Most often this is done by using a variable selection strategy in the context of a multivariable regression model. Using a study on ozone effects in children (n = 496, 24 covariates), we will discuss aspects relevant for deriving a suitable model. With an emphasis on model stability, we will explore and illustrate differences between predictive models and explanatory models, the key role of stopping criteria, and the value of bootstrap resampling (with and without replacement). Bootstrap resampling will be used to assess variable selection stability, to d…

Statistics and ProbabilityMultivariable calculusStability (learning theory)Context (language use)Regression analysisFeature selectionGeneral MedicineVariance (accounting)StatisticsCovariateEconometricsStatistics Probability and UncertaintySelection (genetic algorithm)MathematicsBiometrical Journal
researchProduct

Affine-invariant rank tests for multivariate independence in independent component models

2016

We consider the problem of testing for multivariate independence in independent component (IC) models. Under a symmetry assumption, we develop parametric and nonparametric (signed-rank) tests. Unlike in independent component analysis (ICA), we allow for the singular cases involving more than one Gaussian independent component. The proposed rank tests are based on componentwise signed ranks, à la Puri and Sen. Unlike the Puri and Sen tests, however, our tests (i) are affine-invariant and (ii) are, for adequately chosen scores, locally and asymptotically optimal (in the Le Cam sense) at prespecified densities. Asymptotic local powers and asymptotic relative efficiencies with respect to Wilks’…

Statistics and ProbabilityMultivariate statisticssingular information matricesRank (linear algebra)Gaussianuniform local asymptotic02 engineering and technology01 natural sciencesdistribution-free testsCombinatoricstests for multivariate independence010104 statistics & probabilitysymbols.namesakenormaalius0202 electrical engineering electronic engineering information engineeringApplied mathematics0101 mathematicsStatistique mathématiqueIndependence (probability theory)Parametric statisticsMathematicsDistribution-free testsuniform local asymptotic normalityNonparametric statistics020206 networking & telecommunicationsIndependent component analysisrank testsAsymptotically optimal algorithmsymbolsindependent component models62H1562G35Statistics Probability and UncertaintyUniform local asymptotic normality62G10
researchProduct

A parallel and sensitive software tool for methylation analysis on multicore platforms.

2015

Abstract Motivation: DNA methylation analysis suffers from very long processing time, as the advent of Next-Generation Sequencers has shifted the bottleneck of genomic studies from the sequencers that obtain the DNA samples to the software that performs the analysis of these samples. The existing software for methylation analysis does not seem to scale efficiently neither with the size of the dataset nor with the length of the reads to be analyzed. As it is expected that the sequencers will provide longer and longer reads in the near future, efficient and scalable methylation software should be developed. Results: We present a new software tool, called HPG-Methyl, which efficiently maps bis…

Statistics and ProbabilityMutation rateTime FactorsComputer scienceReal-time computingBisulfite sequencingMolecular Sequence DataGenomicsParallel computingcomputer.software_genremedicine.disease_causeBiochemistryGenomeBottleneckchemistry.chemical_compoundSoftwareMutation RateDatabases GeneticmedicineHumansSulfitesMolecular BiologyMutationMulti-core processorGenomeBase Sequencebusiness.industryHigh-Throughput Nucleotide SequencingMethylationGenomicsDNA MethylationOriginal PapersComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicschemistryDNA methylationScalabilityMutationCompilerbusinesscomputerSequence AnalysisDNAAlgorithmsSoftwareBioinformatics (Oxford, England)
researchProduct