Search results for " function"

showing 10 items of 9395 documents

The Power of Word-Frequency Based Alignment-Free Functions: a Comprehensive Large-Scale Experimental Analysis

2021

Abstract Motivation Alignment-free (AF) distance/similarity functions are a key tool for sequence analysis. Experimental studies on real datasets abound and, to some extent, there are also studies regarding their control of false positive rate (Type I error). However, assessment of their power, i.e. their ability to identify true similarity, has been limited to some members of the D2 family. The corresponding experimental studies have concentrated on short sequences, a scenario no longer adequate for current applications, where sequence lengths may vary considerably. Such a State of the Art is methodologically problematic, since information regarding a key feature such as power is either mi…

Statistics and ProbabilitySequenceSimilarity (geometry)Settore INF/01 - Informaticasequence analysisComputer sciencepower statisticsAlignment-Free Genomic Analysis Big Data Software Platforms Bioinformatics AlgorithmsScale (descriptive set theory)Function (mathematics)computer.software_genreBiochemistryComputer Science ApplicationsSet (abstract data type)Computational MathematicsRange (mathematics)Computational Theory and Mathematicssequence analysis; power statistics; alignment-free functionsalignment-free functionsData miningCompleteness (statistics)Molecular BiologycomputerType I and type II errors
researchProduct

Estimating growth charts via nonparametric quantile regression: a practical framework with application in ecology.

2013

We discuss a practical and effective framework to estimate reference growth charts via regression quantiles. Inequality constraints are used to ensure both monotonicity and non-crossing of the estimated quantile curves and penalized splines are employed to model the nonlinear growth patterns with respect to age. A companion R package is presented and relevant code discussed to favour spreading and application of the proposed methods.

Statistics and ProbabilitySettore BIO/07 - EcologiaStatistics::TheoryEcology (disciplines)Nonparametric statisticsMonotonic functionRegressionStatistics::ComputationQuantile regressionNonlinear systemR packageStatisticsEconometricsStatistics::MethodologyGrowth charts Nonparametric regression quantiles Penalized splines P. oceanica modelling R softwareStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaGeneral Environmental ScienceMathematicsQuantile
researchProduct

A note on Malliavin smoothness on the Lévy space

2017

We consider Malliavin calculus based on the Itô chaos decomposition of square integrable random variables on the Lévy space. We show that when a random variable satisfies a certain measurability condition, its differentiability and fractional differentiability can be determined by weighted Lebesgue spaces. The measurability condition is satisfied for all random variables if the underlying Lévy process is a compound Poisson process on a finite time interval. peerReviewed

Statistics and ProbabilitySmoothness (probability theory)matematiikkaLévy processMalliavin calculus010102 general mathematicsMalliavin calculus01 natural sciencesLévy processinterpolation010104 statistics & probability60H07Mathematics::ProbabilitySquare-integrable functionCompound Poisson processApplied mathematicsinterpolointiDifferentiable functiontila0101 mathematicsStatistics Probability and UncertaintyLp spaceRandom variable60G51MathematicsElectronic Communications in Probability
researchProduct

kmcEx: memory-frugal and retrieval-efficient encoding of counted k-mers.

2018

Abstract Motivation K-mers along with their frequency have served as an elementary building block for error correction, repeat detection, multiple sequence alignment, genome assembly, etc., attracting intensive studies in k-mer counting. However, the output of k-mer counters itself is large; very often, it is too large to fit into main memory, leading to highly narrowed usability. Results We introduce a novel idea of encoding k-mers as well as their frequency, achieving good memory saving and retrieval efficiency. Specifically, we propose a Bloom filter-like data structure to encode counted k-mers by coupled-bit arrays—one for k-mer representation and the other for frequency encoding. Exper…

Statistics and ProbabilitySource codeComputer sciencemedia_common.quotation_subject0206 medical engineeringHash function02 engineering and technologyBiochemistry03 medical and health sciencesEncoding (memory)Molecular BiologyTime complexity030304 developmental biologyBlock (data storage)media_common0303 health sciencesSequence Analysis DNAData structureComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsError detection and correctionAlgorithmSequence Alignment020602 bioinformaticsAlgorithmsSoftwareBioinformatics (Oxford, England)
researchProduct

Volatility in Financial Markets: Stochastic Models and Empirical Results

2002

We investigate the historical volatility of the 100 most capitalized stocks traded in US equity markets. An empirical probability density function (pdf) of volatility is obtained and compared with the theoretical predictions of a lognormal model and of the Hull and White model. The lognormal model well describes the pdf in the region of low values of volatility whereas the Hull and White model better approximates the empirical pdf for large values of volatility. Both models fails in describing the empirical pdf over a moderately large volatility range.

Statistics and ProbabilityStatistical Finance (q-fin.ST)Statistical Mechanics (cond-mat.stat-mech)Stochastic modellingEconophysicFinancial marketFOS: Physical sciencesQuantitative Finance - Statistical FinanceStatistical and Nonlinear PhysicsProbability density functionStochastic processeCondensed Matter PhysicsEmpirical probabilitySettore FIS/07 - Fisica Applicata(Beni Culturali Ambientali Biol.e Medicin)FOS: Economics and businessVolatilityLognormal modelHullEconomicsEconometricsMathematical PhysicVolatility (finance)Condensed Matter - Statistical Mechanics
researchProduct

Escape Times in Fluctuating Metastable Potential and Acceleration of Diffusion in Periodic Fluctuating Potentials

2004

The problems of escape from metastable state in randomly flipping potential and of diffusion in fast fluctuating periodic potentials are considered. For the overdamped Brownian particle moving in a piecewise linear dichotomously fluctuating metastable potential we obtain the mean first-passage time (MFPT) as a function of the potential parameters, the noise intensity and the mean rate of switchings of the dichotomous noise. We find noise enhanced stability (NES) phenomenon in the system investigated and the parameter region of the fluctuating potential where the effect can be observed. For the diffusion of the overdamped Brownian particle in a fast fluctuating symmetric periodic potential w…

Statistics and ProbabilityStatistical Mechanics (cond-mat.stat-mech)FOS: Physical sciencesSawtooth waveCondensed Matter PhysicsNoise (electronics)Fluctuating Metastable PotentialPiecewise linear functionClassical mechanicsMetastabilityPiecewiseEffective diffusion coefficientStatistical physicsDiffusion (business)Brownian motionCondensed Matter - Statistical MechanicsMathematics
researchProduct

Heavy-tailed targets and (ab)normal asymptotics in diffusive motion

2010

We investigate temporal behavior of probability density functions (pdfs) of paradigmatic jump-type and continuous processes that, under confining regimes, share common heavy-tailed asymptotic (target) pdfs. Namely, we have shown that under suitable confinement conditions, the ordinary Fokker-Planck equation may generate non-Gaussian heavy-tailed pdfs (like e.g. Cauchy or more general L\'evy stable distribution) in its long time asymptotics. For diffusion-type processes, our main focus is on their transient regimes and specifically the crossover features, when initially infinite number of the pdf moments drops down to a few or none at all. The time-dependence of the variance (if in existence…

Statistics and ProbabilityStatistical Mechanics (cond-mat.stat-mech)Stochastic processMathematical analysisCrossoverProbability (math.PR)Cauchy distributionFOS: Physical sciencesProbability and statisticsProbability density functionMathematical Physics (math-ph)Condensed Matter Physicslaw.inventionlawUniversal TimePhysics - Data Analysis Statistics and ProbabilityExponentFOS: MathematicsFokker–Planck equationCondensed Matter - Statistical MechanicsMathematical PhysicsMathematics - ProbabilityData Analysis Statistics and Probability (physics.data-an)Mathematics
researchProduct

Parameter orthogonality and conditional profile likelihood: the exponential power function case

1999

Orthogonality, according to Fisher’s metrics, between the parameters of a probability density function, as well as giving rise to a series of statistical implications, makes it possible to express a function of conditional profile likelihood with better properties than the ordinary profile likelihood function. In the present paper the parameters of exponential power function are made orthogonal and the conditional profile likelihood of the shape parameter p is determined in order to study its properties with reference to p estimation. Moreover, by means of a simulation plan, a comparison is made between the estimates of p obtained from the conditional profile log-likelihood and those obtain…

Statistics and ProbabilityStatisticsApplied mathematicsProbability density functionDensity estimationConditional probability distributionLikelihood functionLikelihood principleConditional varianceShape parameterExponential functionMathematicsCommunications in Statistics - Theory and Methods
researchProduct

The Induced Smoothed lasso: A practical framework for hypothesis testing in high dimensional regression.

2020

This paper focuses on hypothesis testing in lasso regression, when one is interested in judging statistical significance for the regression coefficients in the regression equation involving a lot of covariates. To get reliable p-values, we propose a new lasso-type estimator relying on the idea of induced smoothing which allows to obtain appropriate covariance matrix and Wald statistic relatively easily. Some simulation experiments reveal that our approach exhibits good performance when contrasted with the recent inferential tools in the lasso framework. Two real data analyses are presented to illustrate the proposed framework in practice.

Statistics and ProbabilityStatistics::TheoryInduced smoothingEpidemiologyComputer scienceFeature selectionWald test01 natural sciencesasthma researchStatistics::Machine Learning010104 statistics & probability03 medical and health sciencesHealth Information ManagementLasso (statistics)Linear regressionsparse modelsStatistics::MethodologyComputer Simulation0101 mathematicssandwich formula030304 developmental biologyStatistical hypothesis testing0303 health sciencesCovariance matrixlung functionRegression analysisStatistics::Computationsparse modelResearch DesignAlgorithmSmoothingvariable selectionStatistical methods in medical research
researchProduct

Clusters of effects curves in quantile regression models

2018

In this paper, we propose a new method for finding similarity of effects based on quantile regression models. Clustering of effects curves (CEC) techniques are applied to quantile regression coefficients, which are one-to-one functions of the order of the quantile. We adopt the quantile regression coefficients modeling (QRCM) framework to describe the functional form of the coefficient functions by means of parametric models. The proposed method can be utilized to cluster the effect of covariates with a univariate response variable, or to cluster a multivariate outcome. We report simulation results, comparing our approach with the existing techniques. The idea of combining CEC with QRCM per…

Statistics and ProbabilityStatistics::TheoryMultivariate statistics05 social sciencesUnivariateFunctional data analysis01 natural sciencesQuantile regressionQuantile regression coefficients modeling Multivariate analysis Functional data analysis Curves clustering Variable selection010104 statistics & probabilityComputational Mathematics0502 economics and businessParametric modelCovariateStatistics::MethodologyApplied mathematics0101 mathematicsStatistics Probability and UncertaintyCluster analysisSettore SECS-S/01 - Statistica050205 econometrics MathematicsQuantile
researchProduct