Search results for "Theoretical Computer Science"

showing 10 items of 1151 documents

On-line Construction of Two-Dimensional Suffix Trees

1999

AbstractWe say that a data structure is builton-lineif, at any instant, we have the data structure corresponding to the input we have seen up to that instant. For instance, consider the suffix tree of a stringx[1,n]. An algorithm building iton-lineis such that, when we have read the firstisymbols ofx[1,n], we have the suffix tree forx[1,i]. We present a new technique, which we refer to asimplicit updates, based on which we obtain: (a) an algorithm for theon-lineconstruction of the Lsuffix tree of ann×nmatrixA—this data structure is the two-dimensional analog of the suffix tree of a string; (b) simple algorithms implementing primitive operations forLZ1-typeon-line losslessimage compression m…

Statistics and ProbabilityCompressed suffix arrayNumerical AnalysisControl and OptimizationAlgebra and Number TheoryTheoretical computer scienceApplied MathematicsGeneral MathematicsSuffix treeString (computer science)Generalized suffix treelaw.inventionLongest common substring problemTree (data structure)lawSuffixAlgorithmFM-indexMathematicsJournal of Complexity

researchProduct

Fast and universal estimation of latent variable models using extended variational approximations

2022

AbstractGeneralized linear latent variable models (GLLVMs) are a class of methods for analyzing multi-response data which has gained considerable popularity in recent years, e.g., in the analysis of multivariate abundance data in ecology. One of the main features of GLLVMs is their capacity to handle a variety of responses types, such as (overdispersed) counts, binomial and (semi-)continuous responses, and proportions data. On the other hand, the inclusion of unobserved latent variables poses a major computational challenge, as the resulting marginal likelihood function involves an intractable integral for non-normally distributed responses. This has spurred research into a number of approx…

Statistics and ProbabilityComputational Theory and Mathematicsmultivariate abundance datamuuttujatlaplace approximationmulti-response dataordinationStatistics Probability and Uncertaintyvariational approximationsgeneralized linear latent variable modelsestimointiTheoretical Computer ScienceStatistics and Computing

researchProduct

Stochastic Learning for SAT- Encoded Graph Coloring Problems

2010

The graph coloring problem (GCP) is a widely studied combinatorial optimization problem due to its numerous applications in many areas, including time tabling, frequency assignment, and register allocation. The need for more efficient algorithms has led to the development of several GC solvers. In this paper, the authors introduce a team of Finite Learning Automata, combined with the random walk algorithm, using Boolean satisfiability encoding for the GCP. The authors present an experimental analysis of the new algorithm’s performance compared to the random walk technique, using a benchmark set containing SAT-encoding graph coloring test sets.

Statistics and ProbabilityDiscrete mathematicsControl and OptimizationTheoretical computer scienceComparability graphComputer Science ApplicationsGreedy coloringComputational MathematicsEdge coloringComputational Theory and MathematicsModeling and SimulationGraph (abstract data type)Decision Sciences (miscellaneous)Graph coloringFractional coloringGraph factorizationList coloringMathematicsInternational Journal of Applied Metaheuristic Computing

researchProduct

Reducing the effect of the data order in algorithms for constructing phylogenetic trees.

1988

Statistics and ProbabilityElectronic Data ProcessingTheoretical computer sciencePhylogenetic treeComputer scienceBiochemistryComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsMolecular BiologyAlgorithmAlgorithmsPhylogenySoftwareComputer applications in the biosciences : CABIOS

researchProduct

Study Design in Causal Models

2014

The causal assumptions, the study design and the data are the elements required for scientific inference in empirical research. The research is adequately communicated only if all of these elements and their relations are described precisely. Causal models with design describe the study design and the missing-data mechanism together with the causal structure and allow the direct application of causal calculus in the estimation of the causal effects. The flow of the study is visualized by ordering the nodes of the causal diagram in two dimensions by their causal order and the time of the observation. Conclusions on whether a causal or observational relationship can be estimated from the coll…

Statistics and ProbabilityEmpirical researchTheoretical computer scienceGraph (abstract data type)Graphical modelStatistics Probability and UncertaintyCausal structureMissing dataCausalityStructural equation modelingCausal modelMathematicsScandinavian Journal of Statistics

researchProduct

A Software Tool for the Exponential Power Distribution: The normalp Package

2005

In this paper we present the normalp package, a package for the statistical environment R that has a set of tools for dealing with the exponential power distribution. In this package there are functions to compute the density function, the distribution function and the quantiles from an exponential power distribution and to generate pseudo-random numbers from the same distribution. Moreover, methods concerning the estimation of the distribution parameters are described and implemented. It is also possible to estimate linear regression models when we assume the random errors distributed according to an exponential power distribution. A set of functions is designed to perform simulation studi…

Statistics and ProbabilityExponential distributionTheoretical computer scienceComputer scienceAsymptotic distributionDistribution fittingLaplace distributionExponential familyGamma distributionStatistics Probability and UncertaintyNatural exponential familyProbability integral transformAlgorithmlcsh:Statisticslcsh:HA1-4737exponential power distribution R estimation linear regressionSoftwareJournal of Statistical Software

researchProduct

The conditional censored graphical lasso estimator

2020

© 2020, Springer Science+Business Media, LLC, part of Springer Nature. In many applied fields, such as genomics, different types of data are collected on the same system, and it is not uncommon that some of these datasets are subject to censoring as a result of the measurement technologies used, such as data generated by polymerase chain reactions and flow cytometer. When the overall objective is that of network inference, at possibly different levels of a system, information coming from different sources and/or different steps of the analysis can be integrated into one model with the use of conditional graphical models. In this paper, we develop a doubly penalized inferential procedure for…

Statistics and ProbabilityFOS: Computer and information sciencesComputer scienceGaussianInferenceData typeTheoretical Computer Sciencehigh-dimensional settingDatabase normalizationMethodology (stat.ME)symbols.namesakeLasso (statistics)Graphical modelConditional Gaussian graphical modelcensored graphical lassoStatistics - MethodologyHigh-dimensional settingconditional Gaussian graphical modelssparsityEstimatorCensoring (statistics)Censored graphical lassoComputational Theory and MathematicssymbolsCensored dataStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaSparsityAlgorithm

researchProduct

Identifying Causal Effects with the R Package causaleffect

2017

Do-calculus is concerned with estimating the interventional distribution of an action from the observed joint probability distribution of the variables in a given causal structure. All identifiable causal effects can be derived using the rules of do-calculus, but the rules themselves do not give any direct indication whether the effect in question is identifiable or not. Shpitser and Pearl constructed an algorithm for identifying joint interventional distributions in causal models, which contain unobserved variables and induce directed acyclic graphs. This algorithm can be seen as a repeated application of the rules of do-calculus and known properties of probabilities, and it ultimately eit…

Statistics and ProbabilityFOS: Computer and information sciencesTheoretical computer sciencecausalityDistribution (number theory)C-componentComputer sciencecausal model02 engineering and technologyCausal structureMethodology (stat.ME)03 medical and health sciences0302 clinical medicinedo-calculusJoint probability distribution0202 electrical engineering electronic engineering information engineering030212 general & internal medicineDAG; do-calculus; causality; causal model; identifiability; graph; C-component; hedge; d-separationlcsh:Statisticslcsh:HA1-4737Statistics - Methodologycomputer.programming_languageCausal modelta112DAGd-separationgraphhedgeidentifiabilityExpression (mathematics)PEARL (programming language)Action (philosophy)kausaliteetti020201 artificial intelligence & image processingStatistics Probability and UncertaintycomputerSoftware

researchProduct

Extended differential geometric LARS for high-dimensional GLMs with general dispersion parameter

2018

A large class of modeling and prediction problems involves outcomes that belong to an exponential family distribution. Generalized linear models (GLMs) are a standard way of dealing with such situations. Even in high-dimensional feature spaces GLMs can be extended to deal with such situations. Penalized inference approaches, such as the $$\ell _1$$ or SCAD, or extensions of least angle regression, such as dgLARS, have been proposed to deal with GLMs with high-dimensional feature spaces. Although the theory underlying these methods is in principle generic, the implementation has remained restricted to dispersion-free models, such as the Poisson and logistic regression models. The aim of this…

Statistics and ProbabilityGeneralized linear modelMathematical optimizationGeneralized linear modelsPredictor-corrector algorithmGeneralized linear model02 engineering and technologyPoisson distributionDANTZIG SELECTOR01 natural sciencesCross-validationHigh-dimensional inferenceTheoretical Computer Science010104 statistics & probabilitysymbols.namesakeExponential familyLEAST ANGLE REGRESSION0202 electrical engineering electronic engineering information engineeringApplied mathematicsStatistics::Methodology0101 mathematicsCROSS-VALIDATIONMathematicsLeast-angle regressionLinear model020206 networking & telecommunicationsProbability and statisticsVARIABLE SELECTIONEfficient estimatorPredictor-corrector algorithmComputational Theory and MathematicsDispersion paremeterLINEAR-MODELSsymbolsSHRINKAGEStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaStatistics and Computing

researchProduct

Splitting the dynamics of large biochemical interaction networks

2003

This article is inscribed in the general motivation of understanding the dynamics on biochemical networks including metabolic and genetic interactions. Our approach is continuous modeling by differential equations. We address the problem of the huge size of those systems. We present a mathematical tool for reducing the size of the model, master-slave synchronization, and fit it to the biochemical context.

Statistics and ProbabilityMaster slave synchronizationModularity (networks)Theoretical computer scienceGeneral Immunology and MicrobiologyDifferential equationSystems BiologyQuantitative Biology::Molecular NetworksApplied MathematicsSystems biologyDynamics (mechanics)Context (language use)General MedicineBiologyBioinformaticsModels BiologicalGeneral Biochemistry Genetics and Molecular BiologyCell Physiological PhenomenaGene Expression RegulationModeling and SimulationSynchronization (computer science)AnimalsGeneral Agricultural and Biological SciencesAlgorithmsJournal of Theoretical Biology

researchProduct