Search results for " Computer Science"

showing 10 items of 3983 documents

Study Design in Causal Models

2014

The causal assumptions, the study design and the data are the elements required for scientific inference in empirical research. The research is adequately communicated only if all of these elements and their relations are described precisely. Causal models with design describe the study design and the missing-data mechanism together with the causal structure and allow the direct application of causal calculus in the estimation of the causal effects. The flow of the study is visualized by ordering the nodes of the causal diagram in two dimensions by their causal order and the time of the observation. Conclusions on whether a causal or observational relationship can be estimated from the coll…

Statistics and ProbabilityEmpirical researchTheoretical computer scienceGraph (abstract data type)Graphical modelStatistics Probability and UncertaintyCausal structureMissing dataCausalityStructural equation modelingCausal modelMathematicsScandinavian Journal of Statistics
researchProduct

A Software Tool for the Exponential Power Distribution: The normalp Package

2005

In this paper we present the normalp package, a package for the statistical environment R that has a set of tools for dealing with the exponential power distribution. In this package there are functions to compute the density function, the distribution function and the quantiles from an exponential power distribution and to generate pseudo-random numbers from the same distribution. Moreover, methods concerning the estimation of the distribution parameters are described and implemented. It is also possible to estimate linear regression models when we assume the random errors distributed according to an exponential power distribution. A set of functions is designed to perform simulation studi…

Statistics and ProbabilityExponential distributionTheoretical computer scienceComputer scienceAsymptotic distributionDistribution fittingLaplace distributionExponential familyGamma distributionStatistics Probability and UncertaintyNatural exponential familyProbability integral transformAlgorithmlcsh:Statisticslcsh:HA1-4737exponential power distribution R estimation linear regressionSoftwareJournal of Statistical Software
researchProduct

The conditional censored graphical lasso estimator

2020

© 2020, Springer Science+Business Media, LLC, part of Springer Nature. In many applied fields, such as genomics, different types of data are collected on the same system, and it is not uncommon that some of these datasets are subject to censoring as a result of the measurement technologies used, such as data generated by polymerase chain reactions and flow cytometer. When the overall objective is that of network inference, at possibly different levels of a system, information coming from different sources and/or different steps of the analysis can be integrated into one model with the use of conditional graphical models. In this paper, we develop a doubly penalized inferential procedure for…

Statistics and ProbabilityFOS: Computer and information sciencesComputer scienceGaussianInferenceData typeTheoretical Computer Sciencehigh-dimensional settingDatabase normalizationMethodology (stat.ME)symbols.namesakeLasso (statistics)Graphical modelConditional Gaussian graphical modelcensored graphical lassoStatistics - MethodologyHigh-dimensional settingconditional Gaussian graphical modelssparsityEstimatorCensoring (statistics)Censored graphical lassoComputational Theory and MathematicssymbolsCensored dataStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaSparsityAlgorithm
researchProduct

Comparative Evaluation of Community Detection Algorithms: A Topological Approach

2012

International audience; Community detection is one of the most active fields in complex networks analysis, due to its potential value in practical applications. Many works inspired by different paradigms are devoted to the development of algorithmic solutions allowing to reveal the network structure in such cohesive subgroups. Comparative studies reported in the literature usually rely on a performance measure considering the community structure as a partition (Rand Index, Normalized Mutual information, etc.). However, this type of comparison neglects the topological properties of the communities. In this article, we present a comprehensive comparative study of a representative set of commu…

Statistics and ProbabilityFOS: Computer and information sciencesPhysics - Physics and SocietyComputer science[INFO.INFO-OH]Computer Science [cs]/Other [cs.OH]Rand indexFOS: Physical sciences02 engineering and technologyPhysics and Society (physics.soc-ph)Topology01 natural sciencesMeasure (mathematics)010305 fluids & plasmasSet (abstract data type)Development (topology)0103 physical sciences0202 electrical engineering electronic engineering information engineeringEquivalence (measure theory)Random graphSocial and Information Networks (cs.SI)Computer Science - Social and Information NetworksStatistical and Nonlinear PhysicsNetwork dynamicsPartition (database)[ INFO.INFO-OH ] Computer Science [cs]/Other [cs.OH]020201 artificial intelligence & image processingStatistics Probability and Uncertainty
researchProduct

Identifying Causal Effects with the R Package causaleffect

2017

Do-calculus is concerned with estimating the interventional distribution of an action from the observed joint probability distribution of the variables in a given causal structure. All identifiable causal effects can be derived using the rules of do-calculus, but the rules themselves do not give any direct indication whether the effect in question is identifiable or not. Shpitser and Pearl constructed an algorithm for identifying joint interventional distributions in causal models, which contain unobserved variables and induce directed acyclic graphs. This algorithm can be seen as a repeated application of the rules of do-calculus and known properties of probabilities, and it ultimately eit…

Statistics and ProbabilityFOS: Computer and information sciencesTheoretical computer sciencecausalityDistribution (number theory)C-componentComputer sciencecausal model02 engineering and technologyCausal structureMethodology (stat.ME)03 medical and health sciences0302 clinical medicinedo-calculusJoint probability distribution0202 electrical engineering electronic engineering information engineering030212 general & internal medicineDAG; do-calculus; causality; causal model; identifiability; graph; C-component; hedge; d-separationlcsh:Statisticslcsh:HA1-4737Statistics - Methodologycomputer.programming_languageCausal modelta112DAGd-separationgraphhedgeidentifiabilityExpression (mathematics)PEARL (programming language)Action (philosophy)kausaliteetti020201 artificial intelligence & image processingStatistics Probability and UncertaintycomputerSoftware
researchProduct

Extended differential geometric LARS for high-dimensional GLMs with general dispersion parameter

2018

A large class of modeling and prediction problems involves outcomes that belong to an exponential family distribution. Generalized linear models (GLMs) are a standard way of dealing with such situations. Even in high-dimensional feature spaces GLMs can be extended to deal with such situations. Penalized inference approaches, such as the $$\ell _1$$ or SCAD, or extensions of least angle regression, such as dgLARS, have been proposed to deal with GLMs with high-dimensional feature spaces. Although the theory underlying these methods is in principle generic, the implementation has remained restricted to dispersion-free models, such as the Poisson and logistic regression models. The aim of this…

Statistics and ProbabilityGeneralized linear modelMathematical optimizationGeneralized linear modelsPredictor-€“corrector algorithmGeneralized linear model02 engineering and technologyPoisson distributionDANTZIG SELECTOR01 natural sciencesCross-validationHigh-dimensional inferenceTheoretical Computer Science010104 statistics & probabilitysymbols.namesakeExponential familyLEAST ANGLE REGRESSION0202 electrical engineering electronic engineering information engineeringApplied mathematicsStatistics::Methodology0101 mathematicsCROSS-VALIDATIONMathematicsLeast-angle regressionLinear model020206 networking & telecommunicationsProbability and statisticsVARIABLE SELECTIONEfficient estimatorPredictor-corrector algorithmComputational Theory and MathematicsDispersion paremeterLINEAR-MODELSsymbolsSHRINKAGEStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaStatistics and Computing
researchProduct

Splitting the dynamics of large biochemical interaction networks

2003

This article is inscribed in the general motivation of understanding the dynamics on biochemical networks including metabolic and genetic interactions. Our approach is continuous modeling by differential equations. We address the problem of the huge size of those systems. We present a mathematical tool for reducing the size of the model, master-slave synchronization, and fit it to the biochemical context.

Statistics and ProbabilityMaster slave synchronizationModularity (networks)Theoretical computer scienceGeneral Immunology and MicrobiologyDifferential equationSystems BiologyQuantitative Biology::Molecular NetworksApplied MathematicsSystems biologyDynamics (mechanics)Context (language use)General MedicineBiologyBioinformaticsModels BiologicalGeneral Biochemistry Genetics and Molecular BiologyCell Physiological PhenomenaGene Expression RegulationModeling and SimulationSynchronization (computer science)AnimalsGeneral Agricultural and Biological SciencesAlgorithmsJournal of Theoretical Biology
researchProduct

Immune networks: Multi-tasking capabilities at medium load

2013

Associative network models featuring multi-tasking properties have been introduced recently and studied in the low load regime, where the number $P$ of simultaneously retrievable patterns scales with the number $N$ of nodes as $P\sim \log N$. In addition to their relevance in artificial intelligence, these models are increasingly important in immunology, where stored patterns represent strategies to fight pathogens and nodes represent lymphocyte clones. They allow us to understand the crucial ability of the immune system to respond simultaneously to multiple distinct antigen invasions. Here we develop further the statistical mechanical analysis of such systems, by studying the medium load r…

Statistics and ProbabilityModularity (networks)Theoretical computer scienceDegree (graph theory)Associative networkComputer scienceGeneral Physics and AstronomyFOS: Physical sciencesStatistical and Nonlinear PhysicsDisordered Systems and Neural Networks (cond-mat.dis-nn)Condensed Matter - Disordered Systems and Neural NetworksModeling and SimulationFOS: Biological sciencesCell Behavior (q-bio.CB)Human multitaskingQuantitative Biology - Cell BehaviorRelevance (information retrieval)Cluster analysisImmune Network Statistical Mechanics Hopfield model Parallel RetrievalMathematical Physics
researchProduct

The “ThreePlusOne” Likelihood-Based Test Statistics: Unified Geometrical and Graphical Interpretations

2014

The presentation of the well known Likelihood Ratio, Wald and Score test statistics in textbooks appears to lack a unified graphical and geometrical interpretation. We present two simple graphical representations on a common scale for these three test statistics, and also the recently proposed Gradient test statistic. These unified graphical displays may favour better understanding of the geometrical meaning of the likelihood based statistics and provide useful insights into their connections.

Statistics and ProbabilityScore testInterpretation (logic)Theoretical computer scienceScale (ratio)General MathematicsLikelihood ratio Wald Score Gradient statistic geometrical interpretation graphical displaySimple (abstract algebra)Likelihood-ratio testStatisticsStatistical inferenceTest statisticStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaStatistical hypothesis testingMathematicsThe American Statistician
researchProduct

Long read alignment based on maximal exact match seeds

2012

Abstract Motivation: The explosive growth of next-generation sequencing datasets poses a challenge to the mapping of reads to reference genomes in terms of alignment quality and execution speed. With the continuing progress of high-throughput sequencing technologies, read length is constantly increasing and many existing aligners are becoming inefficient as generated reads grow larger. Results: We present CUSHAW2, a parallelized, accurate, and memory-efficient long read aligner. Our aligner is based on the seed-and-extend approach and uses maximal exact matches as seeds to find gapped alignments. We have evaluated and compared CUSHAW2 to the three other long read aligners BWA-SW, Bowtie2 an…

Statistics and ProbabilitySequencing and Sequence AnalysisTheoretical computer scienceGenomicsBiologyBiochemistrySoftwareHumansMolecular BiologyAlignment-free sequence analysisExact matchSupplementary dataGenome Humanbusiness.industryChromosome MappingHigh-Throughput Nucleotide SequencingGenomicsSequence Analysis DNAOriginal PapersComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsComputer engineeringScalabilitybusinessSequence AlignmentAlgorithmsSoftwareBioinformatics
researchProduct