Search results for "computational"

showing 10 items of 5884 documents

ArtiFuse—computational validation of fusion gene detection tools without relying on simulated reads

2019

Abstract Motivation Gene fusions are an important class of transcriptional variants that can influence cancer development and can be predicted from RNA sequencing (RNA-seq) data by multiple existing tools. However, the real-world performance of these tools is unclear due to the lack of known positive and negative events, especially with regard to fusion genes in individual samples. Often simulated reads are used, but these cannot account for all technical biases in RNA-seq data generated from real samples. Results Here, we present ArtiFuse, a novel approach that simulates fusion genes by sequence modification to the genomic reference, and therefore, can be applied to any RNA-seq dataset wit…

Statistics and ProbabilitySource codeSequence analysisComputer sciencemedia_common.quotation_subjectValue (computer science)Genomicscomputer.software_genreBiochemistryFusion gene03 medical and health sciences0302 clinical medicineSoftwareMolecular BiologyGene030304 developmental biologymedia_common0303 health sciencesSequence Analysis RNAbusiness.industryHigh-Throughput Nucleotide SequencingRNAGenomicsComputer Science ApplicationsComputational MathematicsComputational Theory and Mathematics030220 oncology & carcinogenesisBenchmark (computing)RNAData miningGene FusionbusinesscomputerSoftwareBioinformatics
researchProduct

Geometric Entropies of Mixing (EOM)

2005

Trigonometric and trigonometric-algebraic entropies are introduced. Regularity increases the entropy and the maximal entropy is shown to result when a regular $n$-gon is inscribed in a circle. A regular $n$-gon circumscribing a circle gives the largest entropy reduction, or the smallest change in entropy from the state of maximum entropy which occurs in the asymptotic infinite $n$ limit. EOM are shown to correspond to minimum perimeter and maximum area in the theory of convex bodies, and can be used in the prediction of new inequalities for convex sets. These expressions are shown to be related to the phase functions obtained from the WKB approximation for Bessel and Hermite functions.

Statistics and ProbabilityStatistical Mechanics (cond-mat.stat-mech)Principle of maximum entropyConfiguration entropyMathematical analysisMaximum entropy thermodynamicsMin entropyFOS: Physical sciencesStatistical and Nonlinear PhysicsComputer Science::Computational GeometryQuantum relative entropyMaximum entropy probability distributionMathematics::Metric GeometryMathematical PhysicsEntropy rateJoint quantum entropyCondensed Matter - Statistical MechanicsMathematics
researchProduct

Selecting the tuning parameter in penalized Gaussian graphical models

2019

Penalized inference of Gaussian graphical models is a way to assess the conditional independence structure in multivariate problems. In this setting, the conditional independence structure, corresponding to a graph, is related to the choice of the tuning parameter, which determines the model complexity or degrees of freedom. There has been little research on the degrees of freedom for penalized Gaussian graphical models. In this paper, we propose an estimator of the degrees of freedom in $$\ell _1$$ -penalized Gaussian graphical models. Specifically, we derive an estimator inspired by the generalized information criterion and propose to use this estimator as the bias term for two informatio…

Statistics and ProbabilityStatistics::TheoryKullback–Leibler divergenceKullback-Leibler divergenceComputer scienceGaussianInformation Criteria010103 numerical & computational mathematicsModel complexityModel selection01 natural sciencesTheoretical Computer Science010104 statistics & probabilitysymbols.namesakeStatistics::Machine LearningGeneralized information criterionEntropy (information theory)Statistics::MethodologyGraphical model0101 mathematicsPenalized Likelihood Kullback-Leibler Divergence Model Complexity Model Selection Generalized Information Criterion.Model selectionEstimatorStatistics::ComputationComputational Theory and MathematicsConditional independencesymbolsPenalized likelihoodStatistics Probability and UncertaintySettore SECS-S/01 - StatisticaAlgorithmStatistics and Computing
researchProduct

Clusters of effects curves in quantile regression models

2018

In this paper, we propose a new method for finding similarity of effects based on quantile regression models. Clustering of effects curves (CEC) techniques are applied to quantile regression coefficients, which are one-to-one functions of the order of the quantile. We adopt the quantile regression coefficients modeling (QRCM) framework to describe the functional form of the coefficient functions by means of parametric models. The proposed method can be utilized to cluster the effect of covariates with a univariate response variable, or to cluster a multivariate outcome. We report simulation results, comparing our approach with the existing techniques. The idea of combining CEC with QRCM per…

Statistics and ProbabilityStatistics::TheoryMultivariate statistics05 social sciencesUnivariateFunctional data analysis01 natural sciencesQuantile regressionQuantile regression coefficients modeling Multivariate analysis Functional data analysis Curves clustering Variable selection010104 statistics & probabilityComputational Mathematics0502 economics and businessParametric modelCovariateStatistics::MethodologyApplied mathematics0101 mathematicsStatistics Probability and UncertaintyCluster analysisSettore SECS-S/01 - Statistica050205 econometrics MathematicsQuantile
researchProduct

Design-based estimation for geometric quantiles with application to outlier detection

2010

Geometric quantiles are investigated using data collected from a complex survey. Geometric quantiles are an extension of univariate quantiles in a multivariate set-up that uses the geometry of multivariate data clouds. A very important application of geometric quantiles is the detection of outliers in multivariate data by means of quantile contours. A design-based estimator of geometric quantiles is constructed and used to compute quantile contours in order to detect outliers in both multivariate data and survey sampling set-ups. An algorithm for computing geometric quantile estimates is also developed. Under broad assumptions, the asymptotic variance of the quantile estimator is derived an…

Statistics and ProbabilityStatistics::TheoryTheoryofComputation_COMPUTATIONBYABSTRACTDEVICESStatistics::ApplicationsComputingMethodologies_SIMULATIONANDMODELINGApplied MathematicsMathematicsofComputing_NUMERICALANALYSISUnivariateInformationSystems_DATABASEMANAGEMENTEstimatorStatistics::ComputationQuantile regressionHorvitz–Thompson estimatorComputational MathematicsDelta methodComputational Theory and MathematicsTheoryofComputation_ANALYSISOFALGORITHMSANDPROBLEMCOMPLEXITYOutlierConsistent estimatorStatisticsStatistics::MethodologyMathematicsQuantileComputational Statistics & Data Analysis
researchProduct

NeoFox: annotating neoantigen candidates with neoantigen features

2020

Abstract Summary The detection and prediction of true neoantigens is of great importance for the field of cancer immunotherapy. Wesearched the literature for proposed neoantigen features and integrated them into a toolbox called NEOantigen Feature toolbOX (NeoFox). NeoFox is an easy-to-use Python package that enables the annotation of neoantigen candidates with 16 neoantigen features. Availability and implementation NeoFox is freely available as an open source Python package released under the GNU General Public License (GPL) v3 license at https://github.com/TRON-Bioinformatics/neofox. Supplementary information Supplementary data are available at Bioinformatics online.

Statistics and ProbabilitySupplementary data0303 health sciencesInformation retrievalComputer science030302 biochemistry & molecular biologyPython (programming language)BiochemistryToolbox3. Good healthComputer Science Applications03 medical and health sciencesComputational MathematicsAnnotationOpen sourceComputational Theory and MathematicsMolecular Biologycomputer030304 developmental biologycomputer.programming_languageBioinformatics
researchProduct

IntelliPy: a GUI for analyzing IntelliCage data

2021

Abstract Summary The IntelliCage systems offer the possibility to conduct long-term behavioral experiments on mice in social groups without human intervention. Although this setup provides new findings, only about 150 studies with the IntelliCage system have been published in the last two decades, which is also caused by the challenging problems of processing and handling the large and heterogeneous amounts of captured data. This application note introduces the Python-GUI IntelliPy, especially designed for users not very experienced in using programming languages. IntelliPy allows users to quickly analyze the IntelliCage output in a user-friendly way, thus making the systems more accessible…

Statistics and ProbabilitySupplementary dataAcademicSubjects/SCI01060Computer scienceSystems BiologyMEDLINEBiochemistryApplications NotesComputer Science ApplicationsSocial groupWorld Wide WebComputational MathematicsComputational Theory and MathematicsIntervention (counseling)Molecular BiologyBioinformatics
researchProduct

LipiDisease: associate lipids to diseases using literature mining

2021

Abstract Summary Lipids exhibit an essential role in cellular assembly and signaling. Dysregulation of these functions has been linked with many complications including obesity, diabetes, metabolic disorders, cancer and more. Investigating lipid profiles in such conditions can provide insights into cellular functions and possible interventions. Hence the field of lipidomics is expanding in recent years. Even though the role of individual lipids in diseases has been investigated, there is no resource to perform disease enrichment analysis considering the cumulative association of a lipid set. To address this, we have implemented the LipiDisease web server. The tool analyzes millions of recor…

Statistics and ProbabilitySupplementary dataWeb serverAcademicSubjects/SCI01060Computer scienceCellular functionsComputational biologyDiseasecomputer.software_genreApplications NotesBiochemistryField (computer science)Computer Science ApplicationsComputational MathematicsComputational Theory and MathematicsLipidomicsData and Text MiningMolecular BiologycomputerBioinformatics
researchProduct

RNA-Seq Atlas—a reference database for gene expression profiling in normal tissue by next-generation sequencing

2012

Abstract Motivation: Next-generation sequencing technology enables an entirely new perspective for clinical research and will speed up personalized medicine. In contrast to microarray-based approaches, RNA-Seq analysis provides a much more comprehensive and unbiased view of gene expression. Although the perspective is clear and the long-term success of this new technology obvious, bioinformatics resources making these data easily available especially to the biomedical research community are still evolving. Results: We have generated RNA-Seq Atlas, a web-based repository of RNA-Seq gene expression profiles and query tools. The website offers open and easy access to RNA-Seq gene expression pr…

Statistics and ProbabilitySystems biologyRNA-SeqComputational biologyBiologycomputer.software_genreBiochemistryNeoplasmsGene expressionHumansMicroarray databasesMolecular BiologyGeneOligonucleotide Array Sequence AnalysisInternetSequence Analysis RNAbusiness.industryGene Expression ProfilingHigh-Throughput Nucleotide SequencingComputer Science ApplicationsGene expression profilingComputational MathematicsComputational Theory and MathematicsGene chip analysisData miningPersonalized medicineDatabases Nucleic AcidbusinesscomputerSoftwareBioinformatics
researchProduct

Structure Learning in Nested Effects Models

2007

Nested Effects Models (NEMs) are a class of graphical models introduced to analyze the results of gene perturbation screens. NEMs explore noisy subset relations between the high-dimensional outputs of phenotyping studies, e.g., the effects showing in gene expression profiles or as morphological features of the perturbed cell. In this paper we expand the statistical basis of NEMs in four directions. First, we derive a new formula for the likelihood function of a NEM, which generalizes previous results for binary data. Second, we prove model identifiability under mild assumptions. Third, we show that the new formulation of the likelihood allows efficiency in traversing model space. Fourth, we…

Statistics and ProbabilityTraverseComputer scienceMolecular Networks (q-bio.MN)Genes MHC Class IIPerturbation (astronomy)Genes InsectFeature selectionQuantitative Biology - Quantitative Methods03 medical and health sciences0302 clinical medicineGeneticsAnimalsheterocyclic compoundsQuantitative Biology - Molecular NetworksGraphical modelMolecular BiologyQuantitative Methods (q-bio.QM)Oligonucleotide Array Sequence Analysis030304 developmental biologyLikelihood Functions0303 health sciencesNanoelectromechanical systemsModels StatisticalModels GeneticGene Expression ProfilingGenomicsComputational MathematicsDrosophila melanogasterPhenotypeFOS: Biological sciencesBinary dataIdentifiabilityRNA InterferenceLikelihood functionAlgorithmAlgorithms030217 neurology & neurosurgery
researchProduct