Search results for "Cluster Analysis"

Showing 10 of 848 documents

Immune networks: Multi-tasking capabilities at medium load

2013

Associative network models featuring multi-tasking properties have been introduced recently and studied in the low load regime, where the number $P$ of simultaneously retrievable patterns scales with the number $N$ of nodes as $P\sim \log N$. In addition to their relevance in artificial intelligence, these models are increasingly important in immunology, where stored patterns represent strategies to fight pathogens and nodes represent lymphocyte clones. They allow us to understand the crucial ability of the immune system to respond simultaneously to multiple distinct antigen invasions. Here we develop further the statistical mechanical analysis of such systems, by studying the medium load r…

Statistics and Probability; Modularity (networks); Theoretical computer science; Degree (graph theory); Associative network; Computer science; General Physics and Astronomy; FOS: Physical sciences; Statistical and Nonlinear Physics; Disordered Systems and Neural Networks (cond-mat.dis-nn); Condensed Matter - Disordered Systems and Neural Networks; Modeling and Simulation; FOS: Biological sciences; Cell Behavior (q-bio.CB); Human multitasking; Quantitative Biology - Cell Behavior; Relevance (information retrieval); Cluster analysis; Immune Network Statistical Mechanics Hopfield model Parallel Retrieval; Mathematical Physics

researchProduct

ConvergenceClubs: A Package for Performing the Phillips and Sul's Club Convergence Clustering Procedure

2019

This paper introduces the package ConvergenceClubs, which implements functions to perform the Phillips and Sul (2007, 2009) club convergence clustering procedure in a simple and reproducible manner. The approach proposed by Phillips and Sul to analyse the convergence patterns of groups of economies is formulated as a nonlinear time-varying factor model that allows for different time paths as well as individual heterogeneity. Unlike other approaches in which economies are grouped a priori, it also allows the endogenous determination of convergence clubs. The algorithm, usage, and implementation details are discussed.

Statistics and Probability; Numerical Analysis; Mathematical optimization; Convergence Clubs; Economics; Club; Convergence (relationship); Statistics Probability and Uncertainty; Cluster analysis

researchProduct

Degree stability of a minimum spanning tree of price return and volatility

2002

We investigate the time series of the degree of minimum spanning trees obtained by using a correlation-based clustering procedure starting from (i) asset return and (ii) volatility time series. The minimum spanning tree is obtained at different times by computing the correlation among time series over a time window of fixed length $T$. We find that the minimum spanning tree of asset returns is characterized by stock degree values that are more stable in time than those obtained from a minimum spanning tree computed from volatility time series. Our analysis also shows that the degree of stocks has very slow dynamics, with a time scale of several years in both cases.

Statistics and Probability; Physics - Physics and Society; FOS: Physical sciences; Physics and Society (physics.soc-ph); Minimum spanning tree; FOS: Economics and business; Time windows; Statistics; Mathematical Physics; Cluster analysis; Stock (geology); Condensed Matter - Statistical Mechanics; Mathematics; Spanning tree; Statistical Finance (q-fin.ST); Statistical Mechanics (cond-mat.stat-mech); Econophysics; Quantitative Finance - Statistical Finance; Statistical and Nonlinear Physics; Asset return; Condensed Matter Physics; Settore FIS/07 - Fisica Applicata(Beni Culturali Ambientali Biol.e Medicin); Volatility; Correlation-based clustering; Price return; Volatility (finance)

researchProduct
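
The procedure summarized in the abstract above (correlations over a window of fixed length $T$, a minimum spanning tree, then the degree of each stock) can be sketched roughly as follows. This is an illustration only, not the authors' code. The distance transform d = sqrt(2(1 - rho)), the use of Prim's algorithm, and the names `pearson`, `mst_degrees`, and `rolling_mst_degrees` are all assumptions made here; the abstract does not specify any of them.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (non-constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / math.sqrt(vx * vy)


def mst_degrees(series):
    """Degree of each series in the minimum spanning tree of the
    correlation-based distance matrix (dense Prim's algorithm)."""
    n = len(series)
    dist = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            rho = pearson(series[i], series[j])
            # Assumed distance: d = sqrt(2 * (1 - rho)); clamp rounding noise.
            dist[i][j] = dist[j][i] = math.sqrt(max(0.0, 2.0 * (1.0 - rho)))

    in_tree = [False] * n
    best = [math.inf] * n      # cheapest known edge into the tree
    parent = [-1] * n          # tree endpoint of that edge
    best[0] = 0.0
    degree = [0] * n
    for _ in range(n):
        u = min((k for k in range(n) if not in_tree[k]), key=lambda k: best[k])
        in_tree[u] = True
        if parent[u] >= 0:     # every node except the root adds one edge
            degree[u] += 1
            degree[parent[u]] += 1
        for v in range(n):
            if not in_tree[v] and dist[u][v] < best[v]:
                best[v] = dist[u][v]
                parent[v] = u
    return degree


def rolling_mst_degrees(series, window, step=1):
    """Degree vectors for consecutive windows of fixed length `window`."""
    length = len(series[0])
    return [
        mst_degrees([s[t:t + window] for s in series])
        for t in range(0, length - window + 1, step)
    ]
```

Comparing the degree vectors returned by `rolling_mst_degrees` across windows gives one simple way to look at degree stability over time. A series that is constant within a window has zero variance and would need separate handling.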

Iterative Cluster Analysis of Protein Interaction Data

2004

Motivation: Generation of fast tools of hierarchical clustering to be applied when distances among elements of a set are constrained, causing frequent distance ties, as happens in protein interaction data. Results: We present in this work the program UVCLUSTER, which iteratively explores distance datasets using hierarchical clustering. Once the user selects a group of proteins, UVCLUSTER converts the set of primary distances among them (i.e. the minimum number of steps, or interactions, required to connect two proteins) into secondary distances that measure the strength of the connection between each pair of proteins when the interactions for all the proteins in the group are consid…

Statistics and Probability; Saccharomyces cerevisiae Proteins; Computer science; computer.software_genre; Biochemistry; Interactome; Pattern Recognition Automated; Set (abstract data type); Protein Interaction Mapping; Cluster (physics); Cluster Analysis; Cluster analysis; Molecular Biology; Cytoskeleton; Measure (data warehouse); Gene Expression Profiling; Proteins; Actins; Computer Science Applications; Hierarchical clustering; Gene expression profiling; Computational Mathematics; Computational Theory and Mathematics; Pattern recognition (psychology); Benchmark (computing); Data mining; computer; Algorithms; Software; Signal Transduction; Bioinformatics

researchProduct
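
The primary distance described in the abstract above (the minimum number of interactions required to connect two proteins) is a shortest-path length in the interaction graph. It can be sketched with a breadth-first search as below. This is not UVCLUSTER's code; the function name `primary_distances` and its input format are hypothetical. It also assumes that a connecting path may pass through proteins outside the selected group, which the truncated abstract does not settle. The conversion to secondary distances is not shown.

```python
from collections import deque


def primary_distances(interactions, proteins):
    """Shortest-path step counts between every ordered pair in `proteins`.

    `interactions` is an iterable of undirected (a, b) pairs. A value of
    None means the two proteins are not connected in the network.
    """
    adjacency = {}
    for a, b in interactions:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)

    result = {}
    for src in proteins:
        steps = {src: 0}               # BFS distance from src
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in adjacency.get(u, ()):
                if v not in steps:
                    steps[v] = steps[u] + 1
                    queue.append(v)
        for dst in proteins:
            if dst != src:
                result[(src, dst)] = steps.get(dst)
    return result
```

Because interactions take only small integer step counts, many pairs end up at the same distance, which is the source of the frequent distance ties mentioned in the abstract.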

Antibacterial Activity of Flavonoids Against Methicillin-resistant Staphylococcus aureus strains

2000

An experimental and theoretical study was performed on the anti-staphylococcal activity of 18 natural and synthetic flavonoids against methicillin-resistant Staphylococcus aureus strains. The analysed flavonoids belong to three well-differentiated structural patterns: chalcones, flavanones and flavones. The quantitative analysis of the anti-staphylococcal activity of the compounds was carried out by determining their percent inhibition degree. The hierarchical cluster analysis method was used to analyse the anti-MRSA activity of the compounds. With this methodology, the flavonoids were classified into four groups according to their anti-staphylococcal activity (high, sufficient, intermediat…

Statistics and Probability; Staphylococcus aureus; Chalcone; Stereochemistry; Flavonoid; Microbial Sensitivity Tests; medicine.disease_cause; Flavones; General Biochemistry Genetics and Molecular Biology; Structure-Activity Relationship; chemistry.chemical_compound; Chalcone; medicine; Animals; Cluster Analysis; Humans; Structure–activity relationship; Flavonoids; chemistry.chemical_classification; General Immunology and Microbiology; Applied Mathematics; General Medicine; Staphylococcal Infections; Methicillin-resistant Staphylococcus aureus; Anti-Bacterial Agents; chemistry; Biochemistry; Staphylococcus aureus; Modeling and Simulation; Methicillin Resistance; General Agricultural and Biological Sciences; Antibacterial activity; Quantitative analysis (chemistry); Journal of Theoretical Biology

researchProduct

Identification of clusters of companies in stock indices via Potts super-paramagnetic transitions

2000

The clustering of companies within a specific stock market index is studied by means of super-paramagnetic transitions of an appropriate q-state Potts model where the spins correspond to companies and the interactions are functions of the correlation coefficients determined from the time dependence of the companies' individual stock prices. The method is a generalization of the clustering algorithm by Domany et al. to the case of anti-ferromagnetic interactions corresponding to anti-correlations. For the Dow Jones Industrial Average, where no anti-correlations were observed in the investigated time period, the previous results obtained by different tools were well reproduced. For the Standa…

Statistics and Probability; Statistical Mechanics (cond-mat.stat-mech); Spins; FOS: Physical sciences; Condensed Matter Physics; Stock market index; Paramagnetism; Cluster (physics); Statistical physics; Cluster analysis; Stock (geology); Condensed Matter - Statistical Mechanics; Potts model; Mathematics

researchProduct

Clusters of effects curves in quantile regression models

2018

In this paper, we propose a new method for finding similarity of effects based on quantile regression models. Clustering of effects curves (CEC) techniques are applied to quantile regression coefficients, which are one-to-one functions of the order of the quantile. We adopt the quantile regression coefficients modeling (QRCM) framework to describe the functional form of the coefficient functions by means of parametric models. The proposed method can be utilized to cluster the effect of covariates with a univariate response variable, or to cluster a multivariate outcome. We report simulation results, comparing our approach with the existing techniques. The idea of combining CEC with QRCM per…

Statistics and Probability; Statistics::Theory; Multivariate statistics; 05 social sciences; Univariate; Functional data analysis; 01 natural sciences; Quantile regression; Quantile regression coefficients modeling Multivariate analysis Functional data analysis Curves clustering Variable selection; 010104 statistics & probability; Computational Mathematics; 0502 economics and business; Parametric model; Covariate; Statistics::Methodology; Applied mathematics; 0101 mathematics; Statistics Probability and Uncertainty; Cluster analysis; Settore SECS-S/01 - Statistica; 050205 econometrics; Mathematics; Quantile

researchProduct

Testing for local structure in spatiotemporal point pattern data

2017

The detection of clustering structure in a point pattern is one of the main focuses of attention in spatiotemporal data mining. Indeed, statistical tools for clustering detection and identification of individual events belonging to clusters are welcome in epidemiology and seismology. Local second-order characteristics provide information on how an event relates to nearby events. In this work, we extend local indicators of spatial association (known as LISA functions) to the spatiotemporal context (which will be then called LISTA functions). These functions are then used to build local tests of clustering to analyse differences in local spatiotemporal structures. We present a simulation stud…

Statistics and Probability; Structure (mathematical logic); 010504 meteorology & atmospheric sciences; Event (computing); Ecological Modeling; Association (object-oriented programming); Context (language use); computer.software_genre; 01 natural sciences; 010104 statistics & probability; Identification (information); Point (geometry); Data mining; 0101 mathematics; Cluster analysis; computer; 0105 earth and related environmental sciences; Statistical hypothesis testing; Mathematics; Environmetrics

researchProduct

Sample size in cluster-randomized trials with time to event as the primary endpoint

2011

In cluster-randomized trials, groups of individuals (clusters) are randomized to the treatments or interventions to be compared. In many of those trials, the primary objective is to compare the time for an event to occur between randomized groups, and the shared frailty model fits clustered time-to-event data well. Members of the same cluster tend to be more similar than members of different clusters, causing correlations. As correlations affect the power of a trial to detect intervention effects, the clustered design has to be considered in planning the sample size. In this publication, we derive a sample size formula for clustered time-to-event data with constant marginal baseline hazards…

Statistics and Probability; Time Factors; Endpoint Determination; Substance-Related Disorders; Epidemiology; Psychological intervention; Biostatistics; Time-to-Treatment; law.invention; Correlation; Random Allocation; Randomized controlled trial; law; Statistics; Clinical endpoint; Econometrics; Cluster Analysis; Humans; Poisson Distribution; Baseline (configuration management); Randomized Controlled Trials as Topic; Mathematics; Event (probability theory); Likelihood Functions; Models Statistical; Term (time); Sample size determination; Sample Size; Regression Analysis; Substance Abuse Treatment Centers; Statistics in Medicine

researchProduct

RabbitMash: accelerating hash-based genome analysis on modern multi-core architectures

2020

Motivation: Mash is a popular hash-based genome analysis toolkit with applications to important downstream analysis tasks such as clustering and assembly. However, Mash is currently not able to fully exploit the capabilities of modern multi-core architectures, which in turn leads to high runtimes for large-scale genomic datasets. Results: We present RabbitMash, an efficient, highly optimized implementation of Mash which can take full advantage of modern hardware including multi-threading, vectorization and fast I/O. We show that our approach achieves speedups of at least 1.3, 9.8, 8.5 and 4.4 compared to Mash for the operations sketch, dist, triangle and screen, respectively. Furtherm…

Statistics and Probability; Workstation; Exploit; Computer science; Hash function; Parallel computing; Biochemistry; law.invention; 03 medical and health sciences; Software; law; Cluster analysis; Molecular Biology; 030304 developmental biology; 0303 health sciences; Multi-core processor; Genome; Computers; business.industry; 030302 biochemistry & molecular biology; Genomics; Sketch; Computer Science Applications; Computational Mathematics; Computational Theory and Mathematics; business; Algorithms; Software; Bioinformatics

researchProduct