Search results for "cluster analysis."

showing 10 items of 805 documents

Testing for local structure in spatiotemporal point pattern data

2017

The detection of clustering structure in a point pattern is one of the main focuses of attention in spatiotemporal data mining. Indeed, statistical tools for clustering detection and identification of individual events belonging to clusters are welcome in epidemiology and seismology. Local second-order characteristics provide information on how an event relates to nearby events. In this work, we extend local indicators of spatial association (known as LISA functions) to the spatiotemporal context (which will be then called LISTA functions). These functions are then used to build local tests of clustering to analyse differences in local spatiotemporal structures. We present a simulation stud…

Statistics and ProbabilityStructure (mathematical logic)010504 meteorology & atmospheric sciencesEvent (computing)Ecological ModelingAssociation (object-oriented programming)Context (language use)computer.software_genre01 natural sciences010104 statistics & probabilityIdentification (information)Point (geometry)Data mining0101 mathematicsCluster analysiscomputer0105 earth and related environmental sciencesStatistical hypothesis testingMathematicsEnvironmetrics
researchProduct

Sample size in cluster-randomized trials with time to event as the primary endpoint

2011

In cluster-randomized trials, groups of individuals (clusters) are randomized to the treatments or interventions to be compared. In many of those trials, the primary objective is to compare the time for an event to occur between randomized groups, and the shared frailty model well fits clustered time-to-event data. Members of the same cluster tend to be more similar than members of different clusters, causing correlations. As correlations affect the power of a trial to detect intervention effects, the clustered design has to be considered in planning the sample size. In this publication, we derive a sample size formula for clustered time-to-event data with constant marginal baseline hazards…

Statistics and ProbabilityTime FactorsEndpoint DeterminationSubstance-Related DisordersEpidemiologyPsychological interventionBiostatisticsTime-to-Treatmentlaw.inventionCorrelationRandom AllocationRandomized controlled triallawStatisticsClinical endpointEconometricsCluster AnalysisHumansPoisson DistributionBaseline (configuration management)Randomized Controlled Trials as TopicMathematicsEvent (probability theory)Likelihood FunctionsModels StatisticalTerm (time)Sample size determinationSample SizeRegression AnalysisSubstance Abuse Treatment CentersStatistics in Medicine
researchProduct

RabbitMash: accelerating hash-based genome analysis on modern multi-core architectures

2020

Abstract Motivation Mash is a popular hash-based genome analysis toolkit with applications to important downstream analyses tasks such as clustering and assembly. However, Mash is currently not able to fully exploit the capabilities of modern multi-core architectures, which in turn leads to high runtimes for large-scale genomic datasets. Results We present RabbitMash, an efficient highly optimized implementation of Mash which can take full advantage of modern hardware including multi-threading, vectorization and fast I/O. We show that our approach achieves speedups of at least 1.3, 9.8, 8.5 and 4.4 compared to Mash for the operations sketch, dist, triangle and screen, respectively. Furtherm…

Statistics and ProbabilityWorkstationExploitComputer scienceHash functionParallel computingBiochemistrylaw.invention03 medical and health sciencesSoftwarelawCluster analysisMolecular Biology030304 developmental biology0303 health sciencesMulti-core processorGenomeComputersbusiness.industry030302 biochemistry & molecular biologyGenomicsSketchComputer Science ApplicationsComputational MathematicsComputational Theory and MathematicsbusinessAlgorithmsSoftwareBioinformatics
researchProduct

Assessing local differences between the spatio-temporal second-order structure of two point patterns occurring on the same linear network

2021

Abstract We introduce Local Indicators of Spatio-Temporal Association (LISTA) functions on linear networks and use them to build a statistical test for local second-order structure. This allows to identify differences in the spatio-temporal clustering behaviour of two point patterns, a point pattern of interest and a background one, both occurring on the same linear network. We assess the performance of the testing procedure for local second-order structure through simulation studies under a variety of scenarios that also account for different generating point processes. We show that the proposed local test is able to correctly identify the spatio-temporal difference in the local second-ord…

Statistics and Probabilitysecond-order characteristicsComputer scienceAssociation (object-oriented programming)Spatio-temporal point patternsStructure (category theory)Management Monitoring Policy and LawPoint processLocal propertielocal propertieshypothesis testinglocal indicators of spatio-temporal associationLinear networkPoint (geometry)Computers in Earth SciencesCluster analysisStatistical hypothesis testingbusiness.industrySecond-order characteristicPattern recognitionPower (physics)Linear networkHypothesis testingLocal Indicators of Spatio-Temporal Associationlinear networksspatio-temporal point patternsArtificial intelligencebusinessSettore SECS-S/01 - Statistica
researchProduct

Animal rennets as sources of dairy lactic acid bacteria

2014

ABSTRACT The microbial composition of artisan and industrial animal rennet pastes was studied by using both culture-dependent and -independent approaches. Pyrosequencing targeting the 16S rRNA gene allowed to identify 361 operational taxonomic units (OTUs) to the genus/species level. Among lactic acid bacteria (LAB), Streptococcus thermophilus and some lactobacilli, mainly Lactobacillus crispatus and Lactobacillus reuteri , were the most abundant species, with differences among the samples. Twelve groups of microorganisms were targeted by viable plate counts revealing a dominance of mesophilic cocci. All rennets were able to acidify ultrahigh-temperature-processed (UHT) milk as shown by pH …

Streptococcus thermophilusColony CountColony Count MicrobialApplied Microbiology and BiotechnologyAcidification; Animal rennet pastes; Autolysis; Lactic acid bacteria; Microbial ecology; PyrosequencingMicrobial ecologyMicrobialCheeseRNA Ribosomal 16SLactobacillusEnterococcus casseliflavusLactic acid bacteriaCluster AnalysisPhylogenyEcologybiologyLactobacillus crispatusBacterialAnimal rennet pastefood and beveragesPyrosequencingHydrogen-Ion ConcentrationAutolysiBiotaAnimals; Cluster Analysis; Colony Count Microbial; DNA Bacterial; DNA Ribosomal; Enterococcus; Hydrogen-Ion Concentration; Lactobacillus; Microbial Viability; Milk; Molecular Sequence Data; Phylogeny; RNA Ribosomal 16S; Sequence Analysis DNA; Biota; ChymosinMilkSequence AnalysisChymosinBiotechnologyDNA Bacterial16SMolecular Sequence DataDNA RibosomalEnterococcus faecalisMicrobiologyAcidificationAnimalsRibosomalMicrobial ViabilitySequence Analysis DNADNAbiology.organism_classificationLactobacillus reuteriLactobacillusEnterococcusFood MicrobiologyRNAMetagenomicsEnterococcusFood ScienceEnterococcus faeciumSettore AGR/16 - Microbiologia Agraria
researchProduct

Complex Detection in Protein-Protein Interaction Networks: A Compact Overview for Researchers and Practitioners

2012

The availability of large volumes of protein-protein interaction data has allowed the study of biological networks to unveil the complex structure and organization in the cell. It has been recognized by biologists that proteins interacting with each other often participate in the same biological processes, and that protein modules may be often associated with specific biological functions. Thus the detection of protein complexes is an important research problem in systems biology. In this review, recent graph-based approaches to clustering protein interaction networks are described and classified with respect to common peculiarities. The goal is that of providing a useful guide and referenc…

Structure (mathematical logic)Computer scienceSystems biologyCellData ScienceNanotechnologyComputational biologyProtein protein interaction networkBioinformatics network analysismedicine.anatomical_structuremedicineGraph (abstract data type)Lecture Notes in Computer ScienceCluster analysisProtein modulesBiological network
researchProduct

Semi-supervised Hyperspectral Image Classification with Graphs

2006

This paper presents a semi-supervised graph-based method for the classification of hyperspectral images. The method is designed to exploit the spatial/contextual information in the im- ages through composite kernels. The proposed method produces smoother classifications with respect to the intrinsic structure collectively revealed by known labeled and unlabeled points. Good accuracy in high dimensional spaces and low number of labeled samples (ill-posed situations) are produced as compared to standard inductive support vector machines.

Structured support vector machineContextual image classificationbusiness.industryHyperspectral imagingPattern recognitionGraphRelevance vector machineSupport vector machineComputingMethodologies_PATTERNRECOGNITIONKernel (image processing)Artificial intelligencebusinessCluster analysisMathematics2006 IEEE International Symposium on Geoscience and Remote Sensing
researchProduct

A deep semantic segmentation-based algorithm to segment crops and weeds in agronomic color images

2022

Abstract In precision agriculture, the accurate segmentation of crops and weeds in agronomic images has always been the center of attention. Many methods have been proposed but still the clean and sharp segmentation of crops and weeds is a challenging issue for the images with a high presence of weeds. This work proposes a segmentation method based on the combination of semantic segmentation and K-means algorithms for the segmentation of crops and weeds in color images. Agronomic images of two different databases were used for the segmentation algorithms. Using the thresholding technique, everything except plants was removed from the images. Afterward, semantic segmentation was applied usin…

Subtractive colorComputer scienceComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONConfusion matrixForestryAquatic ScienceThresholdingAccurate segmentationComputer Science ApplicationsClassification rateAnimal Science and ZoologySegmentationPrecision agricultureCluster analysisAgronomy and Crop ScienceAlgorithmInformation Processing in Agriculture
researchProduct

A Student's t‐based density peaks clustering with superpixel segmentation (tDPCSS) method for image color clustering

2020

Superpixel segmentationComputer sciencebusiness.industryGeneral Chemical EngineeringHuman Factors and ErgonomicsPattern recognitionGeneral ChemistryArtificial intelligenceCluster analysisbusinessImage (mathematics)Color Research & Application
researchProduct

Compared regimes of NDVI and Rainfall in semi-arid regions of Africa

2006

International audience; Bi-monthly normalized difference vegetation index (NDVI) at an 8km spatial resolution from the advanced very high resolution radiometers (AVHRR) was used from 1981 to 1995 to analyse the vegetation response to rainfall supply in semi-arid regions of Africa. Within the 200-600 mm annual rainfall belt, for which the apparent NDVI response to rainfall was the strongest, three regions were selected which exhibited different patterns in their NDVI regimes and/or relationships with rainfall. The regions, located in western, southern and eastern Africa, were split into coherent sub-regions in terms of mean regime of photosynthetic activity through a cluster analysis. Overal…

SupplyrainfallevapotranspirationConcentration distribution[SDU.STU.CL] Sciences of the Universe [physics]/Earth Sciences/ClimatologyCluster analysisVegetation indexvegetationRainfall ratePlant cover[ SDE.MCG.CG ] Environmental Sciences/Global Changes/domain_sde.mcg.cgannual averagespatial resolutionphotosynthesisexhibits1995high resolution[SDE.MCG.CG] Environmental Sciences/Global Changes/domain_sde.mcg.cgStructureWater use efficiencyResponsePluviometrycorrelationAfricaGeneral Earth and Planetary SciencesSemi arid zone[ SDU.STU.CL ] Sciences of the Universe [physics]/Earth Sciences/ClimatologySouthern AfricaRainy season
researchProduct