Search results for "cluster analysis"

showing 10 items of 848 documents

Efficient and Accurate OTU Clustering with GPU-Based Sequence Alignment and Dynamic Dendrogram Cutting.

2015

De novo clustering is a popular technique to perform taxonomic profiling of a microbial community by grouping 16S rRNA amplicon reads into operational taxonomic units (OTUs). In this work, we introduce a new dendrogram-based OTU clustering pipeline called CRiSPy. The key idea used in CRiSPy to improve clustering accuracy is the application of an anomaly detection technique to obtain a dynamic distance cutoff instead of using the de facto value of 97 percent sequence similarity as in most existing OTU clustering pipelines. This technique works by detecting an abrupt change in the merging heights of a dendrogram. To produce the output dendrograms, CRiSPy employs the OTU hierarchical clusterin…

Computer science; Correlation clustering; Single-linkage clustering; Molecular Sequence Data; Machine learning; computer.software_genre; Pattern Recognition Automated; CURE data clustering algorithm; RNA Ribosomal 16S; Genetics; Computer Graphics; Cluster analysis; Base Sequence; business.industry; Applied Mathematics; Dendrogram; High-Throughput Nucleotide Sequencing; Pattern recognition; Signal Processing Computer-Assisted; Equipment Design; Hierarchical clustering; Equipment Failure Analysis; RNA Bacterial; Canopy clustering algorithm; Artificial intelligence; Hierarchical clustering of networks; business; computer; Sequence Alignment; Algorithms; Biotechnology; IEEE/ACM transactions on computational biology and bioinformatics
researchProduct
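
The dynamic cutoff described in the CRiSPy abstract above (detecting an abrupt change in the merging heights of a dendrogram) can be illustrated with a small sketch. This is not the CRiSPy implementation: the abstract does not specify its anomaly detection technique, so the sketch swaps in a simple largest-gap heuristic. The function names and the sample heights are hypothetical.

```python
def dynamic_cutoff(merge_heights):
    """Place the cut in the largest gap between consecutive merge heights.

    Assumption: a largest-gap rule stands in for the unspecified
    anomaly detection technique mentioned in the abstract.
    """
    heights = sorted(merge_heights)
    if len(heights) < 2:
        raise ValueError("need at least two merge heights")
    # Gap between each merge and the next one up the dendrogram.
    gaps = [b - a for a, b in zip(heights, heights[1:])]
    i = max(range(len(gaps)), key=gaps.__getitem__)
    # Cut halfway across the largest gap.
    return (heights[i] + heights[i + 1]) / 2


def count_clusters(merge_heights, cutoff):
    """Every merge above the cut is undone, which adds one cluster each."""
    return 1 + sum(1 for h in merge_heights if h > cutoff)


# Hypothetical merge heights (distances) for a dendrogram over 7 reads.
heights = [0.01, 0.02, 0.02, 0.03, 0.15, 0.18]
cut = dynamic_cutoff(heights)
n_clusters = count_clusters(heights, cut)
```

With these made-up heights the cut falls between 0.03 and 0.15, where a fixed cutoff of 0.03 (97 percent similarity) would sit right at the edge of the gap.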

Projection Clustering Unfolding: A New Algorithm for Clustering Individuals or Items in a Preference Matrix

2020

In the framework of preference rankings, the interest can lie in clustering individuals or items in order to reduce the complexity of the preference space for an easier interpretation of collected data. Recent years have seen a remarkable flowering of work on the use of decision trees for clustering preference vectors. Decision trees are useful and intuitive, but they are very unstable: small perturbations bring big changes. For this reason, more stable procedures may be needed to cluster ranking data. In this work, a Projection Clustering Unfolding (PCU) algorithm for preference data will be proposed in order to extract useful info…

Computer science; Decision tree; Projection pursuit; Preference data; Clustering rankings; Space (commercial competition); Preference; Matrix (mathematics); Ranking; Procrustes analysis; Settore SECS-S/01 - Statistica; Cluster analysis; Projection (set theory); Algorithm; Preference (economics); Subspace topology; Data Analysis and Applications 3
researchProduct

Exudates as Landmarks Identified through FCM Clustering in Retinal Images

2020

The aim of this work was to develop a method for the automatic identification of exudates, using an unsupervised clustering approach. The ability to classify each pixel as belonging to an eventual exudate, as a warning of disease, allows for the tracking of a patient’…

Computer science; Diabetic retinopathy; Exudates; Fuzzy C-means clustering; Morphological processing; Retinal landmarks; Segmentation; Fundus (eye); Fuzzy logic; lcsh:Technology; Field (computer science); 030218 nuclear medicine & medical imaging; lcsh:Chemistry; 03 medical and health sciences; 0302 clinical medicine; Fcm clustering; fuzzy C-means clustering; retinal landmarks; General Materials Science; Sensitivity (control systems); Cluster analysis; Instrumentation; lcsh:QH301-705.5; Fluid Flow and Transfer Processes; Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; Pixel; Settore INF/01 - Informatica; business.industry; lcsh:T; Process Chemistry and Technology; exudates; segmentation; General Engineering; Pattern recognition; lcsh:QC1-999; Computer Science Applications; diabetic retinopathy; ComputingMethodologies_PATTERNRECOGNITION; lcsh:Biology (General); lcsh:QD1-999; lcsh:TA1-2040; Artificial intelligence; business; lcsh:Engineering (General). Civil engineering (General); 030217 neurology & neurosurgery; lcsh:Physics; morphological processing; Applied Sciences
researchProduct
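
The unsupervised pixel classification mentioned in the abstract above relies on fuzzy C-means (FCM) clustering. The following is a minimal sketch of textbook FCM on one-dimensional intensities, not the authors' pipeline (which, judging by the keywords, also involves morphological processing). The function name, the initial centers, and the pixel values are all made up for illustration.

```python
def fuzzy_c_means_1d(values, centers, m=2.0, iterations=50):
    """Textbook fuzzy C-means on scalar values.

    values:  list of numbers (e.g. pixel intensities)
    centers: initial cluster centers (hypothetical starting guess)
    m:       fuzzifier; m = 2 is a common default
    Returns (centers, memberships), where memberships[i][j] is the
    degree to which values[i] belongs to cluster j.
    """
    centers = list(centers)
    memberships = []
    power = 2.0 / (m - 1.0)
    for _ in range(iterations):
        memberships = []
        for x in values:
            dists = [abs(x - c) for c in centers]
            if 0.0 in dists:
                # The value coincides with a center: full membership there.
                row = [1.0 if d == 0.0 else 0.0 for d in dists]
                total = sum(row)
                row = [r / total for r in row]
            else:
                # Standard FCM membership update.
                row = [1.0 / sum((d_i / d_k) ** power for d_k in dists)
                       for d_i in dists]
            memberships.append(row)
        # Each center becomes the mean of all values, weighted by u^m.
        centers = [
            sum((u[j] ** m) * x for u, x in zip(memberships, values))
            / sum(u[j] ** m for u in memberships)
            for j in range(len(centers))
        ]
    return centers, memberships


# Made-up intensities: three dark background pixels, three bright ones.
pixels = [10, 12, 11, 200, 210, 205]
centers, memberships = fuzzy_c_means_1d(pixels, centers=[0, 255])
```

Unlike k-means, each pixel keeps a graded membership in every cluster, so a threshold on the membership in the bright cluster can be tuned when marking candidate pixels.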

A Clustering Approach for Improving Network Performance in Heterogeneous Systems

2000

A lot of research has focused on solving the problem of computation-aware task scheduling on heterogeneous systems. In this paper, we propose a clustering algorithm that, given a network topology, provides a network partition adapted to the communication requirements of the applications running on the machine. Also, we propose a criterion to measure the quality of each one of the possible mappings of processes to processors based on that network partition. Evaluation results show that these proposals can greatly improve network performance, providing a basis of a communication-aware scheduling technique.

Computer science; Distributed computing; Network partition; Network performance; Throughput; Network topology; Cluster analysis; Network simulation; Scheduling (computing)
researchProduct

Least-squares community extraction in feature-rich networks using similarity data

2021

We explore a doubly-greedy approach to the issue of community detection in feature-rich networks. According to this approach, both the network and feature data are straightforwardly recovered from the underlying unknown non-overlapping communities, supplied with a center in the feature space and intensity weight(s) over the network each. Our least-squares additive criterion allows us to search for communities one-by-one and to find each community by adding entities one by one. A focus of this paper is that the feature-space data part is converted into a similarity matrix format. The similarity/link values can be used in either of two modes: (a) as measured in the same scale so that one may …

Computer science; Economics; Kernel Functions; Social Sciences; 02 engineering and technology; Least squares; Infographics; Translocation Genetic; Geographical Locations; Medical Conditions; 0202 electrical engineering electronic engineering information engineering; Medicine and Health Sciences; Psychology; Cluster Analysis; Operator Theory; Data Management; Multidisciplinary; Applied Mathematics; Simulation and Modeling; Q; R; Experimental Psychology; Europe; Feature (computer vision); Research Design; Physical Sciences; Medicine; 020201 artificial intelligence & image processing; Graphs; Algorithms; Network Analysis; Network analysis; Research Article; Computer and Information Sciences; Science; Feature vector; Scale (descriptive set theory); Research and Analysis Methods; Column (database); Similarity (network science); 020204 information systems; Parasitic Diseases; Least-Squares Analysis; Feature data; business.industry; Data Visualization; Biology and Life Sciences; Pattern recognition; Tropical Diseases; Economic Analysis; Malaria; People and Places; Artificial intelligence; business; Mathematics; PLoS ONE
researchProduct

MetNet: A two-level approach to reconstructing and comparing metabolic networks

2021

Metabolic pathway comparison and interaction between different species can reveal important information for drug engineering and medical science. In the literature, proposals for reconstructing and comparing metabolic networks present two main problems: network reconstruction usually requires human intervention to integrate information from different sources and, in metabolic comparison, the size of the networks leads to a challenging computational problem. We propose to automatically reconstruct a metabolic network on the basis of KEGG database information. Our proposal relies on a two-level representation of the huge metabolic network: the first level is graph-based and depicts pathways a…

Computer science; Enzyme Metabolism; Metabolic network; computer.software_genre; Biochemistry; Infographics; 0302 clinical medicine; Cluster Analysis; Enzyme Chemistry; Data Management; Mammals; 0303 health sciences; Multidisciplinary; Basis (linear algebra); Settore INF/01 - Informatica; Q; R; Chemical Reactions; Eukaryota; Graph; Chemistry; Vertebrates; Physical Sciences; Medicine; Carbohydrate Metabolism; Data mining; Metabolic Pathways; Computational problem; Graphs; Network Analysis; Metabolic Networks and Pathways; Research Article; Computer and Information Sciences; ComputingMethodologies_SIMULATIONANDMODELING; Science; 03 medical and health sciences; Metabolic Networks; Similarity (psychology); Xenobiotic Metabolism; Animals; Humans; Metabolomics; KEGG; Representation (mathematics); Symbiosis; 030304 developmental biology; Data Visualization; Organisms; Biology and Life Sciences; Metabolism; Metabolic pathway; ComputingMethodologies_PATTERNRECOGNITION; Amniotes; Enzymology; computer; Zoology; 030217 neurology & neurosurgery; Software; PLoS ONE
researchProduct

Detection, tracking and event localization of jet stream features in 4-D atmospheric data

2012

We introduce a novel algorithm for the efficient detection and tracking of features in spatiotemporal atmospheric data, as well as for the precise localization of the occurring genesis, lysis, merging and splitting events. The algorithm works on data given on a four-dimensional structured grid. Feature selection and clustering are based on adjustable local and global criteria; feature tracking is predominantly based on spatial overlaps of the features' full volumes. The resulting 3-D features and the identified correspondences between features of consecutive time steps are represented as the nodes and edges of a directed acyclic graph, the event graph. Merging and splitting events appear in…

Computer science; Event (computing); lcsh:QE1-996.5; Feature selection; Grid; computer.software_genre; Tracking (particle physics); Directed acyclic graph; Data segment; lcsh:Geology; Feature (computer vision); Data mining; Cluster analysis; computer; Algorithm; Physics::Atmospheric and Oceanic Physics
researchProduct
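
The event graph described in the jet stream abstract above (features as nodes, overlap-based correspondences between consecutive time steps as edges) can be sketched as follows. This is an illustration under stated assumptions rather than the authors' algorithm: features are taken as given sets of grid cells, any non-empty overlap creates an edge, and events are read off node degrees (the abstract is truncated before it says how events appear in the graph). All names and the sample data are hypothetical, and 2-D cells are used for brevity.

```python
from collections import defaultdict


def build_event_graph(steps):
    """steps[t] maps a feature id to the set of grid cells it occupies.

    Returns directed edges ((t, a), (t + 1, b)) for every pair of
    features in consecutive time steps that share at least one cell.
    """
    edges = []
    for t in range(len(steps) - 1):
        for a, cells_a in steps[t].items():
            for b, cells_b in steps[t + 1].items():
                if cells_a & cells_b:  # assumption: any overlap counts
                    edges.append(((t, a), (t + 1, b)))
    return edges


def classify_events(steps, edges):
    """Assumed degree-based reading of the four event types."""
    indeg, outdeg = defaultdict(int), defaultdict(int)
    for src, dst in edges:
        outdeg[src] += 1
        indeg[dst] += 1
    events = {"genesis": [], "lysis": [], "merging": [], "splitting": []}
    last = len(steps) - 1
    for t, features in enumerate(steps):
        for name in features:
            node = (t, name)
            if t > 0 and indeg[node] == 0:
                events["genesis"].append(node)    # no predecessor
            if t < last and outdeg[node] == 0:
                events["lysis"].append(node)      # no successor
            if indeg[node] > 1:
                events["merging"].append(node)    # several predecessors
            if outdeg[node] > 1:
                events["splitting"].append(node)  # several successors
    return events


# Made-up features over three time steps.
steps = [
    {"A": {(0, 0), (0, 1)}, "B": {(3, 3)}, "G": {(7, 7)}},
    {"C": {(0, 1), (3, 3)}},
    {"D": {(0, 1)}, "E": {(3, 3)}, "F": {(9, 9)}},
]
edges = build_event_graph(steps)
events = classify_events(steps, edges)
```

Because edges only ever point from time step t to t + 1, the resulting graph is acyclic by construction.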

Nonnegative Tensor Train Decompositions for Multi-domain Feature Extraction and Clustering

2016

Tensor train (TT) is one of the modern tensor decomposition models for low-rank approximation of high-order tensors. For nonnegative multiway array data analysis, we propose a nonnegative TT (NTT) decomposition algorithm for the NTT model and a hybrid model called the NTT-Tucker model. By employing the hierarchical alternating least squares approach, each fiber vector of core tensors is optimized efficiently at each iteration. We compared the performances of the proposed method with a standard nonnegative Tucker decomposition (NTD) algorithm by using benchmark data sets including event-related potential data and facial image data in multi-domain feature extraction and clustering tasks. It i…

Computer science; Fiber (mathematics); business.industry; Feature extraction; 020206 networking & telecommunications; Pattern recognition; 010103 numerical & computational mathematics; 02 engineering and technology; 01 natural sciences; Image (mathematics); Multi domain; Core (graph theory); 0202 electrical engineering electronic engineering information engineering; Decomposition (computer science); Tensor; Artificial intelligence; 0101 mathematics; Cluster analysis; business; Tucker decomposition
researchProduct

Semi-automatic Brain Lesion Segmentation in Gamma Knife Treatments Using an Unsupervised Fuzzy C-Means Clustering Technique

2016

MR imaging is being increasingly used in radiation treatment planning as well as for staging and assessing tumor response. Leksell Gamma Knife® is a device for stereotactic neuro-radiosurgery, used to treat lesions that are inaccessible to, or insufficiently treated by, traditional surgery or radiotherapy. The target to be treated with radiation beams is currently contoured through slice-by-slice manual segmentation on MR images. This procedure is time-consuming and operator-dependent. Segmentation result repeatability may be ensured only by using automatic/semi-automatic methods with the clinicians supporting the planning phase. In this paper a semi-automatic segmentation method, based on an unsuperv…

Computer science; Gamma knife; Brain lesions; Gamma knife treatments; MR imaging; Semi-automatic segmentation; Unsupervised FCM clustering; Fuzzy logic; 030218 nuclear medicine & medical imaging; 03 medical and health sciences; 0302 clinical medicine; Computer vision; Segmentation; Radiation treatment planning; Cluster analysis; Settore ING-INF/05 - Sistemi Di Elaborazione Delle Informazioni; business.industry; Mr imaging; Brain lesion; Gamma knife treatment; Semi automatic; Artificial intelligence; business; 030217 neurology & neurosurgery
researchProduct

Bag-of-word based brand recognition using Markov Clustering Algorithm for codebook generation

2015

In order to address the issue of counterfeiting online, it is necessary to use automatic tools that analyze the large amount of information available over the Internet. Analysis methods that extract information about the content of the images are very promising for this purpose. In this paper, a method that automatically extracts the brand of objects in images is proposed. The method does not explicitly search for text or logos. This information is implicitly included in the Bag-of-Words representation. In the Bag-of-Words paradigm, visual features are clustered to create the visual words. Despite its shortcomings, k-means is the most widely used algorithm. With k-mea…

Computer science; Initialization; 02 engineering and technology; Machine learning; computer.software_genre; [INFO.INFO-CV] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]; 0502 economics and business; 0202 electrical engineering electronic engineering information engineering; Visual Word; Cluster analysis; Representation (mathematics); Markov chain; business.industry; 05 social sciences; Codebook; Pattern recognition; Identity (object-oriented programming); 050211 marketing; 020201 artificial intelligence & image processing; Artificial intelligence; business; Algorithm; computer; Word (computer architecture)
researchProduct