Search results for "Hierarchical Clustering"
showing 10 items of 56 documents
Preventing Overlaps in Agglomerative Hierarchical Conceptual Clustering
2020
Hierarchical Clustering is an unsupervised learning task, whi-ch seeks to build a set of clusters ordered by the inclusion relation. It is usually assumed that the result is a tree-like structure with no overlapping clusters, i.e., where clusters are either disjoint or nested. In Hierarchical Conceptual Clustering (HCC), each cluster is provided with a conceptual description which belongs to a predefined set called the pattern language. Depending on the application domain, the elements in the pattern language can be of different nature: logical formulas, graphs, tests on the attributes, etc. In this paper, we tackle the issue of overlapping concepts in the agglomerative approach of HCC. We …
An AI Walk from Pharmacokinetics to Marketing
2009
This work is intended for providing a review of reallife practical applications of Artificial Intelligence (AI) methods. We focus on the use of Machine Learning (ML) methods applied to rather real problems than synthetic problems with standard and controlled environment. In particular, we will describe the following problems in next sections: • Optimization of Erythropoietin (EPO) dosages in anaemic patients undergoing Chronic Renal Failure (CRF). • Optimization of a recommender system for citizen web portal users. • Optimization of a marketing campaign. The choice of these problems is due to their relevance and their heterogeneity. This heterogeneity shows the capabilities and versatility …
Part-of-Speech Induction by Singular Value Decomposition and Hierarchical Clustering
2006
Part-of-speech induction involves the automatic discovery of word classes and the assignment of each word of a vocabulary to one or several of these classes. The approach proposed here is based on the analysis of word distributions in a large collection of German newspaper texts. Its main advantage over other attempts is that it combines the hierarchical clustering of context vectors with a previous step of dimensionality reduction that minimizes the effects of sampling errors.
Identifying legitimate Web users and bots with different traffic profiles — an Information Bottleneck approach
2020
Abstract Recent studies reported that about half of Web users nowadays are intelligent agents (Web bots). Many bots are impersonators operating at a very high sophistication level, trying to emulate navigational behaviors of legitimate users (humans). Moreover, bot technology continues to evolve which makes bot detection even harder. To deal with this problem, many advanced methods for differentiating bots from humans have been proposed, a large part of which relies on supervised machine learning techniques. In this paper, we propose a novel approach to identify various profiles of bots and humans which combines feature selection and unsupervised learning of HTTP-level traffic patterns to d…
The Hierarchical Agglomerative Clustering with Gower index: a methodology for automatic design of OLAP cube in ecological data processing context
2015
In Press, Corrected Proof; International audience; The OLAP systems can be an improvement for ecological studies. In fact, ecology studies, follows and analyzes phenomenon across space and time and according to several parameters. OLAP systems can provide to ecologists browsing in a large dataset. One focus of the current research on OLAP system is the automatic design of OLAP cubes and of data warehouse schemas. This kind of works makes accessible OLAP technology to non information technology experts. But to be efficient, the automatic OLAP building must take into account various cases. Moreover the OLAP technology is based on the concept of hierarchy. Thereby the hierarchical clustering m…
Structural analyses in the study of behavior : From rodents to non-human primates
2022
Ajuts: J-BL's research was funded by Natural Sciences and Engineering Research Council of Canada (NSERC, Discovery Grant #: 2015-06034 to J-BL). MC, SA, and GC's research was funded by a grant from the University of Palermo, Italy. The term " structure " indicates a set of components that, in relation to each other, shape an organic complex. Such a complex takes on essential connotations of functionally unitary entity resulting from the mutual relationships of its constituent elements. In a broader sense, we can use the word " structure " to define the set of relationships among the elements of an emergent system that is not determined by the mere algebraic sum of these elements, but by the…
Classification of Chitinozoa (Llandoverian, Canada) Using Image Analysis
1996
Chitinozoa (Llandoverian, Canada) were studied using image analysis. After digitalization of the objects, shape parameters were calculated. The boundary of each fossil was then traced by a vector centred at the centroid for Fast Fourier Transform (FFT). Results of the two methods were used as variables in a hierarchical cluster analysis in order to group the samples. These results show that Chitinozoa can be significantly classified in terms of taxa using independent shape parameters obtained by image analysis.
Fuzzy Systems Based on Multispecies PSO Method in Spatial Analysis
2012
We present a method by using the hierarchical cluster-based Multispecies particle swarm optimization to generate a fuzzy system of Takagi-Sugeno-Kang type encapsulated in a geographical information system considered as environmental decision support for spatial analysis. We consider a spatial area partitioned in subzones: the data measured in each subzone are used to extract a fuzzy rule set of above mentioned type. We adopt a similarity index (greater than a specific threshold) for comparing fuzzy systems generated for adjacent subzones.
Radio frequency fingerprinting for outdoor user equipment localization
2017
The recent advancements in cellular mobile technology and smart phone usage have opened opportunities for researchers and commercial companies to develop ubiquitous low cost localization systems. Radio frequency (RF) fingerprinting is a popular positioning technique which uses radio signal strength (RSS) values from already existing infrastructures to provide satisfactory user positioning accuracy in indoor and densely built outdoor urban areas where Global Navigation Satellite System (GNSS) signal is poor and hard to reach. However a major requirement for the RF fingerprinting to maintain good localization accuracy is the collection and updating of large training database. The Minimization…
Assessment of nonnegative matrix factorization algorithms for electroencephalography spectral analysis.
2020
AbstractBackgroundNonnegative matrix factorization (NMF) has been successfully used for electroencephalography (EEG) spectral analysis. Since NMF was proposed in the 1990s, many adaptive algorithms have been developed. However, the performance of their use in EEG data analysis has not been fully compared. Here, we provide a comparison of four NMF algorithms in terms of accuracy of estimation, stability (repeatability of the results) and time complexity of algorithms with simulated data. In the practical application of NMF algorithms, stability plays an important role, which was an emphasis in the comparison. A Hierarchical clustering algorithm was implemented to evaluate the stability of NM…