Search results for "Data set"
showing 10 items of 154 documents
A managerial approach to firms’ networking strategy
2009
As recently pointed out by many authors and companies‟ manager, the company competitiveness is more and more based on the ability of a firm to build strategic and competitive networks with partners and competitors. Nowadays, in a competitive environment, firms are facing challenges such as the growing demand of innovation, the increasing competitiveness and the need to penetrate new markets, by using network strategies. Consequently, such motivations are putting an increasing interest on networking strategy issue, which are becoming an essential strength of company competitive strategy. In this work the most significant theories on firm networking are reviewed and an innovative strategic pe…
Distance-constrained data clustering by combined k-means algorithms and opinion dynamics filters
2014
Data clustering algorithms represent mechanisms for partitioning huge arrays of multidimensional data into groups with small in–group and large out–group distances. Most of the existing algorithms fail when a lower bound for the distance among cluster centroids is specified, while this type of constraint can be of help in obtaining a better clustering. Traditional approaches require that the desired number of clusters are specified a priori, which requires either a subjective decision or global meta–information knowledge that is not easily obtainable. In this paper, an extension of the standard data clustering problem is addressed, including additional constraints on the cluster centroid di…
Comparison of Internal Clustering Validation Indices for Prototype-Based Clustering
2017
Clustering is an unsupervised machine learning and pattern recognition method. In general, in addition to revealing hidden groups of similar observations and clusters, their number needs to be determined. Internal clustering validation indices estimate this number without any external information. The purpose of this article is to evaluate, empirically, characteristics of a representative set of internal clustering validation indices with many datasets. The prototype-based clustering framework includes multiple, classical and robust, statistical estimates of cluster location so that the overall setting of the paper is novel. General observations on the quality of validation indices and on t…
Clustering techniques for personal photo album management
2009
In this work we propose a novel approach for the automatic representation of pictures achieving at more effective organization of personal photo albums. Images are analyzed and described in multiple representation spaces, namely, faces, background and time of capture. Faces are automatically detected, rectified and represented projecting the face itself in a common low-dimensional eigenspace. Backgrounds are represented with low-level visual features based on RGB histogram and Gabor filter bank. Faces, time and background information of each image in the collection is automatically organized using a mean-shift clustering technique. Given the particular domain of personal photo libraries, wh…
HyperLabelMe : A Web Platform for Benchmarking Remote-Sensing Image Classifiers
2017
HyperLabelMe is a web platform that allows the automatic benchmarking of remote-sensing image classifiers. To demonstrate this platform's attributes, we collected and harmonized a large data set of labeled multispectral and hyperspectral images with different numbers of classes, dimensionality, noise sources, and levels. The registered user can download training data pairs (spectra and land cover/use labels) and submit the predictions for unseen testing spectra. The system then evaluates the accuracy and robustness of the classifier, and it reports different scores as well as a ranked list of the best methods and users. The system is modular, scalable, and ever-growing in data sets and clas…
Can Extensive Reticulation and Concerted Evolution Result in a Cladistically Structured Molecular Data Set?
2001
Hierarchy is the main criterion for informativeness in a data set, even if no explicit reference to evolution as a causal process is provided. Sequence data (nuclear ribosomal DNA ITS) from Armeria (Plumbaginaceae) contains a certain amount of hierarchical structure as suggested by data decisiveness (DD) and distribution of tree lengths (DTL). However, ancillary evidence suggests that extensive gene flow and biased concerted evolution in these multi-copy regions have significantly shaped the ITS data set. This argument is discussed using parsimony analysis of four data sets, constructed by combining wild sequences with those from different generations of artificial hybrids (wild + F1, F2, a…
The availability of raw data in substance abuse scientific journals
2018
[Objective]: The availability of research data sets is an important milestone since it can enhance the dynamics of research. This study aims to analyse the PubMed Central repository to determine the availability and type of raw data sets in Substance Abuse journals indexed in the Journal Citation Reports.
An approach based on the Adaptive Resonance Theory for analysing the viability of recommender systems in a citizen Web portal
2007
This paper proposes a methodology to optimise the future accuracy of a collaborative recommender application in a citizen Web portal. There are four stages namely, user modelling, benchmarking of clustering algorithms, prediction analysis and recommendation. The first stage is to develop analytical models of common characteristics of Web-user data. These artificial data sets are then used to evaluate the performance of clustering algorithms, in particular benchmarking the ART2 neural network with K-means clustering. Afterwards, it is evaluated the predictive accuracy of the clusters applied to a real-world data set derived from access logs to the citizen Web portal Infoville XXI (http://www…
The ROSAT-ESO Flux Limited X-ray (REFLEX) Galaxy Cluster Survey. V. The cluster catalogue
2004
We present the catalogue of the REFLEX Cluster Survey providing information on the X-ray properties, redshifts, and some identification details of the clusters in the REFLEX sample. The catalogue describes a statistically complete X-ray flux-limited sample of 447 galaxy clusters above an X-ray flux of 3 10(-12) erg /s/cm**2 (0.1 to 2.4 keV) in an area of 4.24 ster in the southern sky. The cluster candidates were first selected by their X-ray emission in the ROSAT-All Sky Survey and subsequently spectroscopically identified in the frame of an ESO key programme. In addition to the cluster catalogue we also describe the complete selection criteria as a function of the sky position and the conv…
Spatio-Chromatic Adaptation via Higher-Order Canonical Correlation Analysis of Natural Images
2014
Independent component and canonical correlation analysis are two general-purpose statistical methods with wide applicability. In neuroscience, independent component analysis of chromatic natural images explains the spatio-chromatic structure of primary cortical receptive fields in terms of properties of the visual environment. Canonical correlation analysis explains similarly chromatic adaptation to different illuminations. But, as we show in this paper, neither of the two methods generalizes well to explain both spatio-chromatic processing and adaptation at the same time. We propose a statistical method which combines the desirable properties of independent component and canonical correlat…