Search results for "DATA MINING"
showing 10 items of 907 documents
Exploring automatic grouping procedures in ceramic petrology
2004
Although a small number of studies have attempted to introduce automatic grouping procedures into thin section petrography of archaeological ceramics, the majority of studies continue to be carried out by non-automatic means. Although such an approach with the single observer grouping samples has a number of advantages, it is problematic when dealing with large numbers of samples. This paper aims to explore different coding systems and statistical analyses for grouping ceramic thin sections. In the example discussed a number of variables are defined, codified and analysed by correspondence analysis, classical multidimensional scaling, non-metric isotonic multidimensional scaling and Sammon …
Towards more relevance-oriented data mining research
2008
Data mining (DM) research has successfully developed advanced DM techniques and algorithms over the last few decades, and many organisations have great expectations to take more benefit of their data warehouses in decision making. Currently, the strong focus of most DM-researchers is still only on technology-oriented topics. Commonly the DM research has several stakeholders, the major of which can be divided into internal and external ones each having their own point of view, and which are at least partly conflicting. The most important internal groups of stakeholders are the DM research community and academics in other disciplines. The most important external stakeholder groups are manager…
An ANN model to correlate roughness and structural performance in asphalt pavements
2017
Abstract In this paper, using a large database from the Long Term Pavement Performance program, the authors developed an Artificial Neural Network (ANN) to estimate the structural performance of asphalt pavements from roughness data. Considering advantages of modern high-performance survey devices in the acquisition of road pavement functional parameters, it would be of practical significance if the structural state of a pavement could be estimated from its functional conditions. To differentiate various road section conditions, several significant input parameters, related to traffic, weather, and structural aspects, have been included in the analysis. The results are very interesting and …
Neural Networks as Soft Sensors: a Comparison in a Real World Application.
2006
Physical atmosphere parameters, as temperature or humidity, can be indirectly estimated on the surface of a monument by means of soft sensors based on neural networks, if an ambient air monitoring station works in the neighborhood of the monument itself. Since the soft sensors work as virtual instruments, the accuracy of such measurements has to be analyzed and validated from statistical and metrological points of view. The paper compares different typologies of neural networks, which can be used as soft sensors in a complex real world application: a non invasive monitoring of the conservation state of old monuments. In this context, several designed connessionistic systems, based on radial…
A Cluster Analysis of Stock Market Data Using Hierarchical SOMs
2016
The analysis of stock markets has become relevant mainly because of its financial implications. In this paper, we propose a novel methodology for performing a structured cluster analysis of stock market data. Our proposed method uses a tree-based neural network called the TTOSOM. The TTOSOM performs self-organization to construct tree-based clusters of vector data in the multi-dimensional space. The resultant tree possesses interesting mathematical properties such as a succinct representation of the original data distribution, and a preservation of the underlying topology. In order to demonstrate the capabilities of our method, we analyze 206 assets of the Italian stock market. We were able…
A Review of Kernel Methods in Remote Sensing Data Analysis
2011
Kernel methods have proven effective in the analysis of images of the Earth acquired by airborne and satellite sensors. Kernel methods provide a consistent and well-founded theoretical framework for developing nonlinear techniques and have useful properties when dealing with low number of (potentially high dimensional) training samples, the presence of heterogenous multimodalities, and different noise sources in the data. These properties are particularly appropriate for remote sensing data analysis. In fact, kernel methods have improved results of parametric linear methods and neural networks in applications such as natural resource control, detection and monitoring of anthropic infrastruc…
Web Usage Mining by Neural Hybrid Prediction with Markov Chain Components
2021
This paper presents and evaluates a two-level web usage prediction technique, consisting of a neural network in the first level and contextual component predictors in the second level. We used Markov chains of different orders as contextual predictors to anticipate the next web access based on specific web access history. The role of the neural network is to decide, based on previous behaviour, whose predictor’s output to use. The predicted web resources are then prefetched into the cache of the browser. In this way, we considerably increase the hit rate of the web browser, which shortens the load times. We have determined the optimal configuration of the proposed hybrid predictor on a real…
Optimal Pruned K-Nearest Neighbors: OP-KNN Application to Financial Modeling
2008
The paper proposes a methodology called OP-KNN, which builds a one hidden-layer feed forward neural network, using nearest neighbors neurons with extremely small computational time. The main strategy is to select the most relevant variables beforehand, then to build the model using KNN kernels. Multi-response sparse regression (MRSR) is used as the second step in order to rank each k-th nearest neighbor and finally as a third step leave-one-out estimation is used to select the number of neighbors and to estimate the generalization performances. This new methodology is tested on a toy example and is applied to financial modeling.
Improving the Competency of Classifiers through Data Generation
2001
This paper describes a hybrid approach in which sub-symbolic neural networks and symbolic machine learning algorithms are grouped into an ensemble of classifiers. Initially each classifier determines which portion of the data it is most competent in. The competency information is used to generated new data that are used for further training and prediction. The application of this approach in a difficult to learn domain shows an increase in the predictive power, in terms of the accuracy and level of competency of both the ensemble and the component classifiers.
Clustering Quality and Topology Preservation in Fast Learning SOMs
2008
The Self-Organizing Map (SOM) is a popular unsupervised neural network able to provide effective clustering and data visualization for data represented in multidimensional input spaces. In this paper, we describe Fast Learning SOM (FLSOM) which adopts a learning algorithm that improves the performance of the standard SOM with respect to the convergence time in the training phase. We show that FLSOM also improves the quality of the map by providing better clustering quality and topology preservation of multidimensional input data. Several tests have been carried out on different multidimensional datasets, which demonstrate better performances of the algorithm in comparison with the original …