Search results for "Data mining"
showing 10 items of 907 documents
Physical Activity and Exercise: Text Mining Analysis
2021
It is currently difficult to have a global state of the art vision of certain scientific topics. In the field of physical activity (PA) and exercise, this is due to information overload. The present study aims to provide a solution by analysing a large mass of scientific articles using text mining (TM). The purpose was to analyse what is being investigated in the PA health field on young people from primary, secondary and higher education. Titles and abstracts published in the Web of Science (WOS) database were analysed using TM on 24 November 2020, and after removing duplicates, 85,368 remained. The results show 9960 (unique) words and the most frequently used bi-grams and tri-grams. A co-…
ComPWA: A common amplitude analysis framework for PANDA
2014
A large part of the physics program of the PANDA experiment at FAIR deals with the search for new conventional and exotic hadronic states like e.g. hybrids and glueballs. For many analyses PANDA will need an amplitude analysis, e.g. a partial wave analysis (PWA), to identify possible candidates and for the classification of known states. Therefore, a new, agile and efficient amplitude analysis framework ComPWA is under development. It is modularized to provide easy extension with models and formalisms as well as fitting of multiple datasets, even from different experiments. Experience from existing PWA programs was used to fix the requirements of the framework and to prevent it from restric…
The poor man’s goldmine? : Career paths in Swedish and Finnish merchant shipping, c. 1840–1950
2017
This article analyses the career paths of Swedish and Finnish sailors from the mid-19th to the mid-20th century. The article shows that, for the most of the men, the seaman’s occupation was just a passing phase before taking up a job on shore, but many of them also created a longlasting and advancing career by going to sea. There was not necessarily, however, a clear distinction between job opportunities at sea and those on shore in those days: men worked both at sea and on shore. We therefore argue that an individual’s advancement in a maritime career was a context-specific socio-economic phenomenon. In Scandinavia, work on board ships was dependent on features that characterized the divis…
Evolving Tree Algorithm Modifications
2007
There are many variants of the original self-organizing neural map algorithm proposed by Kohonen. One of the most recent is the Evolving Tree, a tree-shaped self-organizing network which has many interesting characteristics. This network builds a tree structure splitting the input dataset during learning. This paper presents a speed-up modification of the original training algorithm useful when the Evolving Tree network is used with complex data as images or video. After a measurement of the effectiveness an application of the modified algorithm in image segmentation is presented.
An overview of incremental feature extraction methods based on linear subspaces
2018
Abstract With the massive explosion of machine learning in our day-to-day life, incremental and adaptive learning has become a major topic, crucial to keep up-to-date and improve classification models and their corresponding feature extraction processes. This paper presents a categorized overview of incremental feature extraction based on linear subspace methods which aim at incorporating new information to the already acquired knowledge without accessing previous data. Specifically, this paper focuses on those linear dimensionality reduction methods with orthogonal matrix constraints based on global loss function, due to the extensive use of their batch approaches versus other linear alter…
Using PageRank for non-personalized default rankings in dynamic markets
2017
Abstract Default ranking algorithms are used to generate non-personalized product rankings for standard consumers, for example, on landing pages of online stores. Default rankings are created without any information about the consumers’ preferences. This paper proposes using the product centrality ranking algorithm (PCRA), which solves some problems of existing default ranking algorithms: Existing approaches either have low accuracy, because they rely on only one product attribute, or they are unable to estimate ranks for new or updated products, because they use past consumer behavior, such as previous sales or ratings. The PCRA uses the PageRank centrality of products in a product dominat…
Analysis and design of sequencing rules for car sequencing
2009
Abstract This paper presents novel approaches for generating sequencing rules for the car sequencing (CS) problem in cases of two and multiple processing times per station. The CS problem decides on the succession of different car models launched down a mixed-model assembly line. It aims to avoid work overloads at the stations of the line by applying so-called sequencing rules, which restrict the maximum occurrence of labor-intensive options in a subsequence of a certain length. Thus to successfully avoid work overloads, suitable sequencing rules are essential. The paper shows that the only existing rule generation approach leads to sequencing rules which misclassify feasible sequences. We …
A Two-layer Partitioning for Non-point Spatial Data
2021
Non-point spatial objects (e.g., polygons, linestrings, etc.) are ubiquitous and their effective management is always timely. We study the problem of indexing non-point objects in memory. We propose a secondary partitioning technique for space-oriented partitioning indices (e.g., grids), which improves their performance significantly, by avoiding the generation and elimination of duplicate results. Our approach is novel and of a high impact, as (i) it is extremely easy to implement and (ii) it can be used by any space-partitioning index. We show how our approach can be used to boost the performance of spatial range queries. We also show how we can avoid performing the expensive refinement s…
ExtMiner : Combining multiple ranking and clustering algorithms for structured document retrieval
2006
This paper introduces ExtMiner, a platform and potential tool for information management in SMEs (small & medium-size enterprise), or for organizational workgroups. ExtMiner supports interactive and iterative clustering of documents. It provides users with a visual cluster and list views at the same time, supporting iterative search process. ExtMiner may also be applied as a platform for research on retrieval fusion, since it combines search, clustering and visualization algorithms. ExtMiner was evaluated with three document collections. Although the findings were encouraging the user interface and performance with large document repositories need further development. peerReviewed
An approach based on the Adaptive Resonance Theory for analysing the viability of recommender systems in a citizen Web portal
2007
This paper proposes a methodology to optimise the future accuracy of a collaborative recommender application in a citizen Web portal. There are four stages namely, user modelling, benchmarking of clustering algorithms, prediction analysis and recommendation. The first stage is to develop analytical models of common characteristics of Web-user data. These artificial data sets are then used to evaluate the performance of clustering algorithms, in particular benchmarking the ART2 neural network with K-means clustering. Afterwards, it is evaluated the predictive accuracy of the clusters applied to a real-world data set derived from access logs to the citizen Web portal Infoville XXI (http://www…