Search results for "Data mining"

showing 10 items of 907 documents

Physical Activity and Exercise: Text Mining Analysis

2021

It is currently difficult to have a global state of the art vision of certain scientific topics. In the field of physical activity (PA) and exercise, this is due to information overload. The present study aims to provide a solution by analysing a large mass of scientific articles using text mining (TM). The purpose was to analyse what is being investigated in the PA health field on young people from primary, secondary and higher education. Titles and abstracts published in the Web of Science (WOS) database were analysed using TM on 24 November 2020, and after removing duplicates, 85,368 remained. The results show 9960 (unique) words and the most frequently used bi-grams and tri-grams. A co-…

Higher educationAdolescentDatabases FactualHealth Toxicology and MutagenesisschoolApplied psychologyPhysical activityphysical activitytext miningField (computer science)ArticleText mininguniversityData MiningHumansSocial sphereExerciseEsportsLingüísticastudentsbusiness.industryPublic Health Environmental and Occupational HealthRInformation overloadTerm (time)Index (publishing)MedicineEducacióbusinessPsychologyInternational Journal of Environmental Research and Public Health

researchProduct

ComPWA: A common amplitude analysis framework for PANDA

2014

A large part of the physics program of the PANDA experiment at FAIR deals with the search for new conventional and exotic hadronic states like e.g. hybrids and glueballs. For many analyses PANDA will need an amplitude analysis, e.g. a partial wave analysis (PWA), to identify possible candidates and for the classification of known states. Therefore, a new, agile and efficient amplitude analysis framework ComPWA is under development. It is modularized to provide easy extension with models and formalisms as well as fitting of multiple datasets, even from different experiments. Experience from existing PWA programs was used to fix the requirements of the framework and to prevent it from restric…

HistoryEngineeringbusiness.industryComparabilityExtension (predicate logic)computer.software_genreRotation formalisms in three dimensionsComputer Science ApplicationsEducationAmplitudeSoftwareData miningbusinessQuality assurancecomputerFree parameterAgile software developmentJournal of Physics: Conference Series

researchProduct

The poor man’s goldmine? : Career paths in Swedish and Finnish merchant shipping, c. 1840–1950

2017

This article analyses the career paths of Swedish and Finnish sailors from the mid-19th to the mid-20th century. The article shows that, for the most of the men, the seaman’s occupation was just a passing phase before taking up a job on shore, but many of them also created a longlasting and advancing career by going to sea. There was not necessarily, however, a clear distinction between job opportunities at sea and those on shore in those days: men worked both at sea and on shore. We therefore argue that an individual’s advancement in a maritime career was a context-specific socio-economic phenomenon. In Scandinavia, work on board ships was dependent on features that characterized the divis…

HistoryHistory060106 history of social sciencesmaritime labour0502 economics and businessEconomic historyta6150601 history and archaeology050207 economicssocial mobilityShoregeographygeography.geographical_feature_categorycareer paths05 social sciencesdata mining06 humanities and the artsmerimiehetSocial mobilityurakehityssosiaalinen liikkuvuusOn boardIndustrialisationWork (electrical)Law8. Economic growthmerenkulkutiedonlouhintaDivision of labour

researchProduct

Evolving Tree Algorithm Modifications

2007

There are many variants of the original self-organizing neural map algorithm proposed by Kohonen. One of the most recent is the Evolving Tree, a tree-shaped self-organizing network which has many interesting characteristics. This network builds a tree structure splitting the input dataset during learning. This paper presents a speed-up modification of the original training algorithm useful when the Evolving Tree network is used with complex data as images or video. After a measurement of the effectiveness an application of the modified algorithm in image segmentation is presented.

Incremental decision treeComputer scienceID3 algorithmImage segmentationcomputer.software_genreTree (data structure)Tree traversalTree structureEvolving Tree neural networkTree networkData miningcomputerAlgorithmOrder statistic tree

researchProduct

An overview of incremental feature extraction methods based on linear subspaces

2018

Abstract With the massive explosion of machine learning in our day-to-day life, incremental and adaptive learning has become a major topic, crucial to keep up-to-date and improve classification models and their corresponding feature extraction processes. This paper presents a categorized overview of incremental feature extraction based on linear subspace methods which aim at incorporating new information to the already acquired knowledge without accessing previous data. Specifically, this paper focuses on those linear dimensionality reduction methods with orthogonal matrix constraints based on global loss function, due to the extensive use of their batch approaches versus other linear alter…

Information Systems and ManagementComputer scienceDimensionality reductionFeature extraction010103 numerical & computational mathematics02 engineering and technologycomputer.software_genre01 natural sciencesLinear subspaceManagement Information SystemsMatrix decompositionCategorizationDiscriminative modelArtificial IntelligencePrincipal component analysis0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingAdaptive learningOrthogonal matrixData mining0101 mathematicscomputerSoftwareKnowledge-Based Systems

researchProduct

Using PageRank for non-personalized default rankings in dynamic markets

2017

Abstract Default ranking algorithms are used to generate non-personalized product rankings for standard consumers, for example, on landing pages of online stores. Default rankings are created without any information about the consumers’ preferences. This paper proposes using the product centrality ranking algorithm (PCRA), which solves some problems of existing default ranking algorithms: Existing approaches either have low accuracy, because they rely on only one product attribute, or they are unable to estimate ranks for new or updated products, because they use past consumer behavior, such as previous sales or ratings. The PCRA uses the PageRank centrality of products in a product dominat…

Information Systems and ManagementGeneral Computer ScienceComputer science02 engineering and technologyManagement Science and Operations Researchcomputer.software_genreIndustrial and Manufacturing Engineeringlaw.inventionPageRanklaw0502 economics and business0202 electrical engineering electronic engineering information engineeringEconometricsProduct (category theory)Consumer behaviour05 social sciencesGraphRankingModeling and SimulationGraph (abstract data type)050211 marketing020201 artificial intelligence & image processingLearning to rankData miningCentralitycomputerEuropean Journal of Operational Research

researchProduct

Analysis and design of sequencing rules for car sequencing

2009

Abstract This paper presents novel approaches for generating sequencing rules for the car sequencing (CS) problem in cases of two and multiple processing times per station. The CS problem decides on the succession of different car models launched down a mixed-model assembly line. It aims to avoid work overloads at the stations of the line by applying so-called sequencing rules, which restrict the maximum occurrence of labor-intensive options in a subsequence of a certain length. Thus to successfully avoid work overloads, suitable sequencing rules are essential. The paper shows that the only existing rule generation approach leads to sequencing rules which misclassify feasible sequences. We …

Information Systems and ManagementGeneral Computer ScienceOperations researchComputer scienceModeling and SimulationSubsequenceData miningManagement Science and Operations ResearchLine (text file)Mixed-model assembly lines Car sequencing Sequencing rulescomputer.software_genrecomputerIndustrial and Manufacturing EngineeringEuropean Journal of Operational Research

researchProduct

A Two-layer Partitioning for Non-point Spatial Data

2021

Non-point spatial objects (e.g., polygons, linestrings, etc.) are ubiquitous and their effective management is always timely. We study the problem of indexing non-point objects in memory. We propose a secondary partitioning technique for space-oriented partitioning indices (e.g., grids), which improves their performance significantly, by avoiding the generation and elimination of duplicate results. Our approach is novel and of a high impact, as (i) it is extremely easy to implement and (ii) it can be used by any space-partitioning index. We show how our approach can be used to boost the performance of spatial range queries. We also show how we can avoid performing the expensive refinement s…

Information engineeringDistributed databaseRange query (data structures)Computer scienceSearch engine indexingScalabilityTwo layerPoint (geometry)Data miningcomputer.software_genreSpatial analysiscomputer

researchProduct

ExtMiner : Combining multiple ranking and clustering algorithms for structured document retrieval

2006

This paper introduces ExtMiner, a platform and potential tool for information management in SMEs (small & medium-size enterprise), or for organizational workgroups. ExtMiner supports interactive and iterative clustering of documents. It provides users with a visual cluster and list views at the same time, supporting iterative search process. ExtMiner may also be applied as a platform for research on retrieval fusion, since it combines search, clustering and visualization algorithms. ExtMiner was evaluated with three document collections. Although the findings were encouraging the user interface and performance with large document repositories need further development. peerReviewed

Information managementdokumenttien hakumenetelmätklusterointiDocument retrievalInformation retrievalComputer scienceDocument clusteringXMLcomputer.software_genreRanking (information retrieval)document clusteringRankingHuman–computer information retrievalRelevance (information retrieval)Data miningUser interfaceDocument retrievalCluster analysiscomputer

researchProduct

An approach based on the Adaptive Resonance Theory for analysing the viability of recommender systems in a citizen Web portal

2007

This paper proposes a methodology to optimise the future accuracy of a collaborative recommender application in a citizen Web portal. There are four stages namely, user modelling, benchmarking of clustering algorithms, prediction analysis and recommendation. The first stage is to develop analytical models of common characteristics of Web-user data. These artificial data sets are then used to evaluate the performance of clustering algorithms, in particular benchmarking the ART2 neural network with K-means clustering. Afterwards, it is evaluated the predictive accuracy of the clusters applied to a real-world data set derived from access logs to the citizen Web portal Infoville XXI (http://www…

Information retrievalArtificial neural networkComputer scienceGeneral EngineeringRecommender systemcomputer.software_genreComputer Science ApplicationsData setAdaptive resonance theoryArtificial IntelligenceCollaborative filteringData miningCluster analysiscomputerExpert Systems with Applications

researchProduct