Search results for "Data stream mining"

showing 10 items of 35 documents

The upgraded HADES trigger and data acquisition system

2011

The HADES experiment is a High Acceptance Di-Electron Spectrometer located at GSI in Darmstadt, Germany. Recently, its trigger and data acquisition system was upgraded. The main goal was to substantially increase the event rate capability by a factor of up to 20 to reach 100 kHz in light and 20 kHz in heavy ion reaction systems. The total data rate written to storage is about 400 MByte/s in peak.In this context, the complete read-out system was exchanged to FPGA-based platforms using optical communication. For data transport a general-purpose real-time network protocol was developed to meet the strong requirements of the system. In particular, trigger information has to reach all front-end …

EthernetEvent (computing)business.industryData stream miningComputer scienceContext (language use)Data acquisitionServer farmVirtual address spacebusinessCommunications protocolInstrumentationMathematical PhysicsComputer hardwareJournal of Instrumentation
researchProduct

CUDA-Accelerated Alignment of Subsequences in Streamed Time Series Data

2014

Euclidean Distance (ED) and Dynamic Time Warping (DTW) are cornerstones in the field of time series data mining. Many high-level algorithms like kNN-classification, clustering or anomaly detection make excessive use of these distance measures as subroutines. Furthermore, the vast growth of recorded data produced by automated monitoring systems or integrated sensors establishes the need for efficient implementations. In this paper, we introduce linear memory parallelization schemes for the alignment of a given query Q in a stream of time series data S for both ED and DTW using CUDA-enabled accelerators. The ED parallelization features a log-linear calculation scheme in contrast to the naive …

Euclidean distanceCUDADynamic time warpingData stream miningComputer scienceAnomaly detectionParallel computingCluster analysisTime complexityDistance measures2014 43rd International Conference on Parallel Processing
researchProduct

Remote Sensing Image Classification with Large Scale Gaussian Processes

2017

Current remote sensing image classification problems have to deal with an unprecedented amount of heterogeneous and complex data sources. Upcoming missions will soon provide large data streams that will make land cover/use classification difficult. Machine learning classifiers can help at this, and many methods are currently available. A popular kernel classifier is the Gaussian process classifier (GPC), since it approaches the classification problem with a solid probabilistic treatment, thus yielding confidence intervals for the predictions as well as very competitive results to state-of-the-art neural networks and support vector machines. However, its computational cost is prohibitive for…

FOS: Computer and information sciences010504 meteorology & atmospheric sciencesComputer scienceMultispectral image0211 other engineering and technologiesMachine Learning (stat.ML)02 engineering and technologyLand cover01 natural sciencesStatistics - ApplicationsMachine Learning (cs.LG)Kernel (linear algebra)Bayes' theoremsymbols.namesakeStatistics - Machine LearningApplications (stat.AP)Electrical and Electronic EngineeringGaussian process021101 geological & geomatics engineering0105 earth and related environmental sciencesRemote sensingContextual image classificationArtificial neural networkData stream miningProbabilistic logicSupport vector machineComputer Science - LearningKernel (image processing)symbolsGeneral Earth and Planetary Sciences
researchProduct

Machine learning information fusion in Earth observation: A comprehensive review of methods, applications and data sources

2020

This paper reviews the most important information fusion data-driven algorithms based on Machine Learning (ML) techniques for problems in Earth observation. Nowadays we observe and model the Earth with a wealth of observations, from a plethora of different sensors, measuring states, fluxes, processes and variables, at unprecedented spatial and temporal resolutions. Earth observation is well equipped with remote sensing systems, mounted on satellites and airborne platforms, but it also involves in-situ observations, numerical models and social media data streams, among other data sources. Data-driven approaches, and ML techniques in particular, are the natural choice to extract significant i…

FOS: Computer and information sciencesEarth observationComputer Science - Machine LearningComputer scienceComputer Vision and Pattern Recognition (cs.CV)Computer Science - Computer Vision and Pattern Recognition02 engineering and technologyMachine learningcomputer.software_genreField (computer science)Machine Learning (cs.LG)Set (abstract data type)0202 electrical engineering electronic engineering information engineeringbusiness.industryData stream mining020206 networking & telecommunicationsNumerical modelsSensor fusionInformation fusionHardware and ArchitectureSignal Processing020201 artificial intelligence & image processingArtificial intelligencebusinesscomputerSoftwareInformation SystemsInformation Fusion
researchProduct

A two-armed bandit collective for hierarchical examplar based mining of frequent itemsets with applications to intrusion detection

2014

Published version of a chapter in the book: Transactions on Computational Collective Intelligence XIV. Also available from the publisher at: http://dx.doi.org/10.1007/978-3-662-44509-9_1 In this paper we address the above problem by posing frequent item-set mining as a collection of interrelated two-armed bandit problems. We seek to find itemsets that frequently appear as subsets in a stream of itemsets, with the frequency being constrained to support granularity requirements. Starting from a randomly or manually selected examplar itemset, a collective of Tsetlin automata based two-armed bandit players - one automaton for each item in the examplar - learns which items should be included in …

Finite-state machineVDP::Technology: 500::Information and communication technology: 550::Computer technology: 551Computational complexity theoryData stream miningComputer scienceNearest neighbor searchSearch engine indexingInformationSystems_DATABASEMANAGEMENTIntrusion detection systemcomputer.software_genreCardinalityAnomaly detectionData miningcomputer
researchProduct

Higher-Fidelity Frugal and Accurate Quantile Estimation Using a Novel Incremental <italic>Discretized</italic> Paradigm

2018

Traditional pattern classification works with the moments of the distributions of the features and involves the estimation of the means and variances. As opposed to this, more recently, research has indicated the power of using the quantiles of the distributions because they are more robust and applicable for non-parametric methods. The estimation of the quantiles is even more pertinent when one is mining data streams. However, the complexity of quantile estimation is much higher than the corresponding estimation of the mean and variance, and this increased complexity is more relevant as the size of the data increases. Clearly, in the context of infinite data streams, a computational and sp…

General Computer ScienceDiscretizationLearning automataData stream miningComputer scienceGeneral EngineeringEstimatorContext (language use)02 engineering and technologyRobustness (computer science)0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingGeneral Materials ScienceAlgorithmQuantileIEEE Access
researchProduct

EUDAQ $-$ A Data Acquisition Software Framework for Common Beam Telescopes

2019

EUDAQ is a generic data acquisition software developed for use in conjunction with common beam telescopes at charged particle beam lines. Providing high-precision reference tracks for performance studies of new sensors, beam telescopes are essential for the research and development towards future detectors for high-energy physics. As beam time is a highly limited resource, EUDAQ has been designed with reliability and ease-of-use in mind. It enables flexible integration of different independent devices under test via their specific data acquisition systems into a top-level framework. EUDAQ controls all components globally, handles the data flow centrally and synchronises and records the data…

Physics - Instrumentation and DetectorsDetector control systems (detector and experiment monitoring and slow-control systems architecture hardware algorithms databases)data acquisitionData management01 natural sciences7. Clean energyHigh Energy Physics - Experiment030218 nuclear medicine & medical imagingHigh Energy Physics - Experiment (hep-ex)0302 clinical medicineData acquisitionbeam [charged particle]Particle tracking detectors[PHYS.HEXP]Physics [physics]/High Energy Physics - Experiment [hep-ex]hardwareDetectors and Experimental Techniquesphysics.ins-detInstrumentationMathematical PhysicsData processingData stream miningPhysicsDetectorInstrumentation and Detectors (physics.ins-det)control systemCharged particle beamdatabases)Particle Physics - ExperimentComputer hardwareperformancearchitectureData acquisition system for beam tests [5]FOS: Physical sciencesalgorithmsprogramming03 medical and health sciencesCalorimeterscharged particle: beam0103 physical sciencesddc:530ddc:610[PHYS.PHYS.PHYS-INS-DET]Physics [physics]/Physics [physics]/Instrumentation and Detectors [physics.ins-det]hep-ex010308 nuclear & particles physicsbusiness.industryDetector control systems (detector and experiment monitoring and slow-control systemsData acquisition conceptsData flow diagramdata managementbusinessBeam (structure)
researchProduct

Application of dictionary learning to denoise LIGO’s blip noise transients

2020

Data streams of gravitational-wave detectors are polluted by transient noise features, or ``glitches,'' of instrumental and environmental origin. In this work we investigate the use of total variation methods and learned dictionaries to mitigate the effect of those transients in the data. We focus on a specific type of transient, ``blip" glitches, as this is the most common type of glitch present in the LIGO detectors and their waveforms are easy to identify. We randomly select 100 blip glitches scattered in the data from advanced LIGO's O1 run, as provided by the citizen-science project Gravity Spy. Our results show that dictionary-learning methods are a valid approach to model and subtrac…

Physics010308 nuclear & particles physicsData stream miningAstrophysics::High Energy Astrophysical PhenomenaAstrophysics::Instrumentation and Methods for AstrophysicsFOS: Physical sciencesBinary numberGeneral Relativity and Quantum Cosmology (gr-qc)Type (model theory)01 natural sciencesGeneral Relativity and Quantum CosmologyLIGOGlitchNoiseTransient noise0103 physical sciencesAstrophysics::Solar and Stellar AstrophysicsTransient (computer programming)010306 general physicsAlgorithmPhysical Review D
researchProduct

Modeling recurrent distributions in streams using possible worlds

2015

Discovering changes in the data distribution of streams and discovering recurrent data distributions are challenging problems in data mining and machine learning. Both have received a lot of attention in the context of classification. With the ever increasing growth of data, however, there is a high demand of compact and universal representations of data streams that enable the user to analyze current as well as historic data without having access to the raw data. To make a first step towards this direction, we propose a condensed representation that captures the various — possibly recurrent — data distributions of the stream by extending the notion of possible worlds. The representation en…

Possible worldBasis (linear algebra)Computer scienceData stream miningRepresentation (systemics)Context (language use)Data pre-processingData miningRaw datacomputer.software_genrecomputerData modeling2015 IEEE International Conference on Data Science and Advanced Analytics (DSAA)
researchProduct

On using novel “Anti-Bayesian” techniques for the classification of dynamical data streams

2017

The classification of dynamical data streams is among the most complex problems encountered in classification. This is, firstly, because the distribution of the data streams is non-stationary, and it changes without any prior “warning”. Secondly, the manner in which it changes is also unknown. Thirdly, and more interestingly, the model operates with the assumption that the correct classes of previously-classified patterns become available at a juncture after their appearance. This paper pioneers the use of unreported novel schemes that can classify such dynamical data streams by invoking the recently-introduced “Anti-Bayesian” (AB) techniques. Contrary to the Bayesian paradigm, that compare…

QuantilesComputer scienceData stream miningBayesian probability02 engineering and technologyClassificationcomputer.software_genreAnti-Bayesian classificationRobustness (computer science)020204 information systems0202 electrical engineering electronic engineering information engineering020201 artificial intelligence & image processingData miningcomputerBayesian paradigmQuantile2017 IEEE Congress on Evolutionary Computation (CEC)
researchProduct