Search results for "Reinforcement learning"

showing 10 items of 95 documents

Assigning discounts in a marketing campaign by using reinforcement learning and neural networks

2009

In this work, RL is used to find an optimal policy for a marketing campaign. Data show a complex characterization of state and action spaces. Two approaches are proposed to circumvent this problem. The first approach is based on the self-organizing map (SOM), which is used to aggregate states. The second approach uses a multilayer perceptron (MLP) to carry out a regression of the action-value function. The results indicate that both approaches can improve a targeted marketing campaign. Moreover, the SOM approach allows an intuitive interpretation of the results, and the MLP approach yields robust results with generalization capabilities.

Artificial neural networkComputer scienceGeneralizationbusiness.industrymedia_common.quotation_subjectAggregate (data warehouse)General EngineeringMachine learningcomputer.software_genreComputer Science ApplicationsFunction approximationArtificial IntelligenceMultilayer perceptronReinforcement learningState (computer science)Artificial intelligenceFunction (engineering)businesscomputermedia_commonExpert Systems with Applications
researchProduct

Safer Reinforcement Learning for Agents in Industrial Grid-Warehousing

2020

In mission-critical, real-world environments, there is typically a low threshold for failure, which makes interaction with learning algorithms particularly challenging. Here, current state-of-the-art reinforcement learning algorithms struggle to learn optimal control policies safely. Loss of control follows, which could result in equipment breakages and even personal injuries.

Artificial neural networkComputer scienceSAFERControl (management)0202 electrical engineering electronic engineering information engineeringReinforcement learning020206 networking & telecommunications02 engineering and technologyMarkov decision processGridOptimal controlIndustrial engineering
researchProduct

Increasing sample efficiency in deep reinforcement learning using generative environment modelling

2020

Artificial neural networkComputer sciencebusiness.industrySample (statistics)Machine learningcomputer.software_genreTheoretical Computer ScienceComputational Theory and MathematicsArtificial IntelligenceControl and Systems EngineeringReinforcement learningMarkov decision processArtificial intelligencebusinesscomputerVDP::Teknologi: 500::Informasjons- og kommunikasjonsteknologi: 550Generative grammar
researchProduct

Road Detection for Reinforcement Learning Based Autonomous Car

2020

Human mistakes in traffic often have terrible consequences. The long-awaited introduction of self-driving vehicles may solve many of the problems with traffic, but much research is still needed before cars are fully autonomous.In this paper, we propose a new Road Detection algorithm using online supervised learning based on a Neural Network architecture. This algorithm is designed to support a Reinforcement Learning algorithm (for example, the standard Proximal Policy Optimization or PPO) by detecting when the car is in an adverse condition. Specifically, the PPO gets a penalty whenever the virtual automobile gets stuck or drives off the road with any of its four wheels.Initial experiments …

Artificial neural networkComputer sciencebusiness.industrySupervised learningNeural network architectureReinforcement learningArtificial intelligenceReinforcement learning algorithmbusinessProceedings of the 2020 The 3rd International Conference on Information Science and System
researchProduct

CostNet: An End-to-End Framework for Goal-Directed Reinforcement Learning

2020

Reinforcement Learning (RL) is a general framework concerned with an agent that seeks to maximize rewards in an environment. The learning typically happens through trial and error using explorative methods, such as \(\epsilon \)-greedy. There are two approaches, model-based and model-free reinforcement learning, that show concrete results in several disciplines. Model-based RL learns a model of the environment for learning the policy while model-free approaches are fully explorative and exploitative without considering the underlying environment dynamics. Model-free RL works conceptually well in simulated environments, and empirical evidence suggests that trial and error lead to a near-opti…

Artificial neural networkEnd-to-end principlebusiness.industryComputer scienceReinforcement learningSample (statistics)Markov decision processArtificial intelligenceEmpirical evidenceTrial and errorbusinessFeature learning
researchProduct

Multitasking in Driving as Optimal Adaptation Under Uncertainty

2021

Objective The objective was to better understand how people adapt multitasking behavior when circumstances in driving change and how safe versus unsafe behaviors emerge. Background Multitasking strategies in driving adapt to changes in the task environment, but the cognitive mechanisms of this adaptation are not well known. Missing is a unifying account to explain the joint contribution of task constraints, goals, cognitive capabilities, and beliefs about the driving environment. Method We model the driver’s decision to deploy visual attention as a stochastic sequential decision-making problem and propose hierarchical reinforcement learning as a computationally tractable solution to it. The…

Automobile Drivingreinforcement learningComputer sciencevisuaalinen ympäristöHuman Factors and Ergonomicshuomiokyky050105 experimental psychologyBehavioral NeuroscienceCognitionHuman–computer interactiondrivingHumansHuman multitaskingReinforcement learning0501 psychology and cognitive sciencesajotapamultitaskingAdaptation (computer science)050107 human factorsApplied Psychologycomputational rationalitykuljettajattask interleaving05 social sciencesUncertaintyliikennekäyttäytyminenAutomobile drivingkognitiiviset prosessithavainnot
researchProduct

Some Effects of Individual Learning on the Evolution of Sensors

2001

In this paper, we present an abstract model of sensor evolution, where sensor development is only determined by artificial evolution and the adaptation of agent reactions is accomplished by individual learning. With the environment cast into a MDP framework, sensors can be conceived as a map from environmental states to agent observations and Reinforcement Learning algorithms can be utilised. On the basis of a simple gridworld scenario, we present some results of the interaction between individual learning and evolution of sensors.

Basis (linear algebra)business.industryComputer scienceIndividual learningEvolutionary algorithmReinforcement learningMarkov decision processArtificial intelligencebusinessAdaptation (computer science)
researchProduct

Cerebellar learning of bio-mechanical functions of extra-ocular muscles: modeling by artificial neural networks

2003

A control circuit is proposed to model the command of saccadic eye movements. Its wiring is deduced from a mathematical constraint, i.e. the necessity, for motor orders processing, to compute an approximate inverse function of the bio-mechanical function of the moving plant, here the bio-mechanics of the eye. This wiring is comparable to the anatomy of the cerebellar pathways. A predicting element, necessary for inversion and thus for movement accuracy, is modeled by an artificial neural network whose structure, deduced from physical constraints expressing the mechanics of the eye, is similar to the cell connectivity of the cerebellar cortex. Its functioning is set by supervised reinforceme…

CerebellumEye MovementsArtificial neural networkbusiness.industryGeneral NeuroscienceMotor controlEye movementPattern recognitionSaccadic maskingBiomechanical Phenomenamedicine.anatomical_structureOculomotor MusclesCerebellumCerebellar cortexMotor systemmedicineLearningReinforcement learningNeural Networks ComputerArtificial intelligencebusinessNeuroscienceMathematicsNeuroscience
researchProduct

Emergent Collective Behaviors in a Multi-agent Reinforcement Learning Pedestrian Simulation: A Case Study

2015

In this work, a Multi-agent Reinforcement Learning framework is used to generate simulations of virtual pedestrians groups. The aim is to study the influence of two different learning approaches in the quality of generated simulations. The case of study consists on the simulation of the crossing of two groups of embodied virtual agents inside a narrow corridor. This scenario is a classic experiment inside the pedestrian modeling area, because a collective behavior, specifically the lanes formation, emerges with real pedestrians. The paper studies the influence of different learning algorithms, function approximation approaches, and knowledge transfer mechanisms on performance of learned ped…

Collective behaviorFunction approximationbusiness.industryComputer scienceBellman equationVector quantizationProbabilistic logicReinforcement learningArtificial intelligencebusinessTransfer of learningKnowledge transferSimulation
researchProduct

Learning formulae from elementary facts

1997

Since the seminal paper by E.M. Gold [Gol67] the computational learning theory community has been presuming that the main problem in the learning theory on the recursion-theoretical level is to restore a grammar from samples of language or a program from its sample computations. However scientists in physics and biology have become accustomed to looking for interesting assertions rather than for a universal theory explaining everything.

Computational learning theoryGrammarSample exclusion dimensionmedia_common.quotation_subjectAlgorithmic learning theoryMathematics educationLearning theoryReinforcement learningSample (statistics)Inductive reasoningmedia_commonMathematics
researchProduct