Search results for "Reinforcement learning"

showing 10 items of 95 documents

Designing a multi-layer edge-computing platform for energy-efficient and delay-aware offloading in vehicular networks

2021

Vehicular networks are expected to support many time-critical services requiring huge amounts of computation resources with very low delay. However, such requirements may not be fully met by vehicle on-board devices due to their limited processing and storage capabilities. The solution provided by 5G is the application of the Multi-Access Edge Computing (MEC) paradigm, which represents a low-latency alternative to remote clouds. Accordingly, we envision a multi-layer job-offloading scheme based on three levels, i.e., the Vehicular Domain, the MEC Domain and Backhaul Network Domain. In such a view, jobs can be offloaded from the Vehicular Domain to the MEC Domain, and even further o…

Markov Models; Vehicular ad hoc network; Computer Networks and Communications; Computer science; Distributed computing; 5G; Edge Computing; Reinforcement Learning; Vehicular Networks; Load balancing (computing); Domain (software engineering); Server; Markov decision process; Efficient energy use; Computer Networks

researchProduct

Least-squares temporal difference learning based on an extreme learning machine

2014

Reinforcement learning (RL) is a general class of algorithms for solving decision-making problems, which are usually modeled using the Markov decision process (MDP) framework. RL can find exact solutions only when the MDP state space is discrete and small enough. Because many real-world problems are described by continuous variables, approximation is essential in practical applications of RL. This paper focuses on learning the value function of a fixed policy in continuous MDPs. This is an important subproblem of several RL algorithms. We propose a least-squares temporal difference (LSTD) algorithm based on the extreme learning machine. LSTD is typically combined wi…

Mathematical optimization; Artificial neural network; Artificial Intelligence; Cognitive Neuroscience; Bellman equation; Reinforcement learning; State space; Markov decision process; Temporal difference learning; Computer Science Applications; Mathematics; Extreme learning machine; Curse of dimensionality; Neurocomputing

researchProduct

Kernelizing LSPE(λ)

2007

We propose the use of kernel-based methods as underlying function approximator in the least-squares based policy evaluation framework of LSPE(λ) and LSTD(λ). In particular we present the 'kernelization' of model-free LSPE(λ). The 'kernelization' is computationally made possible by using the subset of regressors approximation, which approximates the kernel using a vastly reduced number of basis functions. The core of our proposed solution is an efficient recursive implementation with automatic supervised selection of the relevant basis functions. The LSPE method is well-suited for optimistic policy iteration and can thus be used in the context of online reinforcement learning. We use the hig…

Mathematical optimization; Kernel (statistics); Kernelization; Least squares support vector machine; Benchmark (computing); Reinforcement learning; Context (language use); Basis function; Function (mathematics); Mathematics; 2007 IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning

researchProduct

Reinforcement Learning Based Mobility Load Balancing with the Cell Individual Offset

2021

In this study, we focus on the cell individual offset (CIO) parameter in the handover process, which represents the willingness of a cell to admit the incoming handovers. However, it is challenging to tune the CIO parameter, as any poor implementation can lead to undesired outcomes, such as making the neighboring cells over-loaded while decreasing the traffic load of the cell. In this work, a reinforcement learning-based approach for parameter selection is introduced, since it is quite convenient for dynamically changing environments. In that regard, two different techniques, namely Q-learning and SARSA, are proposed, as they are known for their multi-objective optimization capabilities. Mo…

Mathematical optimization; Offset (computer science); Computer science; 05 social sciences; 050801 communication & media studies; 020206 networking & telecommunications; Self-organizing network; 02 engineering and technology; Load balancing (computing); Load management; 0508 media and communications; Handover; Metric (mathematics); 0202 electrical engineering electronic engineering information engineering; Benchmark (computing); Reinforcement learning; 2021 IEEE 93rd Vehicular Technology Conference (VTC2021-Spring)

researchProduct

The Dreaming Variational Autoencoder for Reinforcement Learning Environments

2018

Reinforcement learning has shown great potential in generalizing over raw sensory data using only a single neural network for value optimization. There are several challenges in the current state-of-the-art reinforcement learning algorithms that prevent them from converging towards the global optima. It is likely that the solution to these problems lies in short- and long-term planning, exploration and memory management for reinforcement learning algorithms. Games are often used to benchmark reinforcement learning algorithms as they provide a flexible, reproducible, and easy to control environment. Regardless, few games feature a state-space where results in exploration, memory, and plannin…

Memory management; Artificial neural network; Computer science; business.industry; Benchmark (computing); Feature (machine learning); Reinforcement learning; Artificial intelligence; Markov decision process; business; Autoencoder; Generative grammar

researchProduct

A reinforcement learning approach for individualizing erythropoietin dosages in hemodialysis patients

2009

This paper presents a reinforcement learning (RL) approach for anemia management in patients with chronic renal failure. Erythropoietin (EPO) is the treatment of choice for this kind of anemia, but it is an expensive drug with some dangerous side effects that should be considered, especially for patients who do not respond to the treatment. Therefore, an individualized treatment appears to be necessary. RL is a suitable approach to tackle this problem. Moreover, the resulting policies are similar to medical protocols and hence can easily be transferred to daily practice. A cohort of 64 patients is included in the study. An implementation of the Q-learning algorithm based on a st…

Nephrology; medicine.medical_specialty; Dose; business.industry; Anemia; medicine.medical_treatment; General Engineering; medicine.disease; Computer Science Applications; Artificial Intelligence; Erythropoietin; Internal medicine; Health care; Cohort; medicine; Reinforcement learning; Hemodialysis; business; Intensive care medicine; medicine.drug; Expert Systems with Applications

researchProduct

Learning to Approach a Moving Ball with a Simulated Two-Wheeled Robot

2006

We show how a two-wheeled robot can learn to approach a moving ball using Reinforcement Learning. The robot is controlled by setting the velocities of its two wheels. It has to reach the ball under certain conditions to be able to kick it towards a given target. In order to kick, the ball has to be in front of the robot. The robot also has to reach the ball at a certain angle in relation to the target, because the ball is always kicked in the direction from the center of the robot to the ball. The robot learns which velocity differences should be applied to the wheels: one of the wheels is set to the maximum velocity, the other one according to this difference. We apply a REINFORCE algorith…

Neural gas; Radial basis function network; Computer science; business.industry; Robotics; Bang-bang robot; Computer Science::Robotics; Control theory; Ball (bearing); Robot; Reinforcement learning; Artificial intelligence; business; Simulation

researchProduct

Virtual Resource Allocation for Wireless Virtualized Heterogeneous Network with Hybrid Energy Supply

2022

In this work, two novel virtual user association and resource allocation algorithms are introduced for a wireless virtualized heterogeneous network with hybrid energy supply. In the considered system, macro base stations (MBSs) are supplied by grid power, and small base stations (SBSs) have energy harvesting capability in addition to the grid power supply. Multiple infrastructure providers (InPs) own the physical resources, i.e., BSs and radio resources. The Mobile Virtual Network Operators (MVNOs) are able to rent these resources from the InPs and operate the virtualized resources to provide services to different users. In particular, aiming to maximize the overall utility …

Optimization; energy harvesting; reinforcement learning; virtualisointi; Computer science; Distributed computing; resource allocation; syväoppiminen; wireless network virtualization; resursointi; computer.software_genre; Indium phosphide; energian kerääminen; III-V semiconductor materials; Base station; Virtualization; Hybrid power systems; Wireless; Resource management; Electrical and Electronic Engineering; Wireless networks; business.industry; Wireless network; Applied Mathematics; deep learning; Grid; Computer Science Applications; koneoppiminen; business; ADMM; computer; Heterogeneous network; langattomat verkot

researchProduct

OMNI-DRL: Learning to Fly in Forests with Omnidirectional Images

2022

Perception is crucial for drone obstacle avoidance in complex, static, and unstructured outdoor environments. However, most navigation solutions based on Deep Reinforcement Learning (DRL) use limited Field-Of-View (FOV) images as input. In this paper, we demonstrate that omnidirectional images improve these methods. Thus, we provide a comparative benchmark of several visual modalities for navigation: ground truth depth, ground truth semantic segmentation, and RGB images. These exhaustive comparisons reveal that it is superior to use an omnidirectional camera to navigate with classical DRL methods. Finally, we show in two different virtual forest environments that adapting the convolution to…

Perception and sensing; Deep Reinforcement Learning; Control and Systems Engineering; Mobile robots and vehicles; [INFO.INFO-RB] Computer Science [cs]/Robotics [cs.RO]; Omnidirectional sensors; Learning robot control

researchProduct

An Artificial Soft Somatosensory System for a Cognitive Robot

2020

The paper proposes an artificial somatosensory system loosely inspired by human biology and embedded in a cognitive architecture (CA). It enables a robot to receive stimulation from its embodiment and to use these sensations, which we call roboceptions, to behave according to both the external environment and the internal robot status. In this way, a robot that is aware of its body and able to interpret physical sensations can be more effective in the task while maintaining its well-being. The robot's physiological urges are tightly bound to the specific physical state of the robot. Positive and negative physical information can, therefore, be processed and let the robot behave in a mo…

Physical information; Human–computer interaction; Computer science; Task analysis; Robot; Reinforcement learning; Cognition; Cognitive architecture; Humanoid robot; Task (project management); 2020 Fourth IEEE International Conference on Robotic Computing (IRC)

researchProduct
