Search results for "reinforcement learning"

showing 10 items of 95 documents

Roboception and adaptation in a cognitive robot

2023

In robotics, perception is usually oriented at understanding what is happening in the external world, while few works pay attention to what is occurring in the robot’s body. In this work, we propose an artificial somatosensory system, embedded in a cognitive architecture, that enables a robot to perceive the sensations from its embodiment while executing a task. We called these perceptions roboceptions, and they let the robot act according to its own physical needs in addition to the task demands. Physical information is processed by the robot to behave in a balanced way, determining the most appropriate trade-off between the achievement of the task and its well being. The experiments show …

Settore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniControl and Systems EngineeringGeneral MathematicsSomatosensory system Soft sensors Cognitive architecture Humanoid robot Roboception Reinforcement learningSoftwareComputer Science Applications

researchProduct

Deep learning techniques for visual object tracking

2023

Visual object tracking plays a crucial role in various vision systems, including biometric analysis, medical imaging, smart traffic systems, and video surveillance. Despite notable advancements in visual object tracking over the past few decades, many tracking algorithms still face challenges due to factors like illumination changes, deformation, and scale variations. This thesis is divided into three parts. The first part introduces the visual object tracking problem and discusses the traditional approaches that have been used to study it. We then propose a novel method called Tracking by Iterative Multi-Refinements, which addresses the issue of locating the target by redefining the search…

Settore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniDeep LearningVisual Object TrackingFast LearningReinforcement LearningCNN

researchProduct

Enabling peer-to-peer User-Preference-Aware Energy Sharing Through Reinforcement Learning

2020

Renewable, heterogeneous and distributed energy resources are the future of power systems, as envisioned by the recent paradigm of Virtual Power Plants (VPPs). Residential electricity generation, e.g., through photovoltaic panels, plays a fundamental role in this paradigm, where users are able to participate in an energy sharing system and exchange energy resources among each other. In this work, we study energy sharing systems and, differently from previous approaches, we consider realistic user behaviors by taking into account the user preferences and level of engagement in the energy trades. We formulate the problem of matching energy resources while contemplating the user behavior as a …

Settore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniHeuristicbusiness.industryComputer scienceDistributed computingEnergy SharingPeer-to-peercomputer.software_genreReinforcement LearningBehavioral modelingElectric power systemElectricity generationDistributed generationReinforcement learningbusinesscomputerInteger programmingVirtual Power Plant

researchProduct

A Reinforcement Learning Approach for User Preference-aware Energy Sharing Systems

2021

Energy Sharing Systems (ESS) are envisioned to be the future of power systems. In these systems, consumers equipped with renewable energy generation capabilities are able to participate in an energy market to sell their energy. This paper proposes an ESS that, differently from previous works, takes into account the consumers’ preference, engagement, and bounded rationality. The problem of maximizing the energy exchange while considering such user modeling is formulated and shown to be NP-Hard. To learn the user behavior, two heuristics are proposed: 1) a Reinforcement Learning-based algorithm, which provides a bounded regret and 2) a more computationally efficient heuristic, named BPT- ${K}…

Settore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniMathematical optimizationCorrectnessComputer Networks and CommunicationsRenewable Energy Sustainability and the EnvironmentComputer scienceHeuristicUser modelingRegretBounded rationalityReinforcement learningCoal Energy exchange Energy Sharing Systems Green products Power generation Production Reinforcement Learning Renewable energy sources User Preference Virtual Power PlantsEnergy marketHeuristics

researchProduct

TERMODINAMICA, CAMPI QUANTICI E FUNZIONI MENTALI

2010

Nel tentativo di fornire spiegazioni esaurienti sul fenomeno della mente umana, il presente saggio scientifico di anatomia comparata e di fisiologia considera alcune tesi di fondo che si collegano a nuovi parametri fisici della realtà che ci circonda.

Settore VET/01 - Anatomia Degli Animali DomesticiNeuroplasticity reinforcement learning dopamine human brain.

researchProduct

Adapting to Dynamic LEO-B5G Systems : Meta-Critic Learning Based Efficient Resource Scheduling

2022

Low earth orbit (LEO) satellite-assisted communications have been considered as one of key elements in beyond 5G systems to provide wide coverage and cost-efficient data services. Such dynamic space-terrestrial topologies impose exponential increase in the degrees of freedom in network management. In this paper, we address two practical issues for an over-loaded LEO-terrestrial system. The first challenge is how to efficiently schedule resources to serve the massive number of connected users, such that more data and users can be delivered/served. The second challenge is how to make the algorithmic solution more resilient in adapting to dynamic wireless environments.To address them, we first…

Signal Processing (eess.SP)FOS: Computer and information sciencesdynamic environmentComputer Science - Machine Learningreinforcement learningmeta-critic learningComputer Science - Artificial Intelligence5G-tekniikkaresursointiMachine Learning (cs.LG): Electrical & electronics engineering [C06] [Engineering computing & technology]LEO satelliteslangaton tiedonsiirtoresources allocationalgoritmitFOS: Electrical engineering electronic engineering information engineeringElectrical and Electronic EngineeringElectrical Engineering and Systems Science - Signal Processing: Ingénierie électrique & électronique [C06] [Ingénierie informatique & technologie]Applied MathematicstietoliikennesatelliititComputer Science ApplicationsArtificial Intelligence (cs.AI)koneoppiminenresource schedulinglangattomat verkot

researchProduct

Quantum Machine Learning: A tutorial

2021

This tutorial provides an overview of Quantum Machine Learning (QML), a relatively novel discipline that brings together concepts from Machine Learning (ML), Quantum Computing (QC) and Quantum Information (QI). The great development experienced by QC, partly due to the involvement of giant technological companies as well as the popularity and success of ML have been responsible of making QML one of the main streams for researchers working on fuzzy borders between Physics, Mathematics and Computer Science. A possible, although arguably coarse, classification of QML methods may be based on those approaches that make use of ML in a quantum experimentation environment and those others that take…

SpeedupTheoretical computer scienceQuantum machine learningComputer scienceCognitive NeuroscienceQuantum reinforcement learningQuantum computingFuzzy logicPopularityComputer Science ApplicationsComputational speed-upDevelopment (topology)Artificial IntelligenceQuantum clusteringQuantum informationQuantumQuantum-inspired learning algorithmsQuantum computerQuantum autoencoders

researchProduct

Hunting active Brownian particles: Learning optimal behavior

2021

We numerically study active Brownian particles that can respond to environmental cues through a small set of actions (switching their motility and turning left or right with respect to some direction) which are motivated by recent experiments with colloidal self-propelled Janus particles. We employ reinforcement learning to find optimal mappings between the state of particles and these actions. Specifically, we first consider a predator-prey situation in which prey particles try to avoid a predator. Using as reward the squared distance from the predator, we discuss the merits of three state-action sets and show that turning away from the predator is the most successful strategy. We then rem…

Statistical Mechanics (cond-mat.stat-mech)Single clusterComputer scienceFOS: Physical sciencesCondensed Matter - Soft Condensed MatterSmall setActive matterSoft Condensed Matter (cond-mat.soft)Reinforcement learningStatistical physicsConcentration gradientSensory cueCondensed Matter - Statistical MechanicsBrownian motionPhysical Review E

researchProduct

A Theoretical Learning Model Combining Stochastic Cellular Automata and Economic Indicators to Simulate Land Use Change

2015

The study of change in land use has been included in territorial planning to inform spatial planners and policy makers of the possible developments they face in order to optimize future management decisions. In this paper the authors present the core of an original learning model coupling stochastic Cellular Automata and economic indicators to simulate the land use change. This model is an important step in building an “environmental virtual laboratory” to explore, explain and forecast land use change.

Stochastic cellular automatonEconomic indicatorLand useOperations researchComputer scienceStochastic modellingManagement scienceVirtual LaboratoryReinforcement learningLand use land-use change and forestryCellular automatonInternational Journal of Applied Evolutionary Computation

researchProduct

An AI Walk from Pharmacokinetics to Marketing

2009

This work is intended for providing a review of reallife practical applications of Artificial Intelligence (AI) methods. We focus on the use of Machine Learning (ML) methods applied to rather real problems than synthetic problems with standard and controlled environment. In particular, we will describe the following problems in next sections: • Optimization of Erythropoietin (EPO) dosages in anaemic patients undergoing Chronic Renal Failure (CRF). • Optimization of a recommender system for citizen web portal users. • Optimization of a marketing campaign. The choice of these problems is due to their relevance and their heterogeneity. This heterogeneity shows the capabilities and versatility …

Support vector machineEngineeringComputingMethodologies_PATTERNRECOGNITIONAdaptive resonance theoryArtificial neural networkbusiness.industryMultilayer perceptronReinforcement learningArtificial intelligencebusinessCluster analysisFuzzy logicHierarchical clustering

researchProduct