0000000000297033

AUTHOR

Francisco Martinez-gil

0000-0002-2795-2816

showing 10 related works from this author

MARL-Ped: A multi-agent reinforcement learning based framework to simulate pedestrian groups

2014

Abstract Pedestrian simulation is complex because there are different levels of behavior modeling. At the lowest level, local interactions between agents occur; at the middle level, strategic and tactical behaviors appear like overtakings or route choices; and at the highest level path-planning is necessary. The agent-based pedestrian simulators either focus on a specific level (mainly in the lower one) or define strategies like the layered architectures to independently manage the different behavioral levels. In our Multi-Agent Reinforcement-Learning-based Pedestrian simulation framework (MARL-Ped) the situation is addressed as a whole. Each embodied agent uses a model-free Reinforcement L…

EngineeringFocus (computing)business.industryPedestriancomputer.software_genreEmbodied agentHardware and ArchitectureVirtual machineModeling and SimulationShortest path problemPath (graph theory)Reinforcement learningArtificial intelligenceMotion planningbusinesscomputerSoftwareSimulation Modelling Practice and Theory
researchProduct

Clinically-Driven Virtual Patient Cohorts Generation: An Application to Aorta

2021

The combination of machine learning methods together with computational modeling and simulation of the cardiovascular system brings the possibility of obtaining very valuable information about new therapies or clinical devices through in-silico experiments. However, the application of machine learning methods demands access to large cohorts of patients. As an alternative to medical data acquisition and processing, which often requires some degree of manual intervention, the generation of virtual cohorts made of synthetic patients can be automated. However, the generation of a synthetic sample can still be computationally demanding to guarantee that it is clinically meaningful and that it re…

Computer sciencePhysiologySample (statistics)Target populationMachine learningcomputer.software_genreData acquisitionVirtual patientPhysiology (medical)digital twinQP1-981support vector machineOriginal Researchbusiness.industrygenerative adversarial networkSampling (statistics)synthetic populationthoracic-aortaSupport vector machineReference samplein-silico trialsCohortArtificial intelligencevirtual cohortbusinesscomputerclinically-driven samplingFrontiers in Physiology
researchProduct

MARL-Ped+Hitmap: Towards Improving Agent-Based Simulations with Distributed Arrays

2016

Multi-agent systems allow the modelling of complex, heterogeneous, and distributed systems in a realistic way. MARL-Ped is a multi-agent system tool, based on the MPI standard, for the simulation of different scenarios of pedestrians who autonomously learn the best behavior by Reinforcement Learning. MARL-Ped uses one MPI process for each agent by design, with a fixed fine-grain granularity. This requirement limits the performance of the simulations for a restricted number of processors that is lesser than the number of agents. On the other hand, Hitmap is a library to ease the programming of parallel applications based on distributed arrays. It includes abstractions for the automatic parti…

020203 distributed computingComputer scienceDistributed computingMessage passing0202 electrical engineering electronic engineering information engineeringProcess (computing)Reinforcement learning020207 software engineering02 engineering and technologyCrowd simulationGranularityPartition (database)
researchProduct

Calibrating a Motion Model Based on Reinforcement Learning for Pedestrian Simulation

2012

In this paper, the calibration of a framework based in Multi-agent Reinforcement Learning (RL) for generating motion simulations of pedestrian groups is presented. The framework sets a group of autonomous embodied agents that learn to control individually its instant velocity vector in scenarios with collisions and friction forces. The result of the process is a different learned motion controller for each agent. The calibration of both, the physical properties involved in the motion of our embodied agents and the corresponding dynamics, is an important issue for a realistic simulation. The physics engine used has been calibrated with values taken from real pedestrian dynamics. Two experime…

Computer Science::Multiagent SystemsComputer scienceDynamics (mechanics)DiagramComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISIONCalibrationProcess (computing)Reinforcement learningMotion controllerPhysics engineSimulationMotion (physics)
researchProduct

Modeling, evaluation, and scale on artificial pedestrians: a literature review

2017

Modeling pedestrian dynamics and their implementation in a computer are challenging and important issues in the knowledge areas of transportation and computer simulation. The aim of this article is to provide a bibliographic outlook so that the reader may have quick access to the most relevant works related to this problem. We have used three main axes to organize the article's contents: pedestrian models, validation techniques, and multiscale approaches. The backbone of this work is the classification of existing pedestrian models; we have organized the works in the literature under five categories, according to the techniques used for implementing the operational level in each pedestrian …

DesignGeneral Computer ScienceComputer sciencemedia_common.quotation_subjectPerformance02 engineering and technologyPedestriancomputer.software_genreTheoretical Computer SciencePedestrian modeling and simulationCrowdsSimulació per ordinador0502 economics and business0202 electrical engineering electronic engineering information engineeringScale sizeAlgorismes computacionalsQuality (business)Macromedia_commonInformática050210 logistics & transportationFocus (computing)MultimediaValidation methodsScale (chemistry)05 social sciencesVerificationMultiscale simulation020207 software engineeringCrowdsData scienceValidation methodsMacroscopic and microscopic pedestrian modelscomputerAlgorithms
researchProduct

Emergent Collective Behaviors in a Multi-agent Reinforcement Learning Pedestrian Simulation: A Case Study

2015

In this work, a Multi-agent Reinforcement Learning framework is used to generate simulations of virtual pedestrians groups. The aim is to study the influence of two different learning approaches in the quality of generated simulations. The case of study consists on the simulation of the crossing of two groups of embodied virtual agents inside a narrow corridor. This scenario is a classic experiment inside the pedestrian modeling area, because a collective behavior, specifically the lanes formation, emerges with real pedestrians. The paper studies the influence of different learning algorithms, function approximation approaches, and knowledge transfer mechanisms on performance of learned ped…

Collective behaviorFunction approximationbusiness.industryComputer scienceBellman equationVector quantizationProbabilistic logicReinforcement learningArtificial intelligencebusinessTransfer of learningKnowledge transferSimulation
researchProduct

Agent's actions as a classification criteria for the state space in a learning from rewards system

2008

We focus in this paper on the problem of learning an autonomous agent's policy when the state space is very large and the set of actions available is comparatively short. To this end, we use a non-parametric decision rule (concretely, a nearest-neighbour strategy) in order to cluster the state space by means of the action that leads to a successful situation. Using an exploration strategy to avoid greedy behaviour, the agent builds clusters of positively-classified states through trial and error learning. In this paper, we implement a 3D synthetic agent which plays an 'avoid the asteroid' game that suits our assumptions. Using as the state space a feature vector space extracted from a visua…

Error-driven learningComputer sciencebusiness.industryFeature vectorAutonomous agentDecision ruleTrial and errorcomputer.software_genreMachine learningTheoretical Computer ScienceIntelligent agentArtificial IntelligenceVisual navigation systemArtificial intelligencebusinessClassifier (UML)computerSoftwareJournal of Experimental & Theoretical Artificial Intelligence
researchProduct

Multi-agent Reinforcement Learning for Simulating Pedestrian Navigation

2012

In this paper we introduce a Multi-agent system that uses Reinforcement Learning (RL) techniques to learn local navigational behaviors to simulate virtual pedestrian groups. The aim of the paper is to study empirically the validity of RL to learn agent-based navigation controllers and their transfer capabilities when they are used in simulation environments with a higher number of agents than in the learned scenario. Two RL algorithms which use Vector Quantization (VQ) as the generalization method for the space state are presented. Both strategies are focused on obtaining a good vector quantizier that generalizes adequately the state space of the agents. We empirically state the convergence…

Computer scienceGeneralizationbusiness.industryVector quantizationContext (language use)Machine learningcomputer.software_genreDomain (software engineering)Convergence (routing)State spaceReinforcement learningArtificial intelligenceTransfer of learningbusinesscomputer
researchProduct

Using Inverse Reinforcement Learning with Real Trajectories to Get More Trustworthy Pedestrian Simulations

2020

Reinforcement learning is one of the most promising machine learning techniques to get intelligent behaviors for embodied agents in simulations. The output of the classic Temporal Difference family of Reinforcement Learning algorithms adopts the form of a value function expressed as a numeric table or a function approximator. The learned behavior is then derived using a greedy policy with respect to this value function. Nevertheless, sometimes the learned policy does not meet expectations, and the task of authoring is difficult and unsafe because the modification of one value or parameter in the learned value function has unpredictable consequences in the space of the policies it represents…

0209 industrial biotechnologyreinforcement learningComputer scienceGeneral Mathematics02 engineering and technologypedestrian simulationTask (project management)learning by demonstration020901 industrial engineering & automationAprenentatgeInformàticaBellman equation0202 electrical engineering electronic engineering information engineeringComputer Science (miscellaneous)Reinforcement learningEngineering (miscellaneous)business.industrycausal entropylcsh:MathematicsProcess (computing)020206 networking & telecommunicationsFunction (mathematics)inverse reinforcement learninglcsh:QA1-939Problem domainTable (database)Artificial intelligenceTemporal difference learningbusinessoptimizationMathematics
researchProduct

Emergent behaviors and scalability for multi-agent reinforcement learning-based pedestrian models

2017

This paper analyzes the emergent behaviors of pedestrian groups that learn through the multiagent reinforcement learning model developed in our group. Five scenarios studied in the pedestrian model literature, and with different levels of complexity, were simulated in order to analyze the robustness and the scalability of the model. Firstly, a reduced group of agents must learn by interaction with the environment in each scenario. In this phase, each agent learns its own kinematic controller, that will drive it at a simulation time. Secondly, the number of simulated agents is increased, in each scenario where agents have previously learnt, to test the appearance of emergent macroscopic beha…

Engineeringmedia_common.quotation_subject02 engineering and technologyPedestrianMachine learningcomputer.software_genreConsistency (database systems)Robustness (computer science)0202 electrical engineering electronic engineering information engineeringReinforcement learningQuality (business)Macromedia_commonInformáticaPedestrian simulation and modelingKinematic controllerbusiness.industry020207 software engineeringEmergent behavioursBehavioural simulationHardware and ArchitectureModeling and SimulationScalability020201 artificial intelligence & image processingArtificial intelligencebusinessMulti-agent reinforcement learning (Marl)computerSoftwareSimulation Modelling Practice and Theory
researchProduct