6533b82efe1ef96bd1293175

RESEARCH PRODUCT

Learning of Cooperative Behaviour in Robot Populations

Paul TroddenSandor M. VeresDario BausoMichalis Smyrnakis

subject

0209 industrial biotechnologyEngineeringbusiness.industryRegretSample (statistics)02 engineering and technologyVariation (game tree)Traffic flowRobot kinematics Automobiles Service robots Convergence Games020901 industrial engineering & automationSettore ING-INF/04 - AutomaticaSimple (abstract algebra)Convergence (routing)0202 electrical engineering electronic engineering information engineeringRobot020201 artificial intelligence & image processingArtificial intelligenceSettore MAT/09 - Ricerca OperativabusinessGame theory

description

This paper addresses convergence and equilibrium properties of game theoretic learning algorithms in robot populations using simple and broadly applicable reward/cost models of cooperation between robotic agents. New models for robot cooperation are proposed by combining regret based learning methods and network evolution models. Results of mean-field game theory are employed in order to show the asymptotic second moment boundedness in the variation of cooperative behaviour. The behaviour of the proposed models are tested in simulation results, which are based on sample networks and a single lane traffic flow case study.

10.1109/ecc.2016.7810284http://hdl.handle.net/10447/253219