6533b833fe1ef96bd129c3ae
RESEARCH PRODUCT
Learning for allocations in the long-run average core of dynamical cooperative TU games
Puduru Viswanadha ReddyDario Bausosubject
Mathematical optimizationComputer scienceRobustness (computer science)Stochastic processBounded functionRule-based systemRobust controlVideo game designGame theoryMathematical economicsUpper and lower boundsgame theory controldescription
We consider repeated coalitional TU games characterized by unknown but bounded and time-varying coalitions' values. We build upon the assumption that the Game Designer uses a vague measure of the extra reward that each coalition has received up to the current time to learn on how to re-adjust the allocations among the players. As main result, we present an allocation rule based on the extra reward variable that converges with probability one to the core of the long-run average game. Analogies with stochastic stability theory are put in evidence.
year | journal | country | edition | language |
---|---|---|---|---|
2011-02-01 |