6533b7d2fe1ef96bd125e962

RESEARCH PRODUCT

Evolution and Learning: Evolving Sensors in a Simple MDP Environment

Tobias JungThomas UthmannPeter Dauscher

subject

Learning classifier systembusiness.industryComputer science05 social sciencesAutonomous agentExperimental and Cognitive PsychologyGrid050105 experimental psychologyTask (project management)03 medical and health sciencesBehavioral Neuroscience0302 clinical medicineGenetic algorithmReinforcement learning0501 psychology and cognitive sciencesArtificial intelligencebusinessAdaptation (computer science)030217 neurology & neurosurgeryCommunication channel

description

Natural intelligence and autonomous agents face difficulties when acting in information-dense environments. Assailed by a multitude of stimuli they have to make sense of the inflow of information, filtering and processing what is necessary, but discarding that which is unimportant. This paper aims at investigating the interactions between evolution of the sensorial channel extracting the information from the environment and the simultaneous individual adaptation of agent-control. Our particular goal is to study the influence of learning on the evolution of sensors, with learning duration being the tunable parameter. A genetic algorithm governs the evolution of sensors appropriate for the agent solving a simple grid world task. The performance of the agent is taken as fitness; ‘sensors’ are conceived as a map from environmental states to agent observations, and individual adaptation is modeled by Q-learning. Our experimental results show that due to the principles of cognitive economy learning and varying the degree thereof actually transforms the fitness landscape. In particular we identify a trade-off between learning speed (load) and sensor accuracy (error). These results are further reinforced by theoretical analysis: we derive an analytical measure for the quality of sensors based on the mutual entropy between the system of states and the selection of an optimal action, a concept recently proposed by Polani, Martinetz, and Kim.

https://doi.org/10.1177/1059712303113002