6533b872fe1ef96bd12d381a

RESEARCH PRODUCT

Interpretable Option Discovery Using Deep Q-Learning and Variational Autoencoders

Per-arne AndersenOle-christoffer GranmoMorten Goodwin

subject

Generalizationbusiness.industryComputer scienceAutonomous agentQ-learningSample (statistics)Machine learningcomputer.software_genreLocal convergenceVariety (cybernetics)Reinforcement learningArtificial intelligenceCluster analysisbusinesscomputer

description

Deep Reinforcement Learning (RL) is unquestionably a robust framework to train autonomous agents in a wide variety of disciplines. However, traditional deep and shallow model-free RL algorithms suffer from low sample efficiency and inadequate generalization for sparse state spaces. The options framework with temporal abstractions [18] is perhaps the most promising method to solve these problems, but it still has noticeable shortcomings. It only guarantees local convergence, and it is challenging to automate initiation and termination conditions, which in practice are commonly hand-crafted.

https://doi.org/10.1007/978-3-030-71711-7_11