6533b872fe1ef96bd12d381a
RESEARCH PRODUCT
Interpretable Option Discovery Using Deep Q-Learning and Variational Autoencoders
Per-arne AndersenOle-christoffer GranmoMorten Goodwinsubject
Generalizationbusiness.industryComputer scienceAutonomous agentQ-learningSample (statistics)Machine learningcomputer.software_genreLocal convergenceVariety (cybernetics)Reinforcement learningArtificial intelligenceCluster analysisbusinesscomputerdescription
Deep Reinforcement Learning (RL) is unquestionably a robust framework to train autonomous agents in a wide variety of disciplines. However, traditional deep and shallow model-free RL algorithms suffer from low sample efficiency and inadequate generalization for sparse state spaces. The options framework with temporal abstractions [18] is perhaps the most promising method to solve these problems, but it still has noticeable shortcomings. It only guarantees local convergence, and it is challenging to automate initiation and termination conditions, which in practice are commonly hand-crafted.
year | journal | country | edition | language |
---|---|---|---|---|
2021-01-01 |