Exploring compact reinforcement-learning representations with linear regression
Technical documentation   Open access

Thomas Walsh, István Szita, Carlos Diuk and Michael Littman
Rutgers University
2009
DOI: https://doi.org/10.7282/T3ZW1QCR

Abstract

This paper presents a new algorithm for online linear regression whose efficiency guarantees satisfy the requirements of the KWIK (Knows What It Knows) framework. The algorithm improves on the computational and storage complexity bounds of the current state-of-the-art procedure in this setting. We explore several applications of this algorithm for learning compact reinforcement-learning representations. We show that KWIK linear regression can be used to learn the reward function of a factored MDP and the probabilities of action outcomes in Stochastic STRIPS and Object-Oriented MDPs, none of which had previously been proven to be efficiently learnable in the RL setting. We also combine KWIK linear regression with other KWIK learners to learn larger portions of these models, including experiments on learning factored MDP transition and reward functions together.
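
For readers unfamiliar with the KWIK protocol, the following is a minimal illustrative sketch of a learner that either predicts or declines with "I don't know" (here `None`). It is not the algorithm from the report. The class name `KWIKLinearRegressor`, the uncertainty measure x^T A^{-1} x, the ridge-style initialization A = I, and the `threshold` value are all assumptions chosen for illustration.

```python
from typing import List, Optional


class KWIKLinearRegressor:
    """Toy KWIK-style learner (hypothetical, not the report's algorithm)."""

    def __init__(self, dim: int, threshold: float = 0.1) -> None:
        self.dim = dim
        self.threshold = threshold  # assumed uncertainty cutoff
        # Inverse of A = I + sum(x x^T), maintained incrementally.
        self.a_inv = [[1.0 if i == j else 0.0 for j in range(dim)]
                      for i in range(dim)]
        self.b = [0.0] * dim  # running sum of y * x

    def _a_inv_times(self, v: List[float]) -> List[float]:
        return [sum(row[j] * v[j] for j in range(self.dim))
                for row in self.a_inv]

    def predict(self, x: List[float]) -> Optional[float]:
        """Return an estimate, or None ("I don't know") if x is poorly covered."""
        u = self._a_inv_times(x)
        uncertainty = sum(xi * ui for xi, ui in zip(x, u))  # x^T A^{-1} x
        if uncertainty > self.threshold:
            return None
        theta = self._a_inv_times(self.b)  # regularized least-squares weights
        return sum(t * xi for t, xi in zip(theta, x))

    def update(self, x: List[float], y: float) -> None:
        """Incorporate one labeled example via a Sherman-Morrison update."""
        u = self._a_inv_times(x)
        denom = 1.0 + sum(xi * ui for xi, ui in zip(x, u))
        for i in range(self.dim):
            for j in range(self.dim):
                self.a_inv[i][j] -= u[i] * u[j] / denom
            self.b[i] += y * x[i]


learner = KWIKLinearRegressor(dim=2)
first = learner.predict([1.0, 0.0])    # no data yet, so the learner declines
for _ in range(20):
    learner.update([1.0, 0.0], 2.0)    # observations along the first axis only
seen = learner.predict([1.0, 0.0])     # well-covered direction: numeric estimate
unseen = learner.predict([0.0, 1.0])   # unexplored direction: still declines
```

In this sketch the learner answers only in directions of the input space it has observed often enough. In an RL setting, a declined prediction is typically treated as a cue to explore.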
Version of Record (VoR) Open Access

