Exploring compact reinforcement-learning representations with linear regression

Thomas Walsh; István Szita; Carlos Diuk; Michael Littman

doi:10.7282/T3ZW1QCR

Technical documentation

Open access

Rutgers University

2009

DOI: https://doi.org/10.7282/T3ZW1QCR

Abstract

This paper presents a new algorithm for online linear regression whose efficiency guarantees satisfy the requirements of the KWIK (Knows What It Knows) framework. The algorithm improves on the computational and storage complexity bounds of the current state-of-the-art procedure in this setting. We explore several applications of this algorithm for learning compact reinforcement-learning representations. We show that KWIK linear regression can be used to learn the reward function of a factored MDP and the probabilities of action outcomes in Stochastic STRIPS and Object Oriented MDPs, none of which have been proven to be efficiently learnable in the RL setting before. We also combine KWIK linear regression with other KWIK learners to learn larger portions of these models, including experiments on learning factored MDP transition and reward functions together.
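To make the KWIK protocol mentioned in the abstract concrete, here is a minimal sketch of an online linear regressor that either predicts or answers "I don't know" (returned as `None`). It is not the algorithm from the report. It assumes a generic ridge-regression model and a simple uncertainty test, and the class name `KwikLinearRegressor` and the parameters `ridge` and `threshold` are hypothetical, chosen only for illustration.

```python
from typing import List, Optional


class KwikLinearRegressor:
    """Toy KWIK-style online linear regressor (illustrative only)."""

    def __init__(self, dim: int, ridge: float = 1.0, threshold: float = 0.15) -> None:
        self.dim = dim
        self.threshold = threshold  # hypothetical cutoff on the uncertainty score
        # Inverse of the regularized Gram matrix A = ridge * I + sum of x x^T.
        self.a_inv = [[(1.0 / ridge if i == j else 0.0) for j in range(dim)]
                      for i in range(dim)]
        # Running sum of y * x over the observed examples.
        self.b = [0.0] * dim

    def _a_inv_times(self, v: List[float]) -> List[float]:
        return [sum(row[j] * v[j] for j in range(self.dim)) for row in self.a_inv]

    def predict(self, x: List[float]) -> Optional[float]:
        """Return a prediction, or None ("I don't know") if x is poorly covered."""
        u = self._a_inv_times(x)
        uncertainty = sum(xi * ui for xi, ui in zip(x, u))  # x^T A^{-1} x
        if uncertainty > self.threshold:
            return None
        theta = self._a_inv_times(self.b)  # ridge estimate of the weights
        return sum(t * xi for t, xi in zip(theta, x))

    def update(self, x: List[float], y: float) -> None:
        """Incorporate a labeled example via a Sherman-Morrison rank-one update."""
        u = self._a_inv_times(x)  # A is symmetric, so x^T A^{-1} equals u^T
        denom = 1.0 + sum(xi * ui for xi, ui in zip(x, u))
        for i in range(self.dim):
            for j in range(self.dim):
                self.a_inv[i][j] -= u[i] * u[j] / denom
            self.b[i] += y * x[i]


# Usage: the learner asks for labels only when it answers "I don't know".
learner = KwikLinearRegressor(dim=2, ridge=0.01)
for _ in range(20):
    if learner.predict([1.0, 0.0]) is None:
        learner.update([1.0, 0.0], 2.0)  # observed label for this input
print(learner.predict([1.0, 0.0]), learner.predict([0.0, 1.0]))
```

The design choice in this sketch is that uncertainty depends only on how well past inputs cover the direction of the query, so the learner keeps answering `None` for directions it has never observed.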


Version of Record (VoR) Open Access



Details

Title: Exploring compact reinforcement-learning representations with linear regression
Creators: Thomas Walsh (Author) - Computer Science (New Brunswick); István Szita (Author) - University of Alberta; Carlos Diuk (Author) - Computer Science (New Brunswick); Michael Littman (Author) - Computer Science (New Brunswick)
Date published: 2009
Publisher: Rutgers University
Number of pages: 11 p.
Academic Unit: School of Arts and Sciences; Computer Science (SAS)
Language: English
Resource Type: Technical documentation
Comment: Technical report DCS-tr-660
Identifiers: 991031549960204646