Structured Apprenticeship Learning

Abdeslam Boularias; Oliver Krömer; Jan Peters

doi:10.1007/978-3-642-33486-3_15

Book chapter

Peer reviewed

Machine Learning and Knowledge Discovery in Databases, pp.227-242

Lecture Notes in Computer Science, Springer Berlin Heidelberg

2012

DOI: https://doi.org/10.1007/978-3-642-33486-3_15

Abstract

Keywords: Adjacent State; Markov Decision Process; Markov Random Field; Optimal Policy; Reward Function

We propose a graph-based algorithm for apprenticeship learning when the reward features are noisy. Previous apprenticeship learning techniques learn a reward function using only local state features. This can be a limitation in practice, as some features are often misspecified or subject to measurement noise. Our graphical framework, inspired by work on Markov Random Fields, alleviates this problem by propagating information between states and rewarding policies that choose similar actions in adjacent states. We demonstrate the advantage of the proposed approach on grid-world navigation problems and on the problem of teaching a robot to grasp novel objects in simulation.
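As a rough illustration of the idea of rewarding policies that choose similar actions in adjacent states, the sketch below scores a deterministic grid-world policy by adding an agreement bonus to a sum of local rewards. It is not the algorithm from the chapter: the function name `structured_score`, the `coupling` weight, and the simple additive form are all assumptions made for this example.

```python
def structured_score(policy, local_reward, coupling=0.5):
    """Score a deterministic grid policy (illustrative only, not the chapter's method).

    policy[i][j]          -- action index chosen in cell (i, j)
    local_reward[i][j][a] -- (possibly noisy) reward for action a in cell (i, j)
    coupling              -- hypothetical weight on agreement between neighbours
    """
    height, width = len(policy), len(policy[0])
    local_term = 0.0
    agreements = 0
    for i in range(height):
        for j in range(width):
            action = policy[i][j]
            local_term += local_reward[i][j][action]
            # Count each horizontally or vertically adjacent pair once.
            if j + 1 < width and policy[i][j + 1] == action:
                agreements += 1
            if i + 1 < height and policy[i + 1][j] == action:
                agreements += 1
    return local_term + coupling * agreements


# A 2x2 grid with two actions; one cell carries a spurious (noisy) reward.
rewards = [[[0.0, 0.0] for _ in range(2)] for _ in range(2)]
rewards[0][1][1] = 0.3

smooth_policy = [[0, 0], [0, 0]]   # same action everywhere
noisy_policy = [[0, 1], [0, 0]]    # follows the spurious reward in one cell

print(structured_score(smooth_policy, rewards))
print(structured_score(noisy_policy, rewards))
```

With these made-up numbers the smooth policy scores higher than the one that chases the noisy reward, which is the qualitative effect the abstract describes. How the chapter actually defines and learns such a structure is not stated in this record.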

Details

Title: Structured Apprenticeship Learning
Creators: Abdeslam Boularias - Max Planck Institute for Intelligent Systems, Tübingen, Germany
Oliver Krömer - Darmstadt University of Technology, Darmstadt, Germany
Jan Peters - Darmstadt University of Technology, Darmstadt, Germany
Publication Details: Machine Learning and Knowledge Discovery in Databases, pp.227-242
Date published: 2012
Series: Lecture Notes in Computer Science
Publisher: Springer Berlin Heidelberg; Berlin, Heidelberg
Academic Unit: Computer Science (SAS)
Language: English
Resource Type: Book chapter
Identifiers: 991031665409504646