Regularization and Feature Selection in Least Squares Temporal Difference Learning

Description

This is Python implementations of Least Angle Regression Temporal Difference (LARS-TD) algorithm and Least-Squares Temporal Difference (LSTD). For more information on the algorithm please refer to the paper

“Regularization and Feature Selection in Least Squares Temporal Difference Learning”

https://zicokolter.com/publications/kolter2009regularization.pdf

In this paper, the authors tried to propose a regularization framework for least-square temporal differences learning. Specifically, they presented an approach to find the fixed point by using l1 regularization framework. To evaluate the framework’s efficiency, they examined the framework by using two well-known problems, which means Mountain Car and Chain Domain. The results showed that the framework could deal with challenges well

Executing program

python main.py

GitHub

View Github