Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Review why Q-learning could fail. #24

Open
SBoulanger opened this issue Jul 16, 2020 · 0 comments
Open

Review why Q-learning could fail. #24

SBoulanger opened this issue Jul 16, 2020 · 0 comments
Assignees
Labels
learning Learn something

Comments

@SBoulanger
Copy link
Collaborator

For more information about how and why Q-learning methods can fail, see 1) this classic paper by Tsitsiklis and van Roy, 2) the (much more recent) review by Szepesvari (in section 4.3.2), and 3) chapter 11 of Sutton and Barto, especially section 11.3 (on “the deadly triad” of function approximation, bootstrapping, and off-policy data, together causing instability in value-learning algorithms)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
learning Learn something
Projects
None yet
Development

No branches or pull requests

2 participants