Result: GQ($��$) Quick Reference and Implementation Guide
Title:
GQ($��$) Quick Reference and Implementation Guide
Authors:
Publisher Information:
arXiv, 2017.
Publication Year:
2017
Subject Terms:
Document Type:
Academic journal
Article
DOI:
10.48550/arxiv.1705.03967
Rights:
arXiv Non-Exclusive Distribution
Accession Number:
edsair.doi...........cac70723289c195e67ee15d6c54a3d60
Database:
OpenAIRE
Further Information
This document should serve as a quick reference for and guide to the implementation of linear GQ($��$), a gradient-based off-policy temporal-difference learning algorithm. Explanation of the intuition and theory behind the algorithm are provided elsewhere (e.g., Maei & Sutton 2010, Maei 2011). If you questions or concerns about the content in this document or the attached java code please email Adam White (adam.white@ualberta.ca). The code is provided as part of the source files in the arXiv submission.