Publications• Sorted by Date • Classified by Publication Type • Classified by Research Category • When Do Off-Policy and On-Policy Policy Gradient Methods Align?Davide Mambelli, Stephan Bongers, Onno Zoeter, Matthijs T. J. Spaan, and Frans A. Oliehoek. When Do Off-Policy and On-Policy Policy Gradient Methods Align?. arXiv e-prints, pp. arXiv:2402.12034, February 2024. DownloadAbstract(unavailable) BibTeX Entry@ARTICLE{Mambelli24arxiv, author = {{Mambelli}, Davide and {Bongers}, Stephan and {Zoeter}, Onno and {Spaan}, Matthijs T.~J. and {Oliehoek}, Frans A.}, title = {When Do Off-Policy and On-Policy Policy Gradient Methods Align?}, journal = {arXiv e-prints}, year = 2024, month = feb, eid = {arXiv:2402.12034}, pages = {arXiv:2402.12034}, doi = {10.48550/arXiv.2402.12034}, archivePrefix = {arXiv}, eprint = {2402.12034}, primaryClass = {stat.ML}, keywords = {nonrefereed, arxiv}, }
Generated by
bib2html.pl
(written by Patrick Riley) on
Tue Nov 05, 2024 16:13:37 UTC |