Publications

When Do Off-Policy and On-Policy Policy Gradient Methods Align?

Davide Mambelli, Stephan Bongers, Onno Zoeter, Matthijs T. J. Spaan, and Frans A. Oliehoek. When Do Off-Policy and On-Policy Policy Gradient Methods Align? arXiv e-prints, arXiv:2402.12034, February 2024.

BibTeX Entry

@ARTICLE{Mambelli24arxiv,
       author = {{Mambelli}, Davide and {Bongers}, Stephan and {Zoeter}, Onno and {Spaan}, Matthijs T.~J. and {Oliehoek}, Frans A.},
        title = {When Do Off-Policy and On-Policy Policy Gradient Methods Align?},
      journal = {arXiv e-prints},
         year = 2024,
        month = feb,
          eid = {arXiv:2402.12034},
        pages = {arXiv:2402.12034},
          doi = {10.48550/arXiv.2402.12034},
archivePrefix = {arXiv},
       eprint = {2402.12034},
 primaryClass = {stat.ML},
     keywords = {nonrefereed, arxiv},
}
