Proximal Gradient Temporal Difference Learning: Stable Reinforcement Learning with Polynomial Sample Complexity

Academic Article

Authors

  • Liu, Bo
  • Gemp, Ian
  • Ghavamzadeh, Mohammad
  • Liu, Ji
  • Mahadevan, Sridhar
  • Petrik, Marek
  • Status

    Publication Date

  • 2018
  • Start Page

  • 461
  • End Page

  • 494
  • Volume

  • 63