Video URL

https://simons.berkeley.edu/talks/tbd-246

The Mean-Squared Error of Double Q-Learning

(2020). The Mean-Squared Error of Double Q-Learning. The Simons Institute for the Theory of Computing. https://simons.berkeley.edu/talks/tbd-246

The Mean-Squared Error of Double Q-Learning. The Simons Institute for the Theory of Computing, Dec. 02, 2020, https://simons.berkeley.edu/talks/tbd-246

          @misc{ scivideos_16826,
            doi = {},
            url = {https://simons.berkeley.edu/talks/tbd-246},
            author = {},
            keywords = {},
            language = {en},
            title = {The Mean-Squared Error of Double Q-Learning},
            publisher = {The Simons Institute for the Theory of Computing},
            year = {2020},
            month = {dec},
            note = {16826 see, \url{https://scivideos.org/Simons-Institute/16826}}
          }

R. Srikant (University of Illinois at Urbana-Champaign)

December 02, 2020

Source Repository Simons Institute

Subject

Computer Science

Abstract

We establish a theoretical comparison between the asymptotic mean-squared error of Double Q-learning and Q-learning. Using prior work on the asymptotic mean-squared error of linear stochastic approximation based on Lyapunov equations, we show that the asymptotic mean-squared error of Double Q-learning is exactly equal to that of Q-learning if Double Q-learning uses twice the learning rate of Q-learning and outputs the average of its two estimators. We also present some practical implications of this theoretical observation using simulations.

Supported by

Video URL

The Mean-Squared Error of Double Q-Learning

Abstract

Probing New Physics with Supernovae

Joint modeling of astrophysical systematics for cosmological surveys

Perspectives on Communicating Physics to the Public

Analytical Methods for Inflation Correlators.

Black holes and gravitational waves

Video URL

The Mean-Squared Error of Double Q-Learning

APA

MLA

BibTex

Abstract