Toggle navigation
FLAIR lab
Home
Research
Publications
Projects
Posts
Team
Software
GITHUB
CBP
Teaching
Join us
Guides
Open projects
Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory
Yufeng Zhang, Qi Cai, Zhuoran Yang, Yongxin Chen, Zhaoran Wang
Abstract
Publication
Conference on Neural Information Processing Systems (Neurips) (
Oral
)
Date
October, 2020
Links
PDF