Can Temporal-Difference and Q-Learning Learn Representation? A Mean-Field Theory

Abstract

Publication
Conference on Neural Information Processing Systems (NeurIPS) (Oral)
Date