Reinforcement learning is direct adaptive optimal control - Research

That is, at any time step k, the control rule should specify ... is to estimate a real-valued function, Q, of ..... Department of Computer Science at the University.
415KB taille 1 téléchargements 277 vues