Michael O. Duff, Ph.D.

University of Massachusetts, Amherst, Amherst, MA 
Reinforcement Learning
"Michael Duff"
Mean distance: 16.58 (cluster 29)


Sign in to add mentor
Andrew Barto grad student 2002 U Mass Amherst
 (Optimal learning: Computational procedures for Bayes -adaptive Markov decision processes.)
BETA: Related publications


You can help our author matching system! If you notice any publications incorrectly attributed to this author, please sign in and mark matches as correct or incorrect.

Niv Y, Duff MO, Dayan P. (2005) Dopamine, uncertainty and TD learning. Behavioral and Brain Functions : Bbf. 1: 6
Duff MO, Barto AG. (1997) Local bandit approximation for optimal learning problems Advances in Neural Information Processing Systems. 1019-1025
See more...