Michael O. Duff, Ph.D.
Affiliations: | University of Massachusetts, Amherst, Amherst, MA |
Area:
Reinforcement LearningGoogle:
"Michael Duff"Mean distance: 16.58 (cluster 29)
Parents
Sign in to add mentorAndrew Barto | grad student | 2002 | U Mass Amherst | |
(Optimal learning: Computational procedures for Bayes -adaptive Markov decision processes.) |
BETA: Related publications
See more...
Publications
You can help our author matching system! If you notice any publications incorrectly attributed to this author, please sign in and mark matches as correct or incorrect. |
Niv Y, Duff MO, Dayan P. (2005) Dopamine, uncertainty and TD learning. Behavioral and Brain Functions : Bbf. 1: 6 |
Duff MO, Barto AG. (1997) Local bandit approximation for optimal learning problems Advances in Neural Information Processing Systems. 1019-1025 |