Andrew Barto - Publications

University of Massachusetts, Amherst, Amherst, MA 
Reinforcement Learning

44 high-probability publications. We are testing a new system for linking publications to authors. You can help! If you notice any inaccuracies, please sign in and mark papers as correct or incorrect matches. If you identify any major omissions or other inaccuracies in the publication list, please let us know.

Year Citation  Score
2015 Niekum S, Osentoski S, Konidaris G, Chitta S, Marthi B, Barto AG. Learning grounded finite-state representations from unstructured demonstrations International Journal of Robotics Research. 34: 131-157. DOI: 10.1177/0278364914554471  0.692
2015 Niekum S, Osentoski S, Atkeson CG, Barto AG. Online Bayesian changepoint detection for articulated motion models Proceedings - Ieee International Conference On Robotics and Automation. 2015: 1468-1475. DOI: 10.1109/ICRA.2015.7139383  0.506
2015 Botvinick M, Weinstein A, Solway A, Barto A. Reinforcement learning, efficient coding, and the statistics of natural tasks Current Opinion in Behavioral Sciences. 5: 71-77. DOI: 10.1016/J.Cobeha.2015.08.009  0.454
2014 Baldassarre G, Stafford T, Mirolli M, Redgrave P, Ryan RM, Barto A. Intrinsic motivations and open-ended development in animals, humans, and robots: an overview. Frontiers in Psychology. 5: 985. PMID 25249998 DOI: 10.3389/Fpsyg.2014.00985  0.389
2013 Levy YZ, Levy DJ, Barto AG, Meyer JS. A computational hypothesis for allostasis: delineation of substance dependence, conventional therapies, and alternative treatments. Frontiers in Psychiatry. 4: 167. PMID 24391601 DOI: 10.3389/Fpsyt.2013.00167  0.709
2013 Shah A, Barto AG, Fagg AH. A dual process account of coarticulation in motor skill acquisition. Journal of Motor Behavior. 45: 531-49. PMID 24116847 DOI: 10.1080/00222895.2013.837423  0.521
2013 Kuindersma SR, Grupen RA, Barto AG. Variable risk control via stochastic optimization International Journal of Robotics Research. 32: 806-825. DOI: 10.1177/0278364913476124  0.766
2013 Kuindersma S, Grupen R, Barto A. Variational Bayesian optimization for runtime risk-sensitive control Robotics: Science and Systems. 8: 201-208.  0.609
2012 Konidaris G, Kuindersma S, Grupen R, Barto A. Robot learning from demonstration by constructing skill trees International Journal of Robotics Research. 31: 360-375. DOI: 10.1177/0278364911428653  0.76
2011 Ribas-Fernandes JJ, Solway A, Diuk C, McGuire JT, Barto AG, Niv Y, Botvinick MM. A neural signature of hierarchical reinforcement learning. Neuron. 71: 370-9. PMID 21791294 DOI: 10.1016/J.Neuron.2011.05.042  0.354
2011 Niekum S, Spector L, Barto A. Evolution of reward functions for reinforcement learning Genetic and Evolutionary Computation Conference, Gecco'11 - Companion Publication. 177-178. DOI: 10.1145/2001858.2001957  0.328
2011 Kuindersma S, Grupen R, Barto A. Learning dynamic arm motions for postural recovery Ieee-Ras International Conference On Humanoid Robots. 7-12. DOI: 10.1109/Humanoids.2011.6100881  0.769
2011 Botvinick MM, Niv Y, Barto AG. Hierarchically organised behaviour and its neural foundations: A reinforcement-learning perspective Modelling Natural Action Selection. 264-299. DOI: 10.1017/CBO9780511731525.017  0.35
2011 Konidaris G, Kuindersma S, Grupen R, Barto A. Autonomous skill acquisition on a mobile manipulator Proceedings of the National Conference On Artificial Intelligence. 2: 1468-1473.  0.599
2010 Konidaris G, Kuindersmay S, Barto A, Grupen R. Constructing skill trees for reinforcement learning agents from demonstration trajectories Advances in Neural Information Processing Systems 23: 24th Annual Conference On Neural Information Processing Systems 2010, Nips 2010 0.66
2009 Shah A, Barto AG. Effect on movement selection of an evolving sensory representation: a multiple controller model of skill acquisition. Brain Research. 1299: 55-73. PMID 19595991 DOI: 10.1016/j.brainres.2009.07.006  0.404
2009 Botvinick MM, Niv Y, Barto AC. Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective. Cognition. 113: 262-80. PMID 18926527 DOI: 10.1016/j.cognition.2008.08.011  0.379
2006 Rosenstein MT, Barto AG, Van Emmerik REA. Learning at the level of synergies for a robot weightlifter Robotics and Autonomous Systems. 54: 706-717. DOI: 10.1016/J.Robot.2006.03.002  0.771
2006 Wolfe AP, Barto AG. Decision tree methods for finding reusable MDP homomorphisms Proceedings of the National Conference On Artificial Intelligence. 1: 530-535.  0.46
2005 Berthier NE, Rosenstein MT, Barto AG. Approximate optimal control as a model for motor learning. Psychological Review. 112: 329-46. PMID 15783289 DOI: 10.1037/0033-295X.112.2.329  0.745
2005 Şimşek Ö, Wolfe AP, Barto AG. Identifying useful subgoals in reinforcement learning by local graph partitioning Icml 2005 - Proceedings of the 22nd International Conference On Machine Learning. 817-824.  0.594
2004 Shah A, Fagg AH, Barto AG. Cortical involvement in the recruitment of wrist muscles. Journal of Neurophysiology. 91: 2445-56. PMID 14749314 DOI: 10.1152/Jn.00879.2003  0.404
2004 Rosenstein MT, Barto AG. Reinforcement learning with supervision by a stable controller Proceedings of the American Control Conference. 5: 4517-4522. DOI: 10.1109/ACC.2004.182663  0.752
2004 Şimşek O, Wolfe AP, Barto AG. Local graph partitioning as a basis for generating temporally-extended actions in reinforcement learning Aaai Workshop - Technical Report. 91-96.  0.591
2003 Barto AG, Mahadevan S. Recent Advances in Hierarchical Reinforcement Learning Discrete Event Dynamic Systems: Theory and Applications. 13: 343-379+382. DOI: 10.1023/A:1022140919877  0.366
2003 Perkins TJ, Barto AG. Lyapunov design for safe reinforcement learning Journal of Machine Learning Research. 3: 803-832.  0.639
2002 Fagg AH, Shah A, Barto AG. A computational model of muscle recruitment for wrist movements. Journal of Neurophysiology. 88: 3348-58. PMID 12466451 DOI: 10.1152/Jn.00621.2002  0.404
2002 Ravindran B, Barto AG. Model minimization in hierarchical reinforcement learning Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2371: 196-211.  0.36
2001 Rosenstein MT, Barto AG. Robot weightlifting by direct policy search Ijcai International Joint Conference On Artificial Intelligence. 839-844.  0.715
2001 Perkins TJ, Barto AG. Heuristic search in infinite state spaces guided by Lyapunov analysis Ijcai International Joint Conference On Artificial Intelligence. 242-247.  0.518
1999 Moll R, Barto AG, Perkins TJ, Sutton RS. Learning instance-independent value functions to enhance local search Advances in Neural Information Processing Systems. 1017-1023.  0.706
1997 Barto AG, Sutton RS. Chapter 19 Reinforcement learning in artificial intelligence Advances in Psychology. 121: 358-386. DOI: 10.1016/S0166-4115(97)80105-7  0.575
1997 Hansen EA, Barto AG, Zilberstein S. Reinforcement learning for mixed open-loop and closed-loop control Advances in Neural Information Processing Systems. 1026-1032.  0.304
1997 Duff MO, Barto AG. Local bandit approximation for optimal learning problems Advances in Neural Information Processing Systems. 1019-1025.  0.666
1994 Gullapalli V, Barto AG, Grupen RA. Learning admittance mappings for force-guided assembly Proceedings - Ieee International Conference On Robotics and Automation. 2633-2638.  0.609
1992 Sutton RS, Barto AG, Williams RJ. Reinforcement Learning is Direct Adaptive Optimal Control Ieee Control Systems. 12: 19-22. DOI: 10.1109/37.126844  0.572
1991 Jacobs RA, Jordan MI, Barto AG. Task decomposition through competition in a modular connectionist architecture: The what and where vision tasks Cognitive Science. 15: 219-250. DOI: 10.1016/0364-0213(91)80006-Q  0.641
1986 Moore JW, Desmond JE, Berthier NE, Blazis DE, Sutton RS, Barto AG. Simulation of the classically conditioned nictitating membrane response by a neuron-like adaptive element: response topography, neuronal firing, and interstimulus intervals. Behavioural Brain Research. 21: 143-54. PMID 3755947 DOI: 10.1016/0166-4328(86)90092-6  0.403
1983 Barto AG, Sutton RS, Anderson CW. Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems Ieee Transactions On Systems, Man and Cybernetics. 834-846. DOI: 10.1109/TSMC.1983.6313077  0.543
1982 Barto AG, Anderson CW, Sutton RS. Synthesis of nonlinear control surfaces by a layered associative search network. Biological Cybernetics. 43: 175-85. PMID 7093360 DOI: 10.1007/BF00319977  0.489
1982 Barto AG, Sutton RS. Simulation of anticipatory responses in classical conditioning by a neuron-like adaptive element. Behavioural Brain Research. 4: 221-35. PMID 6277346 DOI: 10.1016/0166-4328(82)90001-8  0.399
1981 Barto AG, Sutton RS. Landmark learning: an illustration of associative search. Biological Cybernetics. 42: 1-8. PMID 7326277 DOI: 10.1007/BF00335152  0.526
1981 Sutton RS, Barto AG. Toward a modern theory of adaptive networks: expectation and prediction. Psychological Review. 88: 135-70. PMID 7291377 DOI: 10.1037/0033-295X.88.2.135  0.478
1979 Barto AG, Sutton RS, Brouwer PS. Associative search network: A reinforcement learning associative memory Biological Cybernetics. 40: 201-211. DOI: 10.1007/BF00453370  0.462
Show low-probability matches.