Andrew Barto - Publications

Affiliations: 
University of Massachusetts, Amherst, Amherst, MA 
Area:
Reinforcement Learning
Website:
http://www-anw.cs.umass.edu/~barto/

105 high-probability publications. We are testing a new system for linking publications to authors. You can help! If you notice any inaccuracies, please sign in and mark papers as correct or incorrect matches. If you identify any major omissions or other inaccuracies in the publication list, please let us know.

Year Citation  Score
2015 Niekum S, Osentoski S, Konidaris G, Chitta S, Marthi B, Barto AG. Learning grounded finite-state representations from unstructured demonstrations International Journal of Robotics Research. 34: 131-157. DOI: 10.1177/0278364914554471  1
2015 Niekum S, Osentoski S, Atkeson CG, Barto AG. Online Bayesian changepoint detection for articulated motion models Proceedings - Ieee International Conference On Robotics and Automation. 2015: 1468-1475. DOI: 10.1109/ICRA.2015.7139383  1
2015 Botvinick M, Weinstein A, Solway A, Barto A. Reinforcement learning, efficient coding, and the statistics of natural tasks Current Opinion in Behavioral Sciences. 5: 71-77. DOI: 10.1016/j.cobeha.2015.08.009  1
2014 Baldassarre G, Stafford T, Mirolli M, Redgrave P, Ryan RM, Barto A. Intrinsic motivations and open-ended development in animals, humans, and robots: an overview. Frontiers in Psychology. 5: 985. PMID 25249998 DOI: 10.3389/fpsyg.2014.00985  1
2014 Solway A, Diuk C, Córdova N, Yee D, Barto AG, Niv Y, Botvinick MM. Optimal behavioral hierarchy. Plos Computational Biology. 10: e1003779. PMID 25122479 DOI: 10.1371/journal.pcbi.1003779  1
2014 Barto AG. Commentary on utility and bounds. Topics in Cognitive Science. 6: 338-41. PMID 24764141 DOI: 10.1111/tops.12090  1
2014 Da Silva BC, Baldassarre G, Konidaris G, Barto A. Learning parameterized motor skills on a humanoid robot Proceedings - Ieee International Conference On Robotics and Automation. 5239-5244. DOI: 10.1109/ICRA.2014.6907629  1
2014 Barto AG, Konidaris G, Vigorito C. Behavioral hierarchy: Exploration and representation Computational and Robotic Models of the Hierarchical Organization of Behavior. 13-46. DOI: 10.1007/978-3-642-39875-9_2  1
2014 Da Silva BC, Konidaris G, Barto A. Active learning of parameterized skills 31st International Conference On Machine Learning, Icml 2014. 5: 3736-3745.  1
2013 Levy YZ, Levy DJ, Barto AG, Meyer JS. A computational hypothesis for allostasis: delineation of substance dependence, conventional therapies, and alternative treatments. Frontiers in Psychiatry. 4: 167. PMID 24391601 DOI: 10.3389/fpsyt.2013.00167  1
2013 Barto A, Mirolli M, Baldassarre G. Novelty or surprise? Frontiers in Psychology. 4: 907. PMID 24376428 DOI: 10.3389/fpsyg.2013.00907  1
2013 Shah A, Barto AG, Fagg AH. A dual process account of coarticulation in motor skill acquisition. Journal of Motor Behavior. 45: 531-49. PMID 24116847 DOI: 10.1080/00222895.2013.837423  1
2013 Kuindersma SR, Grupen RA, Barto AG. Variable risk control via stochastic optimization International Journal of Robotics Research. 32: 806-825. DOI: 10.1177/0278364913476124  1
2013 Barto AG. Intrinsic motivation and reinforcement learning Intrinsically Motivated Learning in Natural and Artificial Systems. 17-47. DOI: 10.1007/978-3-642-32375-1_2  1
2012 Konidaris G, Kuindersma S, Grupen R, Barto A. Robot learning from demonstration by constructing skill trees International Journal of Robotics Research. 31: 360-375. DOI: 10.1177/0278364911428653  1
2012 Niekum S, Osentoski S, Konidaris G, Barto AG. Learning and generalization of complex tasks from unstructured demonstrations Ieee International Conference On Intelligent Robots and Systems. 5239-5246. DOI: 10.1109/IROS.2012.6386006  1
2012 Thomas PS, Barto AG. Motor primitive discovery 2012 Ieee International Conference On Development and Learning and Epigenetic Robotics, Icdl 2012. DOI: 10.1109/DevLrn.2012.6400845  1
2012 Da Silva BC, Barto AG. TD-δπ: A model-free algorithm for efficient exploration Proceedings of the National Conference On Artificial Intelligence. 2: 886-892.  1
2012 Dabney W, Barto AG. Adaptive step-size for online temporal difference learning Proceedings of the National Conference On Artificial Intelligence. 2: 872-878.  1
2012 Da Silva BC, Konidaris G, Barto AG. Learning parameterized skills Proceedings of the 29th International Conference On Machine Learning, Icml 2012. 2: 1679-1686.  1
2012 Konidaris G, Scheidwasser I, Barto AG. Transfer in reinforcement learning via shared features Journal of Machine Learning Research. 13: 1333-1371.  1
2011 Ribas-Fernandes JJ, Solway A, Diuk C, McGuire JT, Barto AG, Niv Y, Botvinick MM. A neural signature of hierarchical reinforcement learning. Neuron. 71: 370-9. PMID 21791294 DOI: 10.1016/j.neuron.2011.05.042  1
2011 Niekum S, Spector L, Barto A. Evolution of reward functions for reinforcement learning Genetic and Evolutionary Computation Conference, Gecco'11 - Companion Publication. 177-178. DOI: 10.1145/2001858.2001957  1
2011 Kuindersma S, Grupen R, Barto A. Learning dynamic arm motions for postural recovery Ieee-Ras International Conference On Humanoid Robots. 7-12. DOI: 10.1109/Humanoids.2011.6100881  1
2011 Botvinick MM, Niv Y, Barto AG. Hierarchically organised behaviour and its neural foundations: A reinforcement-learning perspective Modelling Natural Action Selection. 264-299. DOI: 10.1017/CBO9780511731525.017  1
2011 Niekum S, Barto AG. Clustering via Dirichlet process mixture models for portable skill discovery Advances in Neural Information Processing Systems 24: 25th Annual Conference On Neural Information Processing Systems 2011, Nips 2011 1
2011 Konidaris G, Kuindersma S, Grupen R, Barto A. Autonomous skill acquisition on a mobile manipulator Proceedings of the National Conference On Artificial Intelligence. 2: 1468-1473.  1
2011 Thomas PS, Barto AG. Conjugate Markov decision processes Proceedings of the 28th International Conference On Machine Learning, Icml 2011. 137-144.  1
2010 Stout A, Barto AG. Competence progress intrinsic motivation 2010 Ieee 9th International Conference On Development and Learning, Icdl-2010 - Conference Program. 257-262. DOI: 10.1109/DEVLRN.2010.5578835  1
2010 Lewis RL, Singh S, Barto AG. Where do rewards come from? Proceedings of the International Symposium On Ai Inspired Biology - a Symposium At the Aisb 2010 Convention. 111-116.  1
2010 Konidaris G, Kuindersmay S, Barto A, Grupen R. Constructing skill trees for reinforcement learning agents from demonstration trajectories Advances in Neural Information Processing Systems 23: 24th Annual Conference On Neural Information Processing Systems 2010, Nips 2010 1
2009 Shah A, Barto AG. Effect on movement selection of an evolving sensory representation: a multiple controller model of skill acquisition. Brain Research. 1299: 55-73. PMID 19595991 DOI: 10.1016/j.brainres.2009.07.006  1
2009 Botvinick MM, Niv Y, Barto AC. Hierarchically organized behavior and its neural foundations: a reinforcement learning perspective. Cognition. 113: 262-80. PMID 18926527 DOI: 10.1016/j.cognition.2008.08.011  1
2009 Şimşek O, Barto AG. Skill characterization based on betweenness Advances in Neural Information Processing Systems 21 - Proceedings of the 2008 Conference. 1497-1504.  1
2009 Konidaris G, Barto A. Efficient skill learning using abstraction selection Ijcai International Joint Conference On Artificial Intelligence. 1107-1112.  1
2009 Konidaris G, Barto A. Skill discovery in continuous reinforcement learning domains using skill chaining Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference. 1015-1023.  1
2008 Konidaris G, Barto A. Sensorimotor abstraction selection for efficient, autonomous robot skill acquisition 2008 Ieee 7th International Conference On Development and Learning, Icdl. 151-156. DOI: 10.1109/DEVLRN.2008.4640821  1
2008 Vigorito CM, Barto AG. Hierarchical representations of behavior for efficient creative search Aaai Spring Symposium - Technical Report. 135-141.  1
2007 Vigorito CM, Ganesan D, Barto AG. Adaptive control of duty cycling in energy-harvesting wireless sensor networks 2007 4th Annual Ieee Communications Society Conference On Sensor, Mesh and Ad Hoc Communications and Networks, Secon. 21-30. DOI: 10.1109/SAHCN.2007.4292814  1
2007 Konidaris G, Barto A. Building portable options: Skill transfer in reinforcement learning Ijcai International Joint Conference On Artificial Intelligence. 895-900.  1
2007 Ravindran B, Barto AG, Mathew V. Deictic option schemas Ijcai International Joint Conference On Artificial Intelligence. 1023-1028.  1
2007 Jonsson A, Barto A. Active learning of dynamic bayesian networks in Markov decision processes Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4612: 273-284.  1
2006 Şimşek O, Barto AG. An intrinsic reward mechanism for efficient exploration Acm International Conference Proceeding Series. 148: 833-840. DOI: 10.1145/1143844.1143949  1
2006 Konidaris G, Barto A. Autonomous shaping: Knowledge transfer in reinforcement learning Acm International Conference Proceeding Series. 148: 489-496. DOI: 10.1145/1143844.1143906  1
2006 Rosenstein MT, Barto AG, Van Emmerik REA. Learning at the level of synergies for a robot weightlifter Robotics and Autonomous Systems. 54: 706-717. DOI: 10.1016/j.robot.2006.03.002  1
2006 Ferguson K, Arroyo I, Mahadevan S, Woolf B, Barto A. Improving intelligent tutoring systems: Using expectation maximization to learn student skill levels Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4053: 453-462. DOI: 10.1007/11774303_45  1
2006 Wolfe AP, Barto AG. Decision tree methods for finding reusable MDP homomorphisms Proceedings of the National Conference On Artificial Intelligence. 1: 530-535.  1
2006 Jonsson A, Barto A. Causal graph based decomposition of factored MDPs Journal of Machine Learning Research. 7: 2259-2301.  1
2006 Konidaris G, Barto A. An adaptive robot motivational system Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 4095: 346-356.  1
2005 Berthier NE, Rosenstein MT, Barto AG. Approximate optimal control as a model for motor learning. Psychological Review. 112: 329-46. PMID 15783289 DOI: 10.1037/0033-295X.112.2.329  1
2005 Jonsson A, Johns J, Mehranian H, Arroyo I, Woolf B, Barto A, Fisher D, Mahadevan S. Evaluating the feasibility of learning student models from data Aaai Workshop - Technical Report. 1-6.  1
2005 Şimşek O, Barto AG. Learning skills in reinforcement learning using relative novelty Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 3607: 367-374.  1
2005 Jonsson A, Barto A. A causal approach to hierarchical decomposition of factored MDPs Icml 2005 - Proceedings of the 22nd International Conference On Machine Learning. 401-408.  1
2005 Şimşek Ö, Wolfe AP, Barto AG. Identifying useful subgoals in reinforcement learning by local graph partitioning Icml 2005 - Proceedings of the 22nd International Conference On Machine Learning. 817-824.  1
2005 Singh S, Barto AG, Chentanez N. Intrinsically motivated reinforcement learning Advances in Neural Information Processing Systems 1
2004 Shah A, Fagg AH, Barto AG. Cortical involvement in the recruitment of wrist muscles. Journal of Neurophysiology. 91: 2445-56. PMID 14749314 DOI: 10.1152/jn.00879.2003  1
2004 Rosenstein MT, Barto AG. Reinforcement learning with supervision by a stable controller Proceedings of the American Control Conference. 5: 4517-4522. DOI: 10.1109/ACC.2004.182663  1
2004 Şimşek O, Wolfe AP, Barto AG. Local graph partitioning as a basis for generating temporally-extended actions in reinforcement learning Aaai Workshop - Technical Report. 91-96.  1
2004 Şimşek O, Barto AG. Using relative novelty to identify useful temporal abstractions in reinforcement learning Proceedings, Twenty-First International Conference On Machine Learning, Icml 2004. 751-758.  1
2003 Barto AG, Mahadevan S. Recent Advances in Hierarchical Reinforcement Learning Discrete Event Dynamic Systems: Theory and Applications. 13: 343-379+382. DOI: 10.1023/A:1022140919877  1
2003 Ravindran B, Barto AG. SMDP homomorphisms: An algebraic approach to abstraction in semi-Markov decision processes Ijcai International Joint Conference On Artificial Intelligence. 1011-1016.  1
2003 Ravindran B, Barto AG. Relativized Options: Choosing the Right Transformation Proceedings, Twentieth International Conference On Machine Learning. 2: 608-615.  1
2003 Perkins TJ, Barto AG. Lyapunov design for safe reinforcement learning Journal of Machine Learning Research. 3: 803-832.  1
2002 Fagg AH, Shah A, Barto AG. A computational model of muscle recruitment for wrist movements. Journal of Neurophysiology. 88: 3348-58. PMID 12466451 DOI: 10.1152/jn.00621.2001  1
2002 McGovern A, Moss E, Barto AG. Building a basic block instruction scheduler with reinforcement learning and rollouts Machine Learning. 49: 141-160. DOI: 10.1023/A:1017976211990  1
2002 Kositsky M, Barto AG. The emergence of movement units through learning with noisy efferent signals and delayed sensory feedback Neurocomputing. 44: 889-895. DOI: 10.1016/S0925-2312(02)00488-5  1
2002 Ravindran B, Barto AG. Model minimization in hierarchical reinforcement learning Lecture Notes in Computer Science (Including Subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2371: 196-211.  1
2002 Kositsky M, Barto AG. The emergence of multiple movement units in the presence of noise and feedback delay Advances in Neural Information Processing Systems 1
2001 Rosenstein MT, Barto AG. Robot weightlifting by direct policy search Ijcai International Joint Conference On Artificial Intelligence. 839-844.  1
2001 Perkins TJ, Barto AG. Heuristic search in infinite state spaces guided by Lyapunov analysis Ijcai International Joint Conference On Artificial Intelligence. 242-247.  1
2001 Jonsson A, Barto AG. Automated state abstraction for options using the u-tree algorithm Advances in Neural Information Processing Systems 1
1999 Barto AG, Fagg AH, Sitkoff N, Houk JC. A cerebellar model of timing and prediction in the control of reaching. Neural Computation. 11: 565-94. PMID 10085421  1
1999 Moll R, Barto AG, Perkins TJ, Sutton RS. Learning instance-independent value functions to enhance local search Advances in Neural Information Processing Systems. 1017-1023.  1
1998 Crites RH, Barto AG. Elevator Group Control Using Multiple Reinforcement Learning Agents Machine Learning. 12: 235-262.  1
1998 Monaco JF, Ward DG, Barto AG. Automated aircraft recovery via reinforcement learning: Initial experiments Advances in Neural Information Processing Systems. 1022-1028.  1
1997 Kettner RE, Mahamud S, Leung HC, Sitkoff N, Houk JC, Peterson BW, Barto AG. Prediction of complex two-dimensional trajectories by a cerebellar model of smooth pursuit eye movement. Journal of Neurophysiology. 77: 2115-30. PMID 9114259  1
1997 Barto AG, Sutton RS. Chapter 19 Reinforcement learning in artificial intelligence Advances in Psychology. 121: 358-386. DOI: 10.1016/S0166-4115(97)80105-7  1
1997 Papka R, Callan JP, Barto AG. Text-based information retrieval using exponentiated gradient descent Advances in Neural Information Processing Systems. 3-9.  1
1997 Hansen EA, Barto AG, Zilberstein S. Reinforcement learning for mixed open-loop and closed-loop control Advances in Neural Information Processing Systems. 1026-1032.  1
1997 Duff MO, Barto AG. Local bandit approximation for optimal learning problems Advances in Neural Information Processing Systems. 1019-1025.  1
1997 Fagg AH, Sitkoff N, Barto AG, Houk JC. Cerebellar learning for control of a two-link arm in muscle space Proceedings - Ieee International Conference On Robotics and Automation. 3: 2638-2644.  1
1997 Fagg AH, Sitkoff N, Barto AG, Houk JC. Model of cerebellar learning for control of arm movements using muscle synergies Proceedings of Ieee International Symposium On Computational Intelligence in Robotics and Automation, Cira. 6-12.  1
1996 Houk JC, Buckingham JT, Barto AG. Models of the cerebellum and motor learning Behavioral and Brain Sciences. 19: 368-383.  1
1995 Barto AG, Bradtke SJ, Singh SP. Learning to act using real-time dynamic programming Artificial Intelligence. 72: 81-138. DOI: 10.1016/0004-3702(94)00011-O  1
1994 Barto AG. Reinforcement learning control. Current Opinion in Neurobiology. 4: 888-93. PMID 7888773 DOI: 10.1016/0959-4388(94)90138-4  1
1994 Gullapalli V, Barto AG, Grupen RA. Learning admittance mappings for force-guided assembly Proceedings - Ieee International Conference On Robotics and Automation. 2633-2638.  1
1993 Berthier NE, Singh SP, Barto AG, Houk JC. Distributed representation of limb motor programs in arrays of adjustable pattern generators. Journal of Cognitive Neuroscience. 5: 56-78. PMID 23972120 DOI: 10.1162/jocn.1993.5.1.56  1
1993 Houk JC, Keifer J, Barto AG. Distributed motor commands in the limb premotor network. Trends in Neurosciences. 16: 27-33. PMID 7679234 DOI: 10.1016/0166-2236(93)90049-R  1
1992 Sutton RS, Barto AG, Williams RJ. Reinforcement Learning is Direct Adaptive Optimal Control Ieee Control Systems. 12: 19-22. DOI: 10.1109/37.126844  1
1992 Gullapalli V, Grupen RA, Barto AG. Learning reactive admittance control Proceedings - Ieee International Conference On Robotics and Automation. 2: 1475-1480.  1
1991 Berthier NE, Barto AG, Moore JW. Linear systems analysis of the relationship between firing of deep cerebellar neurons and the classically conditioned nictitating membrane response in rabbits. Biological Cybernetics. 65: 99-105. PMID 1912007 DOI: 10.1007/BF00202384  1
1991 Jacobs RA, Jordan MI, Barto AG. Task decomposition through competition in a modular connectionist architecture: The what and where vision tasks Cognitive Science. 15: 219-250. DOI: 10.1016/0364-0213(91)80006-Q  1
1990 Sinkjaer T, Wu CH, Barto AG, Houk JC. Cerebellar control of endpoint position--A simulation model Ijcnn. International Joint Conference On Neural Networks. 705-710.  1
1986 Moore JW, Desmond JE, Berthier NE, Blazis DE, Sutton RS, Barto AG. Simulation of the classically conditioned nictitating membrane response by a neuron-like adaptive element: response topography, neuronal firing, and interstimulus intervals. Behavioural Brain Research. 21: 143-54. PMID 3755947 DOI: 10.1016/0166-4328(86)90092-6  1
1986 Barto AG. ADAPTIVE NEURAL NETWORKS FOR LEARNING CONTROL: SOME COMPUTATIONAL EXPERIMENTS . 170-175.  1
1986 Barto AG, Anandan P, Anderson CW. COOPERATIVITY IN NETWORKS OF PATTERN RECOGNIZING STOCHASTIC LEARNING AUTOMATA . 235-246.  1
1985 Barto AG. Learning by statistical cooperation of self-interested neuron-like computing elements. Human Neurobiology. 4: 229-56. PMID 3915497  1
1985 Barto AG, Anandan P. Pattern-Recognizing Stochastic Learning Automata Ieee Transactions On Systems, Man and Cybernetics. 360-375. DOI: 10.1109/TSMC.1985.6313371  1
1983 Barto AG, Sutton RS, Anderson CW. Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems Ieee Transactions On Systems, Man and Cybernetics. 834-846. DOI: 10.1109/TSMC.1983.6313077  1
1982 Barto AG, Anderson CW, Sutton RS. Synthesis of nonlinear control surfaces by a layered associative search network. Biological Cybernetics. 43: 175-85. PMID 7093360 DOI: 10.1007/BF00319977  1
1982 Barto AG, Sutton RS. Simulation of anticipatory responses in classical conditioning by a neuron-like adaptive element. Behavioural Brain Research. 4: 221-35. PMID 6277346 DOI: 10.1016/0166-4328(82)90001-8  1
1981 Barto AG, Sutton RS. Landmark learning: an illustration of associative search. Biological Cybernetics. 42: 1-8. PMID 7326277 DOI: 10.1007/BF00335152  1
1981 Sutton RS, Barto AG. Toward a modern theory of adaptive networks: expectation and prediction. Psychological Review. 88: 135-70. PMID 7291377 DOI: 10.1037/0033-295X.88.2.135  1
1979 Barto AG, Sutton RS, Brouwer PS. Associative search network: A reinforcement learning associative memory Biological Cybernetics. 40: 201-211. DOI: 10.1007/BF00453370  1
1978 Barto AG. A note on pattern reproduction in tessellation structures Journal of Computer and System Sciences. 16: 445-455. DOI: 10.1016/0022-0000(78)90029-6  1
Show low-probability matches.