David Silver - Publications

Affiliations: 
2009 University of Alberta, Edmonton, Alberta, Canada 

20 high-probability publications. We are testing a new system for linking publications to authors. You can help! If you notice any inaccuracies, please sign in and mark papers as correct or incorrect matches. If you identify any major omissions or other inaccuracies in the publication list, please let us know.

Year Citation  Score
2023 Mankowitz DJ, Michi A, Zhernov A, Gelmi M, Selvi M, Paduraru C, Leurent E, Iqbal S, Lespiau JB, Ahern A, Köppe T, Millikin K, Gaffney S, Elster S, Broshear J, ... ... Silver D, et al. Faster sorting algorithms discovered using deep reinforcement learning. Nature. 618: 257-263. PMID 37286649 DOI: 10.1038/s41586-023-06004-9  0.339
2022 Perolat J, De Vylder B, Hennes D, Tarassov E, Strub F, de Boer V, Muller P, Connor JT, Burch N, Anthony T, McAleer S, Elie R, Cen SH, Wang Z, Gruslys A, ... ... Silver D, et al. Mastering the game of Stratego with model-free multiagent reinforcement learning. Science (New York, N.Y.). 378: 990-996. PMID 36454847 DOI: 10.1126/science.add4679  0.41
2022 Matsuo Y, LeCun Y, Sahani M, Precup D, Silver D, Sugiyama M, Uchibe E, Morimoto J. Deep learning, reinforcement learning, and world models. Neural Networks : the Official Journal of the International Neural Network Society. 152: 267-275. PMID 35569196 DOI: 10.1016/j.neunet.2022.03.037  0.65
2020 Schrittwieser J, Antonoglou I, Hubert T, Simonyan K, Sifre L, Schmitt S, Guez A, Lockhart E, Hassabis D, Graepel T, Lillicrap T, Silver D. Mastering Atari, Go, chess and shogi by planning with a learned model. Nature. 588: 604-609. PMID 33361790 DOI: 10.1038/s41586-020-03051-4  0.33
2020 Barreto A, Hou S, Borsa D, Silver D, Precup D. Fast reinforcement learning with generalized policy updates. Proceedings of the National Academy of Sciences of the United States of America. PMID 32817541 DOI: 10.1073/Pnas.1907370117  0.644
2019 Vinyals O, Babuschkin I, Czarnecki WM, Mathieu M, Dudzik A, Chung J, Choi DH, Powell R, Ewalds T, Georgiev P, Oh J, Horgan D, Kroiss M, Danihelka I, Huang A, ... ... Silver D, et al. Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nature. PMID 31666705 DOI: 10.1038/S41586-019-1724-Z  0.432
2019 Jaderberg M, Czarnecki WM, Dunning I, Marris L, Lever G, Castañeda AG, Beattie C, Rabinowitz NC, Morcos AS, Ruderman A, Sonnerat N, Green T, Deason L, Leibo JZ, Silver D, et al. Human-level performance in 3D multiplayer games with population-based reinforcement learning. Science (New York, N.Y.). 364: 859-865. PMID 31147514 DOI: 10.1126/Science.Aau6249  0.502
2018 Silver D, Hubert T, Schrittwieser J, Antonoglou I, Lai M, Guez A, Lanctot M, Sifre L, Kumaran D, Graepel T, Lillicrap T, Simonyan K, Hassabis D. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science (New York, N.Y.). 362: 1140-1144. PMID 30523106 DOI: 10.1126/Science.Aar6404  0.432
2018 Sun R, Silver D, Tesauro G, Huang GB. Introduction to the special issue on deep reinforcement learning: An editorial. Neural Networks : the Official Journal of the International Neural Network Society. PMID 30122431 DOI: 10.1016/J.Neunet.2018.08.001  0.535
2017 Silver D, Schrittwieser J, Simonyan K, Antonoglou I, Huang A, Guez A, Hubert T, Baker L, Lai M, Bolton A, Chen Y, Lillicrap T, Hui F, Sifre L, van den Driessche G, et al. Mastering the game of Go without human knowledge. Nature. 550: 354-359. PMID 29052630 DOI: 10.1038/Nature24270  0.456
2016 Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, et al. Mastering the game of Go with deep neural networks and tree search. Nature. 529: 484-9. PMID 26819042 DOI: 10.1038/Nature16961  0.383
2015 Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G, Petersen S, Beattie C, Sadik A, Antonoglou I, King H, et al. Human-level control through deep reinforcement learning. Nature. 518: 529-33. PMID 25719670 DOI: 10.1038/Nature14236  0.528
2013 Guez A, Silver D, Dayan P. Scalable and efficient bayes-adaptive reinforcement learning based on Monte-Carlo tree search Journal of Artificial Intelligence Research. 48: 841-883. DOI: 10.1613/Jair.4117  0.433
2012 Branavan SRK, Silver D, Barzilay R. Learning to win by reading manuals in a monte-carlo framework Journal of Artificial Intelligence Research. 43: 661-704. DOI: 10.1613/Jair.3484  0.377
2012 Gelly S, Kocsis L, Schoenauer M, Sebag M, Silver D, Szepesvári C, Teytaud O. The grand challenge of computer Go: Monte Carlo tree search and extensions Communications of the Acm. 55: 106-113. DOI: 10.1145/2093548.2093574  0.309
2012 Silver D, Sutton RS, Müller M. Temporal-difference search in computer Go Machine Learning. 87: 183-219. DOI: 10.1007/S10994-012-5280-0  0.602
2011 Miao J, Fleury AC, Kushnir CL, Silver DF, Naik R, Spirtos NM. Post fellowship training in "new-to-them" surgical techniques: assessment of learning curve characteristics. Gynecologic Oncology. 121: 620-4. PMID 21444106 DOI: 10.1016/j.ygyno.2011.02.036  0.304
2009 Sutton RS, Maei HR, Precup D, Bhatnagar S, Silver D, Szepesvári C, Wiewiora E. Fast gradient-descent methods for temporal-difference learning with linear function approximation Proceedings of the 26th International Conference On Machine Learning, Icml 2009. 993-1000. DOI: 10.1145/1553374.1553501  0.554
2008 Silver D, Sutton RS, Müller M. Sample-based learning and search with permanent and transient memories Proceedings of the 25th International Conference On Machine Learning. 968-975.  0.527
2007 Sutton RS, Koop A, Silver D. On the role of tracking in stationary environments Acm International Conference Proceeding Series. 227: 871-878. DOI: 10.1145/1273496.1273606  0.375
Show low-probability matches.