Year |
Citation |
Score |
2023 |
Mankowitz DJ, Michi A, Zhernov A, Gelmi M, Selvi M, Paduraru C, Leurent E, Iqbal S, Lespiau JB, Ahern A, Köppe T, Millikin K, Gaffney S, Elster S, Broshear J, ... ... Silver D, et al. Faster sorting algorithms discovered using deep reinforcement learning. Nature. 618: 257-263. PMID 37286649 DOI: 10.1038/s41586-023-06004-9 |
0.339 |
|
2022 |
Perolat J, De Vylder B, Hennes D, Tarassov E, Strub F, de Boer V, Muller P, Connor JT, Burch N, Anthony T, McAleer S, Elie R, Cen SH, Wang Z, Gruslys A, ... ... Silver D, et al. Mastering the game of Stratego with model-free multiagent reinforcement learning. Science (New York, N.Y.). 378: 990-996. PMID 36454847 DOI: 10.1126/science.add4679 |
0.41 |
|
2022 |
Matsuo Y, LeCun Y, Sahani M, Precup D, Silver D, Sugiyama M, Uchibe E, Morimoto J. Deep learning, reinforcement learning, and world models. Neural Networks : the Official Journal of the International Neural Network Society. 152: 267-275. PMID 35569196 DOI: 10.1016/j.neunet.2022.03.037 |
0.65 |
|
2020 |
Schrittwieser J, Antonoglou I, Hubert T, Simonyan K, Sifre L, Schmitt S, Guez A, Lockhart E, Hassabis D, Graepel T, Lillicrap T, Silver D. Mastering Atari, Go, chess and shogi by planning with a learned model. Nature. 588: 604-609. PMID 33361790 DOI: 10.1038/s41586-020-03051-4 |
0.33 |
|
2020 |
Barreto A, Hou S, Borsa D, Silver D, Precup D. Fast reinforcement learning with generalized policy updates. Proceedings of the National Academy of Sciences of the United States of America. PMID 32817541 DOI: 10.1073/pnas.1907370117 |
0.644 |
|
2019 |
Vinyals O, Babuschkin I, Czarnecki WM, Mathieu M, Dudzik A, Chung J, Choi DH, Powell R, Ewalds T, Georgiev P, Oh J, Horgan D, Kroiss M, Danihelka I, Huang A, ... ... Silver D, et al. Grandmaster level in StarCraft II using multi-agent reinforcement learning. Nature. PMID 31666705 DOI: 10.1038/s41586-019-1724-z |
0.432 |
|
2019 |
Jaderberg M, Czarnecki WM, Dunning I, Marris L, Lever G, Castañeda AG, Beattie C, Rabinowitz NC, Morcos AS, Ruderman A, Sonnerat N, Green T, Deason L, Leibo JZ, Silver D, et al. Human-level performance in 3D multiplayer games with population-based reinforcement learning. Science (New York, N.Y.). 364: 859-865. PMID 31147514 DOI: 10.1126/science.aau6249 |
0.502 |
|
2018 |
Silver D, Hubert T, Schrittwieser J, Antonoglou I, Lai M, Guez A, Lanctot M, Sifre L, Kumaran D, Graepel T, Lillicrap T, Simonyan K, Hassabis D. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science (New York, N.Y.). 362: 1140-1144. PMID 30523106 DOI: 10.1126/science.aar6404 |
0.432 |
|
2018 |
Sun R, Silver D, Tesauro G, Huang GB. Introduction to the special issue on deep reinforcement learning: An editorial. Neural Networks : the Official Journal of the International Neural Network Society. PMID 30122431 DOI: 10.1016/j.neunet.2018.08.001 |
0.535 |
|
2017 |
Silver D, Schrittwieser J, Simonyan K, Antonoglou I, Huang A, Guez A, Hubert T, Baker L, Lai M, Bolton A, Chen Y, Lillicrap T, Hui F, Sifre L, van den Driessche G, et al. Mastering the game of Go without human knowledge. Nature. 550: 354-359. PMID 29052630 DOI: 10.1038/nature24270 |
0.456 |
|
2016 |
Silver D, Huang A, Maddison CJ, Guez A, Sifre L, van den Driessche G, Schrittwieser J, Antonoglou I, Panneershelvam V, Lanctot M, Dieleman S, Grewe D, Nham J, Kalchbrenner N, Sutskever I, et al. Mastering the game of Go with deep neural networks and tree search. Nature. 529: 484-489. PMID 26819042 DOI: 10.1038/nature16961 |
0.383 |
|
2015 |
Mnih V, Kavukcuoglu K, Silver D, Rusu AA, Veness J, Bellemare MG, Graves A, Riedmiller M, Fidjeland AK, Ostrovski G, Petersen S, Beattie C, Sadik A, Antonoglou I, King H, et al. Human-level control through deep reinforcement learning. Nature. 518: 529-533. PMID 25719670 DOI: 10.1038/nature14236 |
0.528 |
|
2013 |
Guez A, Silver D, Dayan P. Scalable and efficient Bayes-adaptive reinforcement learning based on Monte-Carlo tree search. Journal of Artificial Intelligence Research. 48: 841-883. DOI: 10.1613/jair.4117 |
0.433 |
|
2012 |
Branavan SRK, Silver D, Barzilay R. Learning to win by reading manuals in a Monte-Carlo framework. Journal of Artificial Intelligence Research. 43: 661-704. DOI: 10.1613/jair.3484 |
0.377 |
|
2012 |
Gelly S, Kocsis L, Schoenauer M, Sebag M, Silver D, Szepesvári C, Teytaud O. The grand challenge of computer Go: Monte Carlo tree search and extensions. Communications of the ACM. 55: 106-113. DOI: 10.1145/2093548.2093574 |
0.309 |
|
2012 |
Silver D, Sutton RS, Müller M. Temporal-difference search in computer Go. Machine Learning. 87: 183-219. DOI: 10.1007/s10994-012-5280-0 |
0.602 |
|
2011 |
Miao J, Fleury AC, Kushnir CL, Silver DF, Naik R, Spirtos NM. Post fellowship training in "new-to-them" surgical techniques: assessment of learning curve characteristics. Gynecologic Oncology. 121: 620-624. PMID 21444106 DOI: 10.1016/j.ygyno.2011.02.036 |
0.304 |
|
2009 |
Sutton RS, Maei HR, Precup D, Bhatnagar S, Silver D, Szepesvári C, Wiewiora E. Fast gradient-descent methods for temporal-difference learning with linear function approximation. Proceedings of the 26th International Conference on Machine Learning, ICML 2009. 993-1000. DOI: 10.1145/1553374.1553501 |
0.554 |
|
2008 |
Silver D, Sutton RS, Müller M. Sample-based learning and search with permanent and transient memories. Proceedings of the 25th International Conference on Machine Learning. 968-975. |
0.527 |
|
2007 |
Sutton RS, Koop A, Silver D. On the role of tracking in stationary environments. ACM International Conference Proceeding Series. 227: 871-878. DOI: 10.1145/1273496.1273606 |
0.375 |
|