フォロー
Arthur Guez
Arthur Guez
Google DeepMind
確認したメール アドレス: google.com - ホームページ
タイトル
引用先
引用先
Mastering the game of Go with deep neural networks and tree search
D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ...
nature 529 (7587), 484-489, 2016
182962016
Mastering the game of go without human knowledge
D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ...
nature 550 (7676), 354-359, 2017
103932017
Deep reinforcement learning with double q-learning
H Van Hasselt, A Guez, D Silver
Proceedings of the AAAI conference on artificial intelligence 30 (1), 2016
86102016
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
Science 362 (6419), 1140-1144, 2018
41462018
Mastering atari, go, chess and shogi by planning with a learned model
J Schrittwieser, I Antonoglou, T Hubert, K Simonyan, L Sifre, S Schmitt, ...
Nature 588 (7839), 604-609, 2020
21242020
Mastering chess and shogi by self-play with a general reinforcement learning algorithm
D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ...
arXiv preprint arXiv:1712.01815, 2017
20902017
Imagination-augmented agents for deep reinforcement learning
S Racanière, T Weber, D Reichert, L Buesing, A Guez, ...
Advances in neural information processing systems 30, 2017
4422017
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
3482023
The predictron: End-to-end learning and planning
D Silver, H Hasselt, M Hessel, T Schaul, A Guez, T Harley, ...
International Conference on Machine Learning, 3191-3199, 2017
2912017
Imagination-augmented agents for deep reinforcement learning
T Weber, S Racaniere, DP Reichert, L Buesing, A Guez, DJ Rezende, ...
arXiv preprint arXiv:1707.06203, 2017
2442017
Efficient Bayes-adaptive reinforcement learning using sample-based search
A Guez, D Silver, P Dayan
Advances in neural information processing systems 25, 2012
1872012
Learning values across many orders of magnitude
HP van Hasselt, A Guez, M Hessel, V Mnih, D Silver
Advances In Neural Information Processing Systems, 4287-4295, 2016
1812016
Increasing the action gap: New operators for reinforcement learning
MG Bellemare, G Ostrovski, A Guez, P Thomas, R Munos
Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016
1672016
Sifre L Van Den Driessche G Schrittwieser J Antonoglou I Panneershelvam V Lanctot M et al
SDHAM CJ, A Guez
Mastering the game of go with deep neural networks and tree search Nature …, 2016
1572016
Woulda, coulda, shoulda: Counterfactually-guided policy search
L Buesing, T Weber, Y Zwols, S Racaniere, A Guez, JB Lespiau, N Heess
arXiv preprint arXiv:1811.06272, 2018
1452018
Adaptive Treatment of Epilepsy via Batch-mode Reinforcement Learning.
A Guez, RD Vincent, M Avoli, J Pineau
AAAI 8, 1671-1678, 2008
1262008
Treating epilepsy via adaptive neurostimulation: a reinforcement learning approach
J Pineau, A Guez, R Vincent, G Panuccio, M Avoli
International journal of neural systems 19 (04), 227-240, 2009
1062009
Scalable and efficient Bayes-adaptive reinforcement learning based on Monte-Carlo tree search
A Guez, D Silver, P Dayan
Journal of Artificial Intelligence Research 48, 841-883, 2013
1042013
& Hassabis, D.(2016). Mastering the game of Go with deep neural networks and tree search
D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche
Nature 529 (7587), 484-489, 0
104
Learning to search with mctsnets
A Guez, T Weber, I Antonoglou, K Simonyan, O Vinyals, D Wierstra, ...
International conference on machine learning, 1822-1831, 2018
922018
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20