George Tucker

引用先

	すべて	2019 年以来
引用	22637	21209
h 指標	41	40
i10 指標	58	56

7000

3500

1750

5250

2015201620172018201920202021202220232024120 225 324 607 1068 1884 3137 3958 4878 6259

オープンアクセス

すべて表示

17 件の論文

0 件の論文

利用可能

利用不可

助成機関の要件に基づく

共著者

Sergey LevineUC Berkeley, Physical Intelligence確認したメールアドレス: eecs.berkeley.edu
Aviral KumarCMU & Google DeepMind確認したメールアドレス: andrew.cmu.edu
Ofir NachumOpenAI確認したメールアドレス: openai.com
Justin FuUC Berkeley確認したメールアドレス: berkeley.edu
Bonnie BergerMIT確認したメールアドレス: mit.edu
Aurick ZhouWaymo確認したメールアドレス: berkeley.edu
Dieterich LawsonStanford University確認したメールアドレス: stanford.edu
Mohammad NorouziIdeogram確認したメールアドレス: ideogram.ai
Po-Ru LohBrigham and Women's Hospital / Harvard Medical School確認したメールアドレス: broadinstitute.org
Tuomas HaarnojaDeepMind確認したメールアドレス: google.com
Sehoon HaGeorgia Institute of Technology確認したメールアドレス: gatech.edu
Chris J. MaddisonUniversity of Toronto確認したメールアドレス: cs.toronto.edu
Łukasz KaiserOpenAI & CNRS確認したメールアドレス: openai.com
Andriy MnihResearch Scientist at Google DeepMind確認したメールアドレス: cs.toronto.edu
Chelsea FinnStanford University, Physical Intelligence確認したメールアドレス: cs.stanford.edu
Jie TanGoogle DeepMind確認したメールアドレス: google.com
Henryk MichalewskiGoogle確認したメールアドレス: google.com
Dumitru ErhanDirector of Research @ Google DeepMind確認したメールアドレス: google.com
Jascha Sohl-DicksteinAnthropic確認したメールアドレス: anthropic.com
Piotr KozakowskiUniversity of Warsaw確認したメールアドレス: mimuw.edu.pl

フォロー

George Tucker

Google Brain

確認したメールアドレス: google.com - ホームページ

Reinforcement Learning


タイトル引用回数順公開年順タイトル順	引用先引用先	年
Soft actor-critic algorithms and applications T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ... arXiv preprint arXiv:1812.05905, 2018	2787	2018
Offline reinforcement learning: Tutorial, review, and perspectives on open problems S Levine, A Kumar, G Tucker, J Fu arXiv preprint arXiv:2005.01643, 2020	2050*	2020
Conservative q-learning for offline reinforcement learning A Kumar, A Zhou, G Tucker, S Levine NeurIPS 2020, 2020	1753	2020
Efficient Bayesian mixed-model analysis increases association power in large cohorts PR Loh, G Tucker, BK Bulik-Sullivan, BJ Vilhjálmsson, HK Finucane, ... Nature genetics 47 (3), 284-290, 2015	1603	2015
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023	1419	2023
Regularizing neural networks by penalizing confident output distributions G Pereyra, G Tucker, J Chorowski, Ł Kaiser, G Hinton ICLR 2017 Workshop, 2017	1258	2017
D4rl: Datasets for deep data-driven reinforcement learning J Fu, A Kumar, O Nachum, G Tucker, S Levine arXiv preprint arXiv:2004.07219, 2020	1079	2020
Stabilizing off-policy q-learning via bootstrapping error reduction A Kumar, J Fu, G Tucker, S Levine NeurIPS 2019, 2019	1057	2019
Model-based reinforcement learning for atari L Kaiser, M Babaeizadeh, P Milos, B Osinski, RH Campbell, ... ICLR 2020 Spotlight, 2020	969	2020
On variational bounds of mutual information B Poole, S Ozair, A Van Den Oord, A Alemi, G Tucker International Conference on Machine Learning, 5171-5180, 2019	898	2019
Behavior regularized offline reinforcement learning Y Wu, G Tucker, O Nachum arXiv preprint arXiv:1911.11361, 2019	732	2019
Widespread macromolecular interaction perturbations in human genetic disorders N Sahni, S Yi, M Taipale, JIF Bass, J Coulombe-Huntington, F Yang, ... Cell 161 (3), 647-660, 2015	582	2015
Learning to walk via deep reinforcement learning T Haarnoja, S Ha, A Zhou, J Tan, G Tucker, S Levine RSS 2019, 2019	537	2019
A quantitative chaperone interaction network reveals the architecture of cellular protein homeostasis pathways M Taipale, G Tucker, J Peng, I Krykbaeva, ZY Lin, B Larsen, H Choi, ... Cell 158 (2), 434-448, 2014	445	2014
Deep bayesian bandits showdown: An empirical comparison of bayesian deep networks for thompson sampling C Riquelme, G Tucker, J Snoek ICLR 2018, 2018	441*	2018
Gemma: Open models based on gemini research and technology G Team, T Mesnard, C Hardin, R Dadashi, S Bhupatiraju, S Pathak, ... arXiv preprint arXiv:2403.08295, 2024	403	2024
Sample-efficient reinforcement learning with stochastic ensemble value expansion J Buckman, D Hafner, G Tucker, E Brevdo, H Lee NeurIPS 2018 Oral, 2018	390	2018
Rebar: Low-variance, unbiased gradient estimates for discrete latent variable models G Tucker, A Mnih, CJ Maddison, D Lawson, J Sohl-Dickstein NIPS 2017 Oral, 2017	351	2017
Don't blame the elbo! a linear vae perspective on posterior collapse J Lucas, G Tucker, RB Grosse, M Norouzi Advances in Neural Information Processing Systems 32, 2019	330*	2019
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024	320	2024

現在システムで処理を実行できません。しばらくしてからもう一度お試しください。

論文 1–20

年間引用数

重複した引用

結合された引用

共著者を追加共著者

フォロー

引用先

共著者