Vivek Veeriah

Cited by

	All	Since 2019
Citations	1035	770
h-index	11	10
i10-index	14	11

Citations per year

Year	Citations
2013	6
2014	9
2015	6
2016	30
2017	96
2018	103
2019	119
2020	129
2021	133
2022	153
2023	157
2024	78

Public access

5 articles available, 0 articles not available (based on funding mandates)

Co-authors

Name	Affiliation	Email
Satinder Singh	Google DeepMind / U. of Michigan	Verified email at umich.edu
Junhyuk Oh	Research Scientist, DeepMind	Verified email at google.com
Richard S. Sutton	Keen, Amii, and University of Alberta	Verified email at richsutton.com
Guo-Jun Qi (齐国君), Fellow of IEEE &...	Computer Science, University of Central Florida	Verified email at ucf.edu
Zhongwen Xu	Tencent	Verified email at tencent.com
David Silver	DeepMind, UCL	Verified email at google.com
Hado van Hasselt	Research Scientist, DeepMind; Honorary Professor, UCL	Verified email at google.com
Matteo Hessel	Research Engineer, Google DeepMind	Verified email at google.com
Tom Zahavy	Staff Research Scientist, Google DeepMind	Verified email at deepmind.com
Richard L. Lewis	Professor of Psychology, Linguistics and Cognitive Science, University of Michigan	Verified email at umich.edu
Naifan Zhuang	PhD student of Department of Computer Science, University of Central Florida	Verified email at knights.ucf.edu
Janarthanan Rajendran	Assistant Professor, Faculty of Computer Science, Dalhousie University	Verified email at umich.edu
Patrick M. Pilarski	Professor, University of Alberta, Amii (Alberta Machine Intelligence Institute)	Verified email at ualberta.ca
Iurii Kemaev	DeepMind	Verified email at deepmind.com
Alex Kearney	PhD Candidate, University of Alberta	Verified email at ualberta.ca
Jaden Travnik	University of Alberta, Sony AI	Verified email at ualberta.ca
Shangtong Zhang	University of Virginia	Verified email at virginia.edu
Zeyu Zheng	DeepMind	Verified email at deepmind.com
Nenad Tomasev	Google DeepMind	Verified email at google.com
Matthew Lai	DeepMind	Verified email at google.com

Vivek Veeriah

Google DeepMind

Verified email at google.com

Reinforcement learning, MCTS, Artificial Intelligence, Language Models, Planning


Title	Authors	Venue	Cited by	Year
Differential recurrent neural networks for action recognition	V Veeriah, N Zhuang, GJ Qi	Proceedings of the IEEE international conference on computer vision, 4041-4049, 2015	593	2015
Discovery of useful questions as auxiliary tasks	V Veeriah, M Hessel, Z Xu, J Rajendran, RL Lewis, J Oh, HP van Hasselt, ...	Advances in Neural Information Processing Systems 32, 2019	94	2019
A self-tuning actor-critic algorithm	T Zahavy, Z Xu, V Veeriah, M Hessel, J Oh, HP van Hasselt, D Silver, ...	Advances in neural information processing systems 33, 20913-20924, 2020	83	2020
Many-goals reinforcement learning	V Veeriah, J Oh, S Singh	arXiv preprint arXiv:1806.09605, 2018	56	2018
Discovery of options via meta-learned subgoals	V Veeriah, T Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, HP van Hasselt, ...	Advances in Neural Information Processing Systems 34, 29861-29873, 2021	34	2021
Face valuing: Training user interfaces with facial expressions and reinforcement learning	V Veeriah, PM Pilarski, RS Sutton	arXiv preprint arXiv:1606.02807, 2016	28	2016
Robust hand gesture recognition algorithm for simple mouse control	V Veeriah, PL Swaminathan	International Journal of Computer and Communication Engineering 2 (2), 219, 2013	26	2013
Deep Learning Architecture with Dynamically Programmed Layers for Brain Connectome Prediction	V Veeriah J, R Durvasula, GJ Qi	ACM KDD 2015, 2015	21	2015
Tidbd: Adapting temporal-difference step-sizes through stochastic meta-descent	A Kearney, V Veeriah, JB Travnik, RS Sutton, PM Pilarski	arXiv preprint arXiv:1804.03334, 2018	17	2018
Diversifying ai: Towards creative chess with alphazero	T Zahavy, V Veeriah, S Hou, K Waugh, M Lai, E Leurent, N Tomasev, ...	arXiv preprint arXiv:2308.09175, 2023	15	2023
Reload: Reinforcement learning with optimistic ascent-descent for last-iterate convergence in constrained mdps	T Moskovitz, B O’Donoghue, V Veeriah, S Flennerhag, S Singh, T Zahavy	International Conference on Machine Learning, 25303-25336, 2023	13	2023
How Should an Agent Practice?	J Rajendran, R Lewis, V Veeriah, H Lee, S Singh	Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 5454-5461, 2020	11	2020
Learning feature relevance through step size adaptation in temporal-difference learning	A Kearney, V Veeriah, J Travnik, PM Pilarski, RS Sutton	arXiv preprint arXiv:1903.03252, 2019	11	2019
Forward actor-critic for nonlinear function approximation in reinforcement learning	V Veeriah, H van Seijen, RS Sutton	Proceedings of the 16th Conference on Autonomous Agents and MultiAgent …, 2017	11	2017
Crossprop: Learning representations by stochastic meta-gradient descent in neural networks	V Veeriah, S Zhang, RS Sutton	Machine Learning and Knowledge Discovery in Databases: European Conference …, 2017	9	2017
Learning state representations from random deep action-conditional predictions	Z Zheng, V Veeriah, R Vuorio, RL Lewis, S Singh	Advances in Neural Information Processing Systems 34, 23679-23691, 2021	6	2021
Grasp: Gradient-based affordance selection for planning	V Veeriah, Z Zheng, R Lewis, S Singh	arXiv preprint arXiv:2202.04772, 2022	4	2022
Learning options for action selection with meta-gradients in multi-task reinforcement learning	VVJ Veeraiah, TBZ Zahavy, M Hessel, Z Xu, J Oh, I Kemaev, ...	US Patent App. 17/918,365, 2023	1	2023
Discovery in Reinforcement Learning	V Veeriah		1	2022
Learning representations by stochastic meta-gradient descent in neural networks	V Veeriah, S Zhang, RS Sutton	arXiv preprint arXiv:1612.02879, 2016	1	2016
