Jean Harb

Cited by

	All	Since 2019
Citations	6698	6200
h-index	9	9
i10-index	9	9

Citations per year

Year	Citations
2017	78
2018	264
2019	535
2020	800
2021	996
2022	1299
2023	1671
2024	898

Public access

2 articles available, 0 articles not available (based on funding mandates)

Co-authors

Doina Precup	DeepMind and McGill University	Verified email at cs.mcgill.ca
Pierre-Luc Bacon	University of Montreal	Verified email at mila.quebec
Pieter Abbeel	UC Berkeley | Covariant	Verified email at cs.berkeley.edu
Yi Wu	Institute for Interdisciplinary Information Sciences, Tsinghua University	Verified email at mail.tsinghua.edu.cn
Ryan Lowe	OpenAI	Verified email at openai.com
Igor Mordatch	Google DeepMind	Verified email at google.com
Aviv Tamar	Technion	Verified email at technion.ac.il

Jean Harb

OpenAI

Verified email at openai.com

Interests: Machine Learning, Reinforcement Learning, Deep Learning


Title	Cited by	Year
Multi-agent actor-critic for mixed cooperative-competitive environments. R Lowe, Y Wu, A Tamar, J Harb, P Abbeel, I Mordatch. Advances in Neural Information Processing Systems 30, 2017	4943	2017
The option-critic architecture. PL Bacon, J Harb, D Precup. Proceedings of the AAAI Conference on Artificial Intelligence 31 (1), 2017	1243	2017
Investigating recurrence and eligibility traces in deep Q-networks. J Harb, D Precup. arXiv preprint arXiv:1704.05495, 2017	187	2017
When waiting is not an option: Learning options with a deliberation cost. J Harb, PL Bacon, M Klissarov, D Precup. Proceedings of the AAAI Conference on Artificial Intelligence 32 (1), 2018	162	2018
Learnings options end-to-end for continuous action tasks. M Klissarov, PL Bacon, J Harb, D Precup. arXiv preprint arXiv:1712.00004, 2017	62	2017
Policy evaluation networks. J Harb, T Schaul, D Precup, PL Bacon. arXiv preprint arXiv:2002.11833, 2020	38	2020
Waymax: An accelerated, data-driven simulator for large-scale autonomous driving research. C Gulino, J Fu, W Luo, G Tucker, E Bronstein, Y Lu, J Harb, X Pan, ... Advances in Neural Information Processing Systems 36, 2024	37	2024
The Barbados 2018 list of open issues in continual learning. T Schaul, H van Hasselt, J Modayil, M White, A White, PL Bacon, J Harb, ... arXiv preprint arXiv:1811.07004, 2018	15	2018
General policy evaluation and improvement by learning to identify few but crucial states. F Faccio, A Ramesh, V Herrmann, J Harb, J Schmidhuber. arXiv preprint arXiv:2207.01566, 2022	10	2022
Learning options in deep reinforcement learning. J Merheb-Harb. McGill University (Canada), 2016	1	2016
Asynchronous Advantage Option-Critic with Deliberation Cost. J Harb, PL Bacon, D Precup. RLDM, 2017		2017


Articles 1–11
