Huang Jiawei

Cited by

	All	Since 2019
Citations	447	447
h-index	7	7
i10-index	7	7

160

120

2019202020212022202320247 30 77 118 151 64

Public access

View all

4 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Nan JiangAssistant Professor of Computer Science, UIUCVerified email at illinois.edu
Masatoshi UeharaGenentechVerified email at gene.com
Ningning MaNIOVerified email at ust.hk
Xiangyu ZhangPrincipal Researcher, MEGVII TechnologyVerified email at megvii.com
Jian SunChief Scientist of Megvii, Managing Director of Megvii ResearchVerified email at megvii.com
Chengchun ShiLondon School of Economics and Political ScienceVerified email at lse.ac.uk
Li ZhaoResearcherVerified email at microsoft.com
Tao QinSenior Principal Research Manager, Microsoft ResearchVerified email at microsoft.com
Tie-Yan LiuDistinguished Scientist, Microsoft Research AI4Science | IEEE Fellow | ACM Fellow | AAIA FellowVerified email at microsoft.com
Jinglin ChenUniversity of Illinois Urbana-ChampaignVerified email at illinois.edu
Niao HeETH ZürichVerified email at inf.ethz.ch
Batuhan YardimETH ZurichVerified email at ethz.ch
Wei Chen （陈卫）Microsoft ResearchVerified email at microsoft.com

Huang Jiawei

ETH Zurich

Verified email at inf.ethz.ch - Homepage

Machine Learning Reinforcement Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Minimax weight and q-function learning for off-policy evaluation M Uehara, J Huang, N Jiang International Conference on Machine Learning, 9659-9668, 2019	178	2019
Weightnet: Revisiting the design space of weight networks N Ma, X Zhang, J Huang, J Sun European Conference on Computer Vision, 776-792, 2020	99	2020
Minimax value interval for off-policy evaluation and policy optimization N Jiang, J Huang Advances in Neural Information Processing Systems 33, 2747-2758, 2020	75	2020
A minimax learning approach to off-policy evaluation in confounded Partially Observable Markov Decision Processes C Shi, M Uehara, J Huang, N Jiang International Conference on Machine Learning, 2022	31*	2022
From Importance Sampling to Doubly Robust Policy Gradient J Huang, N Jiang International Conference on Machine Learning, 4434-4443, 2019	26	2019
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality J Huang, J Chen, L Zhao, T Qin, N Jiang, TY Liu International Conference on Learning Representations 2022, 2022	24	2022
On the convergence rate of off-policy policy optimization methods with density-ratio correction J Huang, N Jiang International Conference on Artificial Intelligence and Statistics, 2658-2705, 2022	10*	2022
On the Statistical Efficiency of Mean-Field Reinforcement Learning with General Function Approximation J Huang, B Yardim, N He International Conference on Artificial Intelligence and Statistics, 289-297, 2024	2	2024
Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret J Huang, L Zhao, T Qin, W Chen, N Jiang, TY Liu Advances in Neural Information Processing Systems 35, 2022	2	2022
Robust Knowledge Transfer in Tiered Reinforcement Learning J Huang, N He Advances in Neural Information Processing Systems 36, 2024		2024
Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL J Huang, N He, A Krause arXiv preprint arXiv:2402.05724, 2024		2024

The system can't perform the operation now. Try again later.

Articles 1–11

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors