Follow
Huang Jiawei
Title
Cited by
Cited by
Year
Minimax weight and q-function learning for off-policy evaluation
M Uehara, J Huang, N Jiang
International Conference on Machine Learning, 9659-9668, 2019
892019
Weightnet: Revisiting the design space of weight networks
N Ma, X Zhang, J Huang, J Sun
European Conference on Computer Vision, 776-792, 2020
302020
Minimax value interval for off-policy evaluation and policy optimization
N Jiang, J Huang
Advances in Neural Information Processing Systems 33, 2747-2758, 2020
302020
From Importance Sampling to Doubly Robust Policy Gradient
J Huang, N Jiang
International Conference on Machine Learning, 4434-4443, 2019
132019
On the Convergence Rate of Off-Policy Policy Optimization Methods with Density-Ratio Correction
J Huang, N Jiang
arXiv preprint arXiv:2106.00993, 2021
22021
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality
J Huang, J Chen, L Zhao, T Qin, N Jiang, TY Liu
International Conference on Learning Representations, 2021
12021
The system can't perform the operation now. Try again later.
Articles 1–6