Follow
Yaqi Duan
Yaqi Duan
Princeton University
Verified email at mit.edu - Homepage
Title
Cited by
Cited by
Year
Minimax-optimal off-policy evaluation with linear function approximation
Y Duan, M Wang
International Conference on Machine Learning, 2701-2709, 2020
692020
State aggregation learning from Markov transition data
Y Duan, T Ke, M Wang
Advances in Neural Information Processing Systems, 4486-4495, 2019
342019
Risk bounds and Rademacher complexity in batch reinforcement learning
Y Duan, C Jin, Z Li
International Conference on Machine Learning, 2892-2902, 2021
162021
Sparse feature selection makes batch reinforcement learning more sample efficient
B Hao, Y Duan, T Lattimore, C Szepesvári, M Wang
International Conference on Machine Learning, 4063-4073, 2021
142021
Bootstrapping fitted Q-evaluation for off-policy inference
B Hao, X Ji, Y Duan, H Lu, C Szepesvari, M Wang
International Conference on Machine Learning, 4074-4084, 2021
11*2021
Optimal policy evaluation using kernel-based temporal difference methods
Y Duan, M Wang, MJ Wainwright
arXiv preprint arXiv:2109.12002, 2021
92021
Near-optimal offline reinforcement learning with linear representation: leveraging variance information with pessimism
M Yin, Y Duan, M Wang, YX Wang
International Conference on Learning Representations, 2022
82022
Learning low-dimensional state embeddings and metastable clusters from time series data
Y Sun, Y Duan, H Gong, M Wang
Advances in Neural Information Processing Systems, 4561-4570, 2019
82019
Adaptive low-nonnegative-rank approximation for state aggregation of Markov chains
Y Duan, M Wang, Z Wen, Y Yuan
SIAM Journal on Matrix Analysis and Applications 41 (1), 244-278, 2020
72020
Learning good state and action representations via tensor decomposition
C Ni, A Zhang, Y Duan, M Wang
2021 IEEE International Symposium on Information Theory (ISIT), 1682-1687, 2021
42021
Adaptive and robust multi-task learning
Y Duan, K Wang
arXiv preprint arXiv:2202.05250, 2022
22022
The system can't perform the operation now. Try again later.
Articles 1–11