Qiwen Cui

引用先

	すべて	2019 年以来
引用	264	264
h 指標	10	10
i10 指標	11	11

100

202020212022202320242 25 55 90 92

オープンアクセス

すべて表示

7 件の論文

0 件の論文

利用可能

利用不可

助成機関の要件に基づく

共著者

Simon Shaolei DuAssistant Professor, School of Computer Science and Engineering, University of Washington確認したメールアドレス: cs.washington.edu
Lin F. Yang (杨林)Assistant Professor, Department of Electrical and Computer Engineering @ UCLA確認したメールアドレス: ee.ucla.edu
Maryam FazelMoorthy Family Professor of Electrical and Computer Engineering, University of Washington確認したメールアドレス: uw.edu
Zaiwen WenPeking University確認したメールアドレス: pku.edu.cn
Ruoqi ShenUniversity of Washington確認したメールアドレス: cs.washington.edu
Kaiqing ZhangAssistant Professor, University of Maryland, College Park確認したメールアドレス: umd.edu

フォロー

Qiwen Cui

PhD, University of Washington

確認したメールアドレス: uw.edu - ホームページ

Machine Learning Theory


タイトル引用回数順公開年順タイトル順	引用先引用先	年
When are offline two-player zero-sum Markov games solvable? Q Cui, SS Du Advances in Neural Information Processing Systems 35, 25779-25791, 2022	47	2022
Randomized Exploration for Reinforcement Learning with General Value Function Approximation H Ishfaq, Q Cui, V Nguyen, A Ayoub, Z Yang, Z Wang, D Precup, LF Yang Thirty-eighth International Conference on Machine Learning, 2021	40	2021
Minimax sample complexity for turn-based stochastic game Q Cui, LF Yang Thirty-seventh Conference on Uncertainty in Artificial Intelligence, 2020	25	2020
Breaking the curse of multiagents in a large state space: Rl in markov games with independent linear function approximation Q Cui, K Zhang, S Du The Thirty Sixth Annual Conference on Learning Theory, 2651-2652, 2023	24	2023
Near-optimal randomized exploration for tabular markov decision processes Z Xiong, R Shen, Q Cui, M Fazel, SS Du Advances in Neural Information Processing Systems 35, 6358-6371, 2022	24*	2022
Clinical decision support model for tooth extraction therapy derived from electronic dental records Q Cui, Q Chen, P Liu, D Liu, Z Wen The Journal of Prosthetic Dentistry 126 (1), 83-90, 2021	24	2021
Provably efficient offline multi-agent reinforcement learning via strategy-wise bonus Q Cui, SS Du Advances in Neural Information Processing Systems 35, 11739-11751, 2022	22	2022
Learning in congestion games with bandit feedback Q Cui, Z Xiong, M Fazel, SS Du Advances in Neural Information Processing Systems 35, 11009-11022, 2022	14	2022
On gap-dependent bounds for offline reinforcement learning X Wang, Q Cui, SS Du Advances in Neural Information Processing Systems 35, 14865-14877, 2022	14	2022
Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning? Q Cui, LF Yang Thirty-fourth Conference on Neural Information Processing Systems, 2020	13	2020
An efficient Fisher matrix approximation method for large-scale neural network optimization M Yang, D Xu, Q Cui, Z Wen, P Xu IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (5), 5391-5403, 2022	10*	2022
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning Z Zhou, C Zhu, R Zhou, Q Cui, A Gupta, SS Du arXiv preprint arXiv:2310.19308, 2023	3	2023
Learning Optimal Tax Design in Nonatomic Congestion Games Q Cui, M Fazel, SS Du arXiv preprint arXiv:2402.07437, 2024	2	2024
Refined sample complexity for markov games with independent linear function approximation Y Dai, Q Cui, SS Du arXiv preprint arXiv:2402.07082, 2024	1	2024
Offline congestion games: How feedback type affects data coverage requirement H Jiang, Q Cui, Z Xiong, M Fazel, SS Du arXiv preprint arXiv:2210.13396, 2022	1	2022
-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model Y Zhang, L Chen, B Liu, Y Yang, Q Cui, Y Tao, H Yang arXiv preprint arXiv:2403.07191, 2024		2024
A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning H Jiang, Q Cui, Z Xiong, M Fazel, SS Du arXiv preprint arXiv:2306.07465, 2023		2023

現在システムで処理を実行できません。しばらくしてからもう一度お試しください。

論文 1–17

年間引用数

重複した引用

結合された引用

共著者を追加共著者

フォロー

引用先

共著者