Yuexiang Zhai

引用先

	すべて	2019 年以来
引用	392	392
h 指標	11	11
i10 指標	11	11

160

120

2019202020212022202320248 24 57 52 149 101

オープンアクセス

すべて表示

4 件の論文

0 件の論文

利用可能

利用不可

助成機関の要件に基づく

共著者

Yi Ma (马毅)Professor of EECS, UC Berkeley; Director of IDS & Head of CS, University of Hong Kong確認したメールアドレス: eecs.berkeley.edu
Sergey LevineUC Berkeley, Physical Intelligence確認したメールアドレス: eecs.berkeley.edu
Qing QuAssistant Professor, Dept. of EECS, University of Michigan確認したメールアドレス: umich.edu
Shengbang TongNYU Courant確認したメールアドレス: berkeley.edu
Zhihui ZhuAssistant Professor, Ohio State University確認したメールアドレス: osu.edu
Zhengyuan ZhouDept of Technology, Operations and Statistics at NYU Stern確認したメールアドレス: stern.nyu.edu
John WrightElectrical Engineering, Columbia University確認したメールアドレス: columbia.edu
Xiao Li (李虓)Ph.D. candidate at University of Michigan確認したメールアドレス: umich.edu
Xiao LiThe Chinese University of Hong Kong, Shenzhen確認したメールアドレス: cuhk.edu.cn
Yuqian ZhangAssistant Professor, Rutgers University確認したメールアドレス: rutgers.edu
Yong Jae LeeAssociate Professor of Computer Sciences, UW-Madison確認したメールアドレス: wisc.edu
Mu CaiCS Ph.D. Student, University of Wisconsin-Madison確認したメールアドレス: cs.wisc.edu
Li-Yi WeiAdobe Research確認したメールアドレス: adobe.com
Haozhi QiUC Berkeley確認したメールアドレス: berkeley.edu
Yichao ZhouUC Berkeley確認したメールアドレス: berkeley.edu
Qi SunNew York University確認したメールアドレス: nyu.edu
Zhili ChenByteDance/TikTok確認したメールアドレス: bytedance.com
Zitong YangStanford University確認したメールアドレス: stanford.edu
Zhenyu LiaoApplied Scientist in Amazon Inc.確認したメールアドレス: amazon.com
Chelsea FinnStanford University, Google確認したメールアドレス: cs.stanford.edu

フォロー

Yuexiang Zhai

その他の名前Simon Zhai

UC Berkeley

確認したメールアドレス: berkeley.edu - ホームページ

Artificial Intelligence Machine Learning Reinforcement Learning


タイトル引用回数順公開年順タイトル順	引用先引用先	年
Learning to Reconstruct 3D Manhattan Wireframes from a Single Image Y Zhou, H Qi, Y Zhai, Q Sun, Z Chen, LY Wei, Y Ma International Conference on Computer Vision (ICCV), 2019, 2019	63	2019
Complete dictionary learning via l4-norm maximization over the orthogonal group Y Zhai, Z Yang, Z Liao, J Wright, Y Ma Journal of Machine Learning Research 21 (165), 1-68, 2020	62	2020
Cal-ql: Calibrated offline rl pre-training for efficient online fine-tuning M Nakamoto, S Zhai, A Singh, M Sobol Mark, Y Ma, C Finn, A Kumar, ... Advances in Neural Information Processing Systems 36, 2024	45	2024
Geometric Analysis of Nonconvex Optimization Landscapes for Overcomplete Learning Q Qu, Y Zhai, X Li, Y Zhang, Z Zhu International Conference on Learning Representations (ICLR), 2020, 2020	44*	2020
Investigating the Catastrophic Forgetting in Multimodal Large Language Model Fine-Tuning Y Zhai, S Tong, X Li, M Cai, Q Qu, YJ Lee, Y Ma Conference on Parsimony and Learning, 202-227, 2024	39*	2024
Unpacking reward shaping: Understanding the benefits of reward engineering on sample complexity A Gupta, A Pacchiano, Y Zhai, S Kakade, S Levine Advances in Neural Information Processing Systems 35, 15281-15295, 2022	29	2022
Eyes wide shut? exploring the visual shortcomings of multimodal llms S Tong, Z Liu, Y Zhai, Y Ma, Y LeCun, S Xie arXiv preprint arXiv:2401.06209, 2024	27	2024
Convolutional normalization: Improving deep convolutional network robustness and training S Liu, X Li, Y Zhai, C You, Z Zhu, C Fernandez-Granda, Q Qu Advances in neural information processing systems 34, 28919-28928, 2021	24	2021
Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness Y Zhai, H Mehta, Z Zhou, Y Ma International Conference on Learning Representations (ICLR), 2020, 2020	20	2020
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning Y Zhai, C Baek, Z Zhou, J Jiao, Y Ma Journal of Artificial Intelligence Research 73, 847-896, 2022	16	2022
Understanding the complexity gains of single-task rl with a curriculum Q Li, Y Zhai, Y Ma, S Levine International Conference on Machine Learning, 20412-20451, 2023	12	2023
Lmrl gym: Benchmarks for multi-turn reinforcement learning with language models M Abdulhai, I White, CV Snell, C Sun, J Hong, Y Zhai, K Xu, S Levine	6	2023
Closed-Loop Transcription via Convolutional Sparse Coding X Dai, K Chen, S Tong, J Zhang, X Gao, M Li, D Pai, Y Zhai, XI Yuan, ... arXiv preprint arXiv:2302.09347, 2023	3	2023
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is? Y Yu, S Buchanan, D Pai, T Chu, Z Wu, S Tong, H Bai, Y Zhai, ... arXiv preprint arXiv:2311.13110, 2023	1	2023
RLIF: Interactive Imitation Learning as Reinforcement Learning J Luo, P Dong, Y Zhai, Y Ma, S Levine arXiv preprint arXiv:2311.12996, 2023	1	2023
Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement R Zhang, Y Zhai, A Zanette arXiv preprint arXiv:2402.15703, 2024		2024

現在システムで処理を実行できません。しばらくしてからもう一度お試しください。

論文 1–16

年間引用数

重複した引用

結合された引用

共著者を追加共著者

フォロー

引用先

共著者