フォロー
Yuexiang Zhai
Yuexiang Zhai
確認したメール アドレス: berkeley.edu - ホームページ
タイトル
引用先
引用先
Learning to Reconstruct 3D Manhattan Wireframes from a Single Image
Y Zhou, H Qi, Y Zhai, Q Sun, Z Chen, LY Wei, Y Ma
International Conference on Computer Vision (ICCV), 2019, 2019
632019
Complete dictionary learning via l4-norm maximization over the orthogonal group
Y Zhai, Z Yang, Z Liao, J Wright, Y Ma
Journal of Machine Learning Research 21 (165), 1-68, 2020
622020
Cal-ql: Calibrated offline rl pre-training for efficient online fine-tuning
M Nakamoto, S Zhai, A Singh, M Sobol Mark, Y Ma, C Finn, A Kumar, ...
Advances in Neural Information Processing Systems 36, 2024
452024
Geometric Analysis of Nonconvex Optimization Landscapes for Overcomplete Learning
Q Qu, Y Zhai, X Li, Y Zhang, Z Zhu
International Conference on Learning Representations (ICLR), 2020, 2020
44*2020
Investigating the Catastrophic Forgetting in Multimodal Large Language Model Fine-Tuning
Y Zhai, S Tong, X Li, M Cai, Q Qu, YJ Lee, Y Ma
Conference on Parsimony and Learning, 202-227, 2024
39*2024
Unpacking reward shaping: Understanding the benefits of reward engineering on sample complexity
A Gupta, A Pacchiano, Y Zhai, S Kakade, S Levine
Advances in Neural Information Processing Systems 35, 15281-15295, 2022
292022
Eyes wide shut? exploring the visual shortcomings of multimodal llms
S Tong, Z Liu, Y Zhai, Y Ma, Y LeCun, S Xie
arXiv preprint arXiv:2401.06209, 2024
272024
Convolutional normalization: Improving deep convolutional network robustness and training
S Liu, X Li, Y Zhai, C You, Z Zhu, C Fernandez-Granda, Q Qu
Advances in neural information processing systems 34, 28919-28928, 2021
242021
Understanding l4-based Dictionary Learning: Interpretation, Stability, and Robustness
Y Zhai, H Mehta, Z Zhou, Y Ma
International Conference on Learning Representations (ICLR), 2020, 2020
202020
Computational Benefits of Intermediate Rewards for Goal-Reaching Policy Learning
Y Zhai, C Baek, Z Zhou, J Jiao, Y Ma
Journal of Artificial Intelligence Research 73, 847-896, 2022
162022
Understanding the complexity gains of single-task rl with a curriculum
Q Li, Y Zhai, Y Ma, S Levine
International Conference on Machine Learning, 20412-20451, 2023
122023
Lmrl gym: Benchmarks for multi-turn reinforcement learning with language models
M Abdulhai, I White, CV Snell, C Sun, J Hong, Y Zhai, K Xu, S Levine
62023
Closed-Loop Transcription via Convolutional Sparse Coding
X Dai, K Chen, S Tong, J Zhang, X Gao, M Li, D Pai, Y Zhai, XI Yuan, ...
arXiv preprint arXiv:2302.09347, 2023
32023
White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?
Y Yu, S Buchanan, D Pai, T Chu, Z Wu, S Tong, H Bai, Y Zhai, ...
arXiv preprint arXiv:2311.13110, 2023
12023
RLIF: Interactive Imitation Learning as Reinforcement Learning
J Luo, P Dong, Y Zhai, Y Ma, S Levine
arXiv preprint arXiv:2311.12996, 2023
12023
Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement
R Zhang, Y Zhai, A Zanette
arXiv preprint arXiv:2402.15703, 2024
2024
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–16