Offline Meta Reinforcement Learning--Identifiability Challenges and Effective Data Collection Strategies R Dorfman, I Shenfeld, A Tamar Advances in Neural Information Processing Systems 34, 4607-4618, 2021 | 96* | 2021 |
Curiosity-driven red-teaming for large language models ZW Hong, I Shenfeld, TH Wang, YS Chuang, A Pareja, J Glass, ... arXiv preprint arXiv:2402.19464, 2024 | 21 | 2024 |
TGRL: An Algorithm for Teacher Guided Reinforcement Learning I Shenfeld, ZW Hong, A Tamar, P Agrawal ICML 2023, 2023 | 12* | 2023 |
Value Augmented Sampling for Language Model Alignment and Personalization I Shenfeld, S Han, A Srivastava, Y Kim, P Agrawal ICLR 2024 Workshop on Reliable and Responsible Foundation Models (Oral), 2024 | 3* | 2024 |
JUICER: Data-Efficient Imitation Learning for Robotic Assembly L Ankile, A Simeonov, I Shenfeld, P Agrawal arXiv preprint arXiv:2404.03729, 2024 | 1 | 2024 |
The Future of Open Human Feedback S Don-Yehiya, B Burtenshaw, RF Astudillo, C Osborne, M Jaiswal, ... arXiv preprint arXiv:2408.16961, 2024 | | 2024 |
From Imitation to Refinement--Residual RL for Precise Visual Assembly L Ankile, A Simeonov, I Shenfeld, M Torne, P Agrawal arXiv preprint arXiv:2407.16677, 2024 | | 2024 |