Alborz Geramifard
Research Scientist Director at Meta
Verified email at meta.com
Title
Cited by
Year
Dyna-style planning with linear function approximation and prioritized sweeping
RS Sutton, C Szepesvári, A Geramifard, MP Bowling
arXiv preprint arXiv:1206.3285, 2012
Cited by 235, 2012
A tutorial on linear function approximators for dynamic programming and reinforcement learning
A Geramifard, TJ Walsh, S Tellex, G Chowdhary, N Roy, JP How
Foundations and Trends® in Machine Learning 6 (4), 375-451, 2013
Cited by 164, 2013
Decentralized control of partially observable Markov decision processes
C Amato, G Chowdhary, A Geramifard, NK Üre, MJ Kochenderfer
52nd IEEE Conference on Decision and Control, 2398-2405, 2013
Cited by 154, 2013
Cooperative mission planning for multi-UAV teams
SS Ponda, LB Johnson, A Geramifard, JP How
Handbook of unmanned aerial vehicles 2, 1447-1490, 2015
Cited by 99, 2015
Incremental least-squares temporal difference learning
A Geramifard, M Bowling, RS Sutton
Proceedings of the 21st national conference on Artificial intelligence …, 2006
Cited by 94, 2006
RLPy: a value-function-based reinforcement learning framework for education and research.
A Geramifard, C Dann, RH Klein, W Dabney, JP How
J. Mach. Learn. Res. 16 (1), 1573-1578, 2015
Cited by 92, 2015
SIMMC 2.0: A task-oriented dialog dataset for immersive multimodal conversations
S Kottur, S Moon, A Geramifard, B Damavandi
arXiv preprint arXiv:2104.08667, 2021
Cited by 91, 2021
Situated and interactive multimodal conversations
S Moon, S Kottur, PA Crook, A De, S Poddar, T Levin, D Whitney, ...
arXiv preprint arXiv:2006.01460, 2020
Cited by 84, 2020
Online Discovery of Feature Dependencies.
A Geramifard, F Doshi, J Redding, N Roy, JP How
ICML, 881-888, 2011
Cited by 82, 2011
Overview of the ninth dialog system technology challenge: DSTC9
C Gunasekara, S Kim, LF D'Haro, A Rastogi, YN Chen, M Eric, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
Cited by 71, 2024
iLSTD: Eligibility traces and convergence analysis
A Geramifard, M Bowling, M Zinkevich, RS Sutton
Advances in Neural Information Processing Systems 19, 2006
Cited by 66, 2006
Intelligent cooperative control architecture: a framework for performance improvement using safe learning
A Geramifard, J Redding, JP How
Journal of Intelligent & Robotic Systems 72, 83-103, 2013
Cited by 57, 2013
On the design and use of a micro air vehicle to track and avoid adversaries
R He, A Bachrach, M Achtelik, A Geramifard, D Gurdan, S Prentice, ...
The International Journal of Robotics Research 29 (5), 529-546, 2010
Cited by 57, 2010
Customized movie trailers
A Geramifard
US Patent App. 14/105,428, 2015
Cited by 51, 2015
Reinforcement learning with misspecified model classes
J Joseph, A Geramifard, JW Roberts, JP How, N Roy
2013 IEEE International Conference on Robotics and Automation, 939-946, 2013
Cited by 47, 2013
UAV cooperative control with stochastic risk models
A Geramifard, J Redding, N Roy, JP How
Proceedings of the 2011 American Control Conference, 3393-3398, 2011
Cited by 45, 2011
Biased cost pathfinding
A Geramifard, P Chubak, V Bulitko
Proceedings of the AAAI Conference on Artificial Intelligence and …, 2006
Cited by 45, 2006
An intelligent cooperative control architecture
J Redding, A Geramifard, A Undurti, HL Choi, JP How
Proceedings of the 2010 American control conference, 57-62, 2010
Cited by 37, 2010
Annotation inconsistency and entity bias in MultiWOZ
K Qian, A Beirami, Z Lin, A De, A Geramifard, Z Yu, C Sankar
arXiv preprint arXiv:2105.14150, 2021
Cited by 34, 2021
Memformer: A memory-augmented transformer for sequence modeling
Q Wu, Z Lan, K Qian, J Gu, A Geramifard, Z Yu
arXiv preprint arXiv:2010.06891, 2020
Cited by 33, 2020