Alborz Geramifard

引用先

	すべて	2019 年以来
引用	1925	1086
h 指標	22	16
i10 指標	36	25

260

130

195

20072008200920102011201220132014201520162017201820192020202120222023202412 12 18 14 46 89 85 100 108 92 104 138 138 157 229 240 253 67

共著者

Jonathan P. HowRichard C. Maclaurin Professor of Aerospace Engineering, Massachusetts Institute of Technology確認したメールアドレス: mit.edu
Nicholas RoyMIT確認したメールアドレス: csail.mit.edu
Satwik KotturResearch Scientist, Facebook AI確認したメールアドレス: fb.com
Seungwhan MoonFacebook, Carnegie Mellon University確認したメールアドレス: fb.com
Ahmad BeiramiGoogle Research確認したメールアドレス: google.com
Michael BowlingUniversity of Alberta確認したメールアドレス: ualberta.ca
Paul A CrookResearch Scientist, Meta Platforms, Inc.確認したメールアドレス: fb.com
Nazim Kemal UreIstanbul Technical University確認したメールアドレス: itu.edu.tr
Richard S. SuttonKeen, Amii, and University of Alberta確認したメールアドレス: richsutton.com
Rajen SubbaGoogle確認したメールアドレス: google.com
Girish ChowdharyAssociate Professor確認したメールアドレス: illinois.edu
Chinnadhurai SankarResearch Lead, SliceX AI | ex-Meta AI確認したメールアドレス: fb.com
Ankita DeFacebook確認したメールアドレス: fb.com
Thomas J. WalshSony AI確認したメールアドレス: sony.com
Csaba SzepesvariDeepMind & University of Alberta確認したメールアドレス: cs.ualberta.ca
Babak DamavandiMeta Reality Labs確認したメールアドレス: fb.com
David WhitneyMeta確認したメールアドレス: meta.com
Christoph DannResearch Scientist, Google確認したメールアドレス: google.com
Stefanie TellexBrown University確認したメールアドレス: cs.brown.edu
Will DabneyDeepMind確認したメールアドレス: google.com

フォロー

Alborz Geramifard

Research Scientist Director at Meta

確認したメールアドレス: meta.com - ホームページ

Reinforcement Learning Conversational AI Planning Brain and Cognitive Sciences


タイトル引用回数順公開年順タイトル順	引用先引用先	年
Dyna-style planning with linear function approximation and prioritized sweeping RS Sutton, C Szepesvári, A Geramifard, MP Bowling arXiv preprint arXiv:1206.3285, 2012	227	2012
A tutorial on linear function approximators for dynamic programming and reinforcement learning A Geramifard, TJ Walsh, S Tellex, G Chowdhary, N Roy, JP How Foundations and Trends® in Machine Learning 6 (4), 375-451, 2013	161	2013
Decentralized control of partially observable Markov decision processes C Amato, G Chowdhary, A Geramifard, NK Üre, MJ Kochenderfer 52nd IEEE Conference on Decision and Control, 2398-2405, 2013	147	2013
Cooperative mission planning for multi-UAV teams SS Ponda, LB Johnson, A Geramifard, JP How Handbook of unmanned aerial vehicles 2, 1447-1490, 2015	100	2015
Incremental least-squares temporal difference learning A Geramifard, M Bowling, RS Sutton Proceedings of the 21st national conference on Artificial intelligence …, 2006	91	2006
RLPy: a value-function-based reinforcement learning framework for education and research. A Geramifard, C Dann, RH Klein, W Dabney, JP How J. Mach. Learn. Res. 16 (1), 1573-1578, 2015	86	2015
Online Discovery of Feature Dependencies. A Geramifard, F Doshi, J Redding, N Roy, JP How ICML, 881-888, 2011	81	2011
SIMMC 2.0: A task-oriented dialog dataset for immersive multimodal conversations S Kottur, S Moon, A Geramifard, B Damavandi arXiv preprint arXiv:2104.08667, 2021	75	2021
Situated and interactive multimodal conversations S Moon, S Kottur, PA Crook, A De, S Poddar, T Levin, D Whitney, ... arXiv preprint arXiv:2006.01460, 2020	75	2020
Overview of the ninth dialog system technology challenge: Dstc9 C Gunasekara, S Kim, LF D'Haro, A Rastogi, YN Chen, M Eric, ... arXiv preprint arXiv:2011.06486, 2020	69	2020
iLSTD: Eligibility traces and convergence analysis A Geramifard, M Bowling, M Zinkevich, RS Sutton Advances in Neural Information Processing Systems 19, 2006	62	2006
On the design and use of a micro air vehicle to track and avoid adversaries R He, A Bachrach, M Achtelik, A Geramifard, D Gurdan, S Prentice, ... The International Journal of Robotics Research 29 (5), 529-546, 2010	54	2010
Intelligent cooperative control architecture: a framework for performance improvement using safe learning A Geramifard, J Redding, JP How Journal of Intelligent & Robotic Systems 72, 83-103, 2013	52	2013
Customized movie trailers A Geramifard US Patent App. 14/105,428, 2015	50	2015
Reinforcement learning with misspecified model classes J Joseph, A Geramifard, JW Roberts, JP How, N Roy 2013 IEEE International Conference on Robotics and Automation, 939-946, 2013	46	2013
UAV cooperative control with stochastic risk models A Geramifard, J Redding, N Roy, JP How Proceedings of the 2011 american control conference, 3393-3398, 2011	46	2011
Biased cost pathfinding A Geramifard, P Chubak, V Bulitko Proceedings of the AAAI Conference on Artificial Intelligence and …, 2006	41	2006
An intelligent cooperative control architecture J Redding, A Geramifard, A Undurti, HL Choi, JP How Proceedings of the 2010 American control conference, 57-62, 2010	37	2010
Adaptive planning for Markov decision processes with uncertain transition models via incremental feature dependency discovery NK Ure, A Geramifard, G Chowdhary, JP How Machine Learning and Knowledge Discovery in Databases: European Conference …, 2012	32	2012
Annotation inconsistency and entity bias in MultiWOZ K Qian, A Beirami, Z Lin, A De, A Geramifard, Z Yu, C Sankar arXiv preprint arXiv:2105.14150, 2021	30	2021

現在システムで処理を実行できません。しばらくしてからもう一度お試しください。

論文 1–20

年間引用数

重複した引用

結合された引用

共著者を追加共著者

フォロー

引用先

共著者