High-confidence off-policy evaluation P Thomas, G Theocharous, M Ghavamzadeh Proceedings of the AAAI Conference on Artificial Intelligence 29 (1), 2015 | 327 | 2015 |
High confidence policy improvement P Thomas, G Theocharous, M Ghavamzadeh International Conference on Machine Learning, 2380-2388, 2015 | 224 | 2015 |
Ad recommendation systems for life-time value optimization G Theocharous, PS Thomas, M Ghavamzadeh Proceedings of the 24th International Conference on World Wide Web, 1305-1310, 2015 | 205 | 2015 |
Learning action representations for reinforcement learning Y Chandak, G Theocharous, J Kostas, S Jordan, P Thomas International Conference on Machine Learning, 941-950, 2019 | 195 | 2019 |
Lifetime value marketing using reinforcement learning G Theocharous, A Hallak RLDM 2013, 19, 2013 | 156 | 2013 |
Optimizing production manufacturing using reinforcement learning S Mahadevan, G Theocharous FLAIRS, 372-377, 1998 | 122 | 1998 |
Approximate planning in POMDPs with macro-actions G Theocharous, L Kaelbling Advances in neural information processing systems 16, 2003 | 118 | 2003 |
Method and apparatus for user-activity-based dynamic power management and policy creation for mobile platforms GN Theocharous, NN Shah, UK Sengupta, WN Schilit, KC Silvester, ... US Patent 7,861,099, 2010 | 109 | 2010 |
Learning hierarchical partially observable Markov decision process models for robot navigation G Theocharous, K Rohanimanesh, S Mahadevan Proceedings 2001 ICRA. IEEE International Conference on Robotics and …, 2001 | 98 | 2001 |
Representing hierarchical POMDPs as DBNs for multi-scale robot localization G Theocharous, K Murphy, LP Kaelbling IEEE International Conference on Robotics and Automation, 2004. Proceedings …, 2004 | 91 | 2004 |
Approximate planning with hierarchical partially observable Markov decision process models for robot navigation G Theocharous, S Mahadevan Proceedings 2002 IEEE International Conference on Robotics and Automation …, 2002 | 79 | 2002 |
Optimizing for the future in non-stationary MDPs Y Chandak, G Theocharous, S Shankar, M White, S Mahadevan, ... International Conference on Machine Learning, 1414-1425, 2020 | 74 | 2020 |
Predictive off-policy policy evaluation for nonstationary decision problems, with applications to digital marketing P Thomas, G Theocharous, M Ghavamzadeh, I Durugkar, E Brunskill Proceedings of the AAAI Conference on Artificial Intelligence 31 (2), 4740-4745, 2017 | 68 | 2017 |
Risk Quantification for Policy Deployment PS Thomas, G Theocharous, M Ghavamzadeh US Patent App. 14/552,047, 2016 | 58 | 2016 |
Hierarchical learning and planning in partially observable Markov decision processes GN Theocharous Michigan State University, 2002 | 52 | 2002 |
Method and apparatus for user-activity-based dynamic power management and policy creation for mobile platforms GN Theocharous, NN Shah, UK Sengupta, WN Schilit US Patent 7,861,098, 2010 | 47 | 2010 |
An expert system for assigning patients into clinical trials based on Bayesian networks C Papaconstantinou, G Theocharous, S Mahadevan Journal of Medical Systems 22 (3), 189-202, 1998 | 46 | 1998 |
Rapid concept learning for mobile robots S Mahadevan, G Theocharous, N Khaleeli Autonomous Robots 5, 239-251, 1998 | 43 | 1998 |
Kernel-based reinforcement learning on representative states B Kveton, G Theocharous Proceedings of the AAAI Conference on Artificial Intelligence 26 (1), 977-983, 2012 | 35 | 2012 |
Tractable POMDP planning algorithms for optimal teaching in “SPAIS” G Theocharous, R Beckwith, N Butko, M Philipose IJCAI PAIR Workshop, 11-17, 2009 | 33 | 2009 |