Yao Ma
Yao Ma
Verified email at bu.edu
Title
Cited by
Cited by
Year
Theoretical comparisons of positive-unlabeled learning against positive-negative learning
G Niu, MC du Plessis, T Sakai, Y Ma, M Sugiyama
Advances in neural information processing systems 29, 1199-1207, 2016
612016
Hybrid constraint SVR for facial age estimation
J Liu, Y Ma, L Duan, F Wang, Y Liu
Signal Processing 94, 576-582, 2014
392014
A policy search method for temporal logic specified reinforcement learning tasks
X Li, Y Ma, C Belta
2018 Annual American Control Conference (ACC), 240-245, 2018
272018
Gradient Descent for Sparse Rank-One Matrix Completion for Crowd-Sourced Aggregation of Sparsely Interacting Workers
Y Ma, A Olshevsky, C Szepesvári, V Saligrama
ICML 2018, 2018
102018
Double layer multiple task learning for age estimation with insufficient training samples
Y Ma, J Liu, X Yang, Y Liu, N Zheng
Neurocomputing 147, 380-386, 2015
102015
Facial age estimation from web photos using multiple-instance learning
X Yang, J Liu, Y Ma, J Xue
2014 IEEE international conference on multimedia and expo (ICME), 1-6, 2014
92014
Bandit-based task assignment for heterogeneous crowdsourcing
H Zhang, Y Ma, M Sugiyama
Neural computation 27 (11), 2447-2475, 2015
82015
Automata guided reinforcement learning with demonstrations
X Li, Y Ma, C Belta
arXiv preprint arXiv:1809.06305, 2018
62018
Online Markov decision processes with policy iteration
Y Ma, H Zhang, M Sugiyama
arXiv preprint arXiv:1510.04454, 2015
52015
Automata guided hierarchical reinforcement learning for zero-shot skill composition
X Li, Y Ma, C Belta
42018
Crowdsourcing with sparsely interacting workers
Y Ma, A Olshevsky, V Saligrama, C Szepesvari
arXiv preprint arXiv:1706.06660, 2017
42017
An Online Policy Gradient Algorithm for Markov Decision Processes with Continuous States and Actions
Y Ma, T Zhao, K Hatano, M Sugiyama
ECML PKDD 2014, 2014
22014
Automata-Guided Hierarchical Reinforcement Learning for Skill Composition
X Li, Y Ma, C Belta
arXiv preprint arXiv:1711.00129, 2017
12017
Automata Guided Skill Composition
X Li, Y Ma, C Belta
2018
AUTOMATA GUIDED HIERARCHICAL REINFORCE-MENT LEARNING FOR ZERO-SHOT SKILL COMPOSI
X Li, Y Ma, C Belta
arXiv preprint arXiv:1711.00129, 2017
2017
Online decision making in non-stationary Markovian environments
Y Ma, Y Ma
東京工業大学, 2015
2015
Online Markov decision processes with policy iteration (情報論的学習理論と機械学習 情報論的学習理論ワークショップ (IBIS2015))
Y MA, H ZHANG, M SUGIYAMA
電子情報通信学会技術研究報告= IEICE technical report: 信学技報 115 (323 …, 2015
2015
An Online Policy Gradient Algorithm for Continuous State and Action Markov Decision Processes with Bandit Feedback (情報論的学習理論と機械学習 情報論的学習理論ワークショップ)
Y MA, M SUGIYAMA
電子情報通信学会技術研究報告= IEICE technical report: 信学技報 114 (306 …, 2014
2014
The system can't perform the operation now. Try again later.
Articles 1–18