フォロー
Byung-Jun Lee
Byung-Jun Lee
確認したメール アドレス: kaist.ac.kr - ホームページ
タイトル
引用先
引用先
Optidice: Offline policy optimization via stationary distribution correction estimation
J Lee, W Jeon, B Lee, J Pineau, KE Kim
International Conference on Machine Learning, 6120-6130, 2021
752021
Representation balancing offline model-based reinforcement learning
BJ Lee, J Lee, KE Kim
International Conference on Learning Representations, 2020
482020
Winning the l2rpn challenge: Power grid management via semi-markov afterstate actor-critic
D Yoon, S Hong, BJ Lee, KE Kim
International Conference on Learning Representations, 2020
412020
Optimizing generative dialog state tracker via cascading gradient descent
BJ Lee, W Lim, D Kim, KE Kim
Proceedings of the 15th Annual Meeting of the Special Interest Group on …, 2014
212014
Hierarchically-partitioned Gaussian process approximation
BJ Lee, J Lee, KE Kim
Artificial Intelligence and Statistics, 822-831, 2017
182017
Neural dialog state tracker for large ontologies by attention mechanism
Y Jang, J Ham, BJ Lee, Y Chang, KE Kim
2016 IEEE spoken language technology workshop (SLT), 531-537, 2016
172016
Batch reinforcement learning with hyperparameter gradients
B Lee, J Lee, P Vrancx, D Kim, KE Kim
International Conference on Machine Learning, 5725-5735, 2020
162020
Reinforcement learning for control with multiple frequencies
J Lee, BJ Lee, KE Kim
Advances in Neural Information Processing Systems 33, 3254-3264, 2020
162020
Dialog history construction with long-short term memory for robust generative dialog state tracking
BJ Lee, KE Kim
Dialogue & Discourse 7 (3), 47-64, 2016
122016
Cross-language neural dialog state tracker for large ontologies using hierarchical attention
Y Jang, J Ham, BJ Lee, KE Kim
IEEE/ACM Transactions on Audio, Speech, and Language Processing 26 (11 …, 2018
112018
Residual neural processes
BJ Lee, S Hong, KE Kim
Proceedings of the AAAI Conference on Artificial Intelligence 34 (04), 4545-4552, 2020
72020
Local metric learning for off-policy evaluation in contextual bandits with continuous actions
H Lee, J Lee, Y Choi, W Jeon, BJ Lee, YK Noh, KE Kim
Advances in Neural Information Processing Systems 35, 3913-3925, 2022
32022
MARS: Multiagent Reinforcement Learning for Spatial–Spectral and Temporal Feature Selection in EEG-Based BCI
DH Shin, YH Son, JM Kim, HJ Ahn, JH Seo, CH Ji, JW Han, BJ Lee, ...
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2024
12024
Adaptive Online Time-Series Prediction for Virtual Metrology in Semiconductor Manufacturing
S Zabrocki, PS Jo, C Park, D Yim, S Yun, BJ Lee
2023 34th Annual SEMI Advanced Semiconductor Manufacturing Conference (ASMC …, 2023
12023
Relaxed Stationary Distribution Correction Estimation for Improved Offline Policy Optimization
W Kim, D Ki, BJ Lee
Proceedings of the AAAI Conference on Artificial Intelligence 38 (12), 13185 …, 2024
2024
Offline Imitation Learning by Controlling the Effective Planning Horizon
HJ Ahn, SW Shim, BJ Lee
arXiv preprint arXiv:2401.09728, 2024
2024
Quantifying Information of Tokens for Simple and Flexible Simultaneous Machine Translation
D Lee, M Park, BJ Lee
Proceedings of the 27th Conference on Computational Natural Language …, 2023
2023
Improving Neural Machine Translation with Offline Evaluations
MK Park, BJ Lee
Proceedings of the 13th International Joint Conference on Natural Language …, 2023
2023
Learning variable-length skills through Novelty-based Decision Point Identification
M Kim, H Lee, JH Seo, SW Shim, BJ Lee
2023
Offline Reinforcement Learning via Weighted -divergence
W Kim, D Ki, BJ Lee
2022
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20