フォロー
Rémy Degenne
Rémy Degenne
Inria Lille
確認したメール アドレス: inria.fr - ホームページ
タイトル
引用先
引用先
Non-asymptotic pure exploration by solving games
R Degenne, WM Koolen, P Ménard
Advances in Neural Information Processing Systems 32, 2019
872019
Gamification of pure exploration for linear bandits
R Degenne, P Ménard, X Shang, M Valko
International Conference on Machine Learning, 2432-2442, 2020
862020
Pure exploration with multiple correct answers
R Degenne, WM Koolen
Advances in Neural Information Processing Systems 32, 2019
712019
Anytime optimal algorithms in stochastic multi-armed bandits
R Degenne, V Perchet
International Conference on Machine Learning, 1587-1595, 2016
652016
Combinatorial semi-bandit with known covariance
R Degenne, V Perchet
Advances in Neural Information Processing Systems 29, 2016
552016
Top two algorithms revisited
M Jourdan, R Degenne, D Baudry, R de Heide, E Kaufmann
Advances in Neural Information Processing Systems 35, 26791-26803, 2022
352022
Structure adaptive algorithms for stochastic bandits
R Degenne, H Shao, W Koolen
International Conference on Machine Learning, 2443-2452, 2020
352020
Bridging the gap between regret minimization and best arm identification, with application to a/b tests
R Degenne, T Nedelec, C Calauzènes, V Perchet
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
212019
On the existence of a complexity in fixed budget bandit identification
R Degenne
The Thirty Sixth Annual Conference on Learning Theory, 1131-1154, 2023
162023
Dealing with unknown variances in best-arm identification
M Jourdan, D Rémy, K Emilie
International Conference on Algorithmic Learning Theory, 776-849, 2023
142023
Dealing with misspecification in fixed-confidence linear top-m identification
C Réda, A Tirinzoni, R Degenne
Advances in Neural Information Processing Systems 34, 25489-25501, 2021
112021
Bandits with side observations: Bounded vs. logarithmic regret
R Degenne, E Garcelon, V Perchet
arXiv preprint arXiv:1807.03558, 2018
112018
Non-asymptotic analysis of a ucb-based top two algorithm
M Jourdan, R Degenne
Advances in Neural Information Processing Systems 36, 2024
72024
Accelerated gradient temporal difference learning algorithms
D Meyer, R Degenne, A Omrane, H Shen
2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement …, 2014
72014
An -Best-Arm Identification Algorithm for Fixed-Confidence and Beyond
M Jourdan, R Degenne, E Kaufmann
Advances in Neural Information Processing Systems 36, 16578-16649, 2023
62023
On elimination strategies for bandit fixed-confidence identification
A Tirinzoni, R Degenne
Advances in Neural Information Processing Systems 35, 18586-18598, 2022
62022
Impact of structure on the design and analysis of bandit algorithms
R Degenne
Université Paris Cité, 2019
52019
Online sign identification: Minimization of the number of errors in thresholding bandits
R Ouhamma, OA Maillard, V Perchet
Advances in Neural Information Processing Systems 34, 18577-18589, 2021
32021
Choosing Answers in Epsilon-Best-Answer Identification for Linear Bandits
M Jourdan, R Degenne
International Conference on Machine Learning, 10384-10430, 2022
22022
A Formalization of Doob’s Martingale Convergence Theorems in mathlib
K Ying, R Degenne
Proceedings of the 12th ACM SIGPLAN International Conference on Certified …, 2023
12023
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20