Title | Authors | Venue | Cited by | Year |
PAC-Bayesian lifelong learning for multi-armed bandits | H Flynn, D Reeb, M Kandemir, J Peters | Data Mining and Knowledge Discovery 36 (2), 841-876, 2022 | 13 | 2022 |
Improved Algorithms for Stochastic Linear Bandits Using Tail Bounds for Martingale Mixtures | H Flynn, D Reeb, M Kandemir, JR Peters | Advances in Neural Information Processing Systems 36, 2024 | 6 | 2024 |
PAC-Bayes Bounds for Bandit Problems: A Survey and Experimental Comparison | H Flynn, D Reeb, M Kandemir, J Peters | IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 5 | 2023 |
Device, computer program and computer-implemented method for machine learning | H Flynn, D Reeb, J Peters, M Kandemir | US Patent App. 17/858,980, 2023 | 1 | 2023 |
Tighter Confidence Bounds for Sequential Kernel Regression | H Flynn, D Reeb | arXiv preprint arXiv:2403.12732, 2024 | | 2024 |
Method and device for reinforcement learning | H Flynn, J Peters, M Kandemir | US Patent App. 18/046,564, 2023 | | 2023 |
PAC-Bayesian Bandit Algorithms With Guarantees | H Flynn | Technische Universität Darmstadt, 2023 | | 2023 |
Device for and computer implemented method of machine learning | H Flynn, J Peters, M Kandemir | US Patent App. 17/445,428, 2022 | | 2022 |