Asaf Cassel

Cited by

	All	Since 2020
Citations	280	274
h-index	6	6
i10-index	5	5

Citations per year
Year	Citations
2018	1
2019	3
2020	20
2021	39
2022	67
2023	55
2024	65
2025	28

Public access

2 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Alon Cohen	Tel-Aviv University and Google	Verified email at google.com
Tomer Koren	Associate Professor at Tel Aviv University	Verified email at tauex.tau.ac.il
Shie Mannor	Professor of Electrical Engineering @ Technion & Researcher @ Nvidia	Verified email at technion.ac.il
Assaf Zeevi	Columbia University	Verified email at gsb.columbia.edu
Aviv Rosenberg	Google Research	Verified email at google.com
Orin Levy	Tel Aviv University	Verified email at mail.tau.ac.il
Yishay Mansour	Tel Aviv University	Verified email at tauex.tau.ac.il
Lior Shani	Google Research	Verified email at google.com
Rémi Munos	FAIR, Meta	Verified email at inria.fr
Haipeng Luo	Associate Professor, University of Southern California	Verified email at usc.edu
Guy Tennenholtz	Research Scientist, Google Research	Verified email at google.com

School of Computer Science, Tel Aviv University

Verified email at mail.tau.ac.il

Machine Learning, Optimization, Reinforcement Learning


Title	Authors	Venue	Cited by	Year
A General Approach to Multi-Armed Bandits Under Risk Criteria	A Cassel, S Mannor, A Zeevi	Proceedings of the 31st Conference On Learning Theory 75, 1295-1306, 2018	103	2018
Logarithmic Regret for Learning Linear Quadratic Regulators Efficiently	A Cassel, A Cohen, T Koren	Proceedings of the 37th International Conference on Machine Learning 119 …, 2020	82	2020
Bandit linear control	A Cassel, T Koren	Advances in Neural Information Processing Systems 33, 8872-8882, 2020	24	2020
Multi-turn reinforcement learning from preference human feedback	L Shani, A Rosenberg, A Cassel, O Lang, D Calandriello, A Zipori, ...	arXiv preprint arXiv:2405.14655, 2024	17	2024
Online Policy Gradient for Model Free Learning of Linear Quadratic Regulators with $\sqrt{T}$ Regret	AB Cassel, T Koren	International Conference on Machine Learning, 1304-1313, 2021	17	2021
A general framework for bandit problems beyond cumulative objectives	A Cassel, S Mannor, A Zeevi	Mathematics of Operations Research 48 (4), 2196-2232, 2023	6	2023
Efficient rate optimal regret for adversarial contextual MDPs using online function approximation	O Levy, A Cohen, A Cassel, Y Mansour	International Conference on Machine Learning, 19287-19314, 2023	6	2023
Rate-optimal online convex optimization in adaptive linear control	AB Cassel, A Peled-Cohen, T Koren	Advances in Neural Information Processing Systems 35, 7410-7422, 2022	6	2022
Efficient online linear control with stochastic convex costs and unknown dynamics	AB Cassel, A Cohen, T Koren	Conference on Learning Theory, 3589-3604, 2022	6	2022
Near-optimal regret in linear MDPs with aggregate bandit feedback	A Cassel, H Luo, A Rosenberg, D Sotnikov	arXiv preprint arXiv:2405.07637, 2024	4	2024
Eluder-based regret for stochastic contextual MDPs	O Levy, A Cassel, A Cohen, Y Mansour	arXiv preprint arXiv:2211.14932, 2022	4	2022
A General Framework for Bandit Problems Beyond Cumulative Objectives	A Cassel, S Mannor, A Zeevi	arXiv preprint arXiv:1806.01380, 2018	3	2018
Batch ensemble for variance dependent regret in stochastic bandits	A Cassel, O Levy, Y Mansour	Proceedings of the AAAI Conference on Artificial Intelligence 39 (15), 15678 …, 2025	1	2025
The Pendulum Arrangement: Maximizing the Escape Time of Heterogeneous Random Walks	A Cassel, S Mannor, G Tennenholtz	arXiv preprint arXiv:2007.13232, 2020	1	2020
Warm-up Free Policy Optimization: Improved Regret in Linear Markov Decision Processes	A Cassel, A Rosenberg	arXiv preprint arXiv:2407.03065, 2024		2024
Counterfactual Optimism: Rate Optimal Regret for Stochastic Contextual MDPs	O Levy, AB Cassel, A Cohen, Y Mansour	CoRR, 2022		2022
