Jia Yuan Yu

Cited by

	All	Since 2019
Citations	1411	952
h-index	18	16
i10-index	34	25

180

135

200920102011201220132014201520162017201820192020202120222023202415 17 16 25 33 48 52 60 78 106 142 176 177 155 174 127

Public access

View all

16 articles

6 articles

available

not available

Based on funding mandates

Jia Yuan Yu

Amazon

Verified email at amazon.com


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Markov decision processes with arbitrary reward processes JY Yu, S Mannor, N Shimkin Mathematics of Operations Research 34 (3), 737-757, 2009	133	2009
Online Learning with Sample Path Constraints. S Mannor, JN Tsitsiklis, JY Yu Journal of Machine Learning Research 10 (3), 2009	129	2009
Piecewise-stationary bandit problems with side observations JY Yu, S Mannor Proceedings of the 26th annual international conference on machine learning …, 2009	116	2009
Unimodal Bandits. JY Yu, S Mannor ICML, 41-48, 2011	107	2011
A reinforcement learning technique for optimizing downlink scheduling in an energy-limited vehicular network RF Atallah, CM Assi, JY Yu IEEE Transactions on Vehicular Technology 66 (6), 4592-4601, 2016	98	2016
Lipschitz bandits without the lipschitz constant S Bubeck, G Stoltz, JY Yu Algorithmic Learning Theory: 22nd International Conference, ALT 2011, Espoo …, 2011	91	2011
Online learning in Markov decision processes with arbitrarily changing rewards and transitions JY Yu, S Mannor 2009 international conference on game theory for networks, 314-322, 2009	54	2009
On the design of campus parking systems with QoS guarantees W Griggs, JY Yu, F Wirth, F Häusler, R Shorten IEEE Transactions on Intelligent Transportation Systems 17 (5), 1428-1437, 2015	52	2015
Sample Complexity of Risk-Averse Bandit-Arm Selection. JY Yu, E Nikolova IJCAI, 2576-2582, 2013	48	2013
Arbitrarily modulated Markov decision processes JY Yu, S Mannor Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held …, 2009	45	2009
Distributed parking space detection, characterization, advertisement, and enforcement RL Cogill, O Gallay, C Lee, Z Nabi, M Rufli, R Shorten, T Tchrakian, ... US Patent 9,601,018, 2017	37	2017
Data-driven distributionally robust polynomial optimization M Mevissen, E Ragnoli, JY Yu Advances in Neural Information Processing Systems 26, 2013	32	2013
Reward modeling for mitigating toxicity in transformer-based language models F Faal, K Schmitt, JY Yu Applied Intelligence 53 (7), 8421-8435, 2023	25	2023
Reinforcement mechanism design for electric vehicle demand response in microgrid charging stations L Hou, S Ma, J Yan, C Wang, JY Yu 2020 International Joint Conference on Neural Networks (IJCNN), 1-8, 2020	22	2020
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and network utility maximization F Wirth, S Stuedli, JY Yu, M Corless, R Shorten arXiv preprint arXiv:1404.5064, 2014	22	2014
Mean field equilibria of multi armed bandit games R Gummadi, R Johari, JY Yu 2012 50th Annual Allerton Conference on Communication, Control, and …, 2012	21	2012
Mean field analysis of multi-armed bandit games R Gummadi, R Johari, S Schmit, JY Yu Available at SSRN 2045842, 2013	20	2013
Online Learning with Expert Advice and Finite-Horizon Constraints. B Kveton, JY Yu, G Theocharous, S Mannor AAAI, 331-336, 2008	18	2008
A price-based iterative double auction for charger sharing markets J Gao, T Wong, C Wang, JY Yu IEEE Transactions on Intelligent Transportation Systems 23 (6), 5116-5127, 2021	17	2021
Nonhomogeneous place-dependent Markov chains, unsynchronised AIMD, and optimisation FR Wirth, S Stüdli, JY Yu, M Corless, R Shorten Journal of the ACM (JACM) 66 (4), 1-37, 2019	17	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by