追蹤
Daan Wout
Daan Wout
在 student.tudelft.nl 的電子郵件地址已通過驗證
標題
引用次數
引用次數
年份
Deep reinforcement learning with feedback-based exploration
J Scholten, D Wout, C Celemin, J Kober
2019 IEEE 58th Conference on Decision and Control (CDC), 803-808, 2019
52019
Learning Gaussian policies from corrective human feedback
D Wout, J Scholten, C Celemin, J Kober
arXiv preprint arXiv:1903.05216, 2019
32019
Policy Learning with Human Teachers
D Wout
2019
系統目前無法執行作業,請稍後再試。
文章 1–3