When will AI exceed human performance? Evidence from AI experts
K Grace, J Salvatier, A Dafoe, B Zhang, O Evans
arXiv preprint arXiv:1705.08807, 2017
Help or hinder: Bayesian models of social goal inference
T Ullman, C Baker, O Macindoe, O Evans, N Goodman, JB Tenenbaum
Advances in Neural Information Processing Systems, 1874-1882, 2009
The malicious use of artificial intelligence: Forecasting, prevention, and mitigation
M Brundage, S Avin, J Clark, H Toner, P Eckersley, B Garfinkel, A Dafoe, ...
arXiv preprint arXiv:1802.07228, 2018
Learning the Preferences of Ignorant, Inconsistent Agents
O Evans, A Stuhlmüller, ND Goodman
Proceedings of the 30th AAAI Conference on Artificial Intelligence (AAAI-2016), 2016
Trial without error: Towards safe reinforcement learning via human intervention
W Saunders, G Sastry, A Stuhlmüller, O Evans
Proceedings of the 17th International Conference on Autonomous Agents and …, 2018
Learning the Preferences of Bounded Agents
O Evans, A Stuhlmüller, ND Goodman
Advances in Neural Information Processing Systems (Bounded Optimality Workshop), 2015
Learning structured preferences
O Evans, L Bergen, JB Tenenbaum
Proceedings of the 32nd Annual Conference of the Cognitive Science Society, 2010
Agent-Agnostic Human-in-the-Loop Reinforcement Learning
D Abel, J Salvatier, A Stuhlmüller, O Evans
arXiv preprint arXiv:1701.04079, 2017
Active Reinforcement Learning: Observing Rewards at a Cost
D Krueger, J Leike, O Evans, J Salvatier
NIPS 2016 Workshop, 2016
Modeling Agents with Probabilistic Programs
O Evans, A Stuhlmüller, J Salvatier, D Filan
agentmodels.org, 2017
Predicting Human Deliberative Judgments with Machine Learning
O Evans, A Stuhlmüller, C Cundy, R Carey, Z Kenton, T McGrath, ...
FHI Technical Report, 2018
Active Reinforcement Learning with Monte-Carlo Tree Search
S Schulze, O Evans
arXiv preprint arXiv:1803.04926, 2018
