Follow
Zachary Kenton
Zachary Kenton
Google DeepMind
Verified email at google.com - Homepage
Title
Cited by
Cited by
Year
Ethical and social risks of harm from language models
L Weidinger, J Mellor, M Rauh, C Griffin, J Uesato, PS Huang, M Cheng, ...
arXiv preprint arXiv:2112.04359, 2021
5662021
Three factors influencing minima in sgd
S Jastrzębski, Z Kenton, D Arpit, N Ballas, A Fischer, Y Bengio, A Storkey
arXiv preprint arXiv:1711.04623, 2017
4572017
Taxonomy of risks posed by language models
L Weidinger, J Uesato, M Rauh, C Griffin, PS Huang, J Mellor, A Glaese, ...
Proceedings of the 2022 ACM Conference on Fairness, Accountability, and …, 2022
2932022
Alignment of language agents
Z Kenton, T Everitt, L Weidinger, I Gabriel, V Mikulik, G Irving
arXiv preprint arXiv:2103.14659, 2021
1092021
A systematic comparison of bayesian deep learning robustness in diabetic retinopathy tasks
A Filos, S Farquhar, AN Gomez, TGJ Rudner, Z Kenton, L Smith, ...
arXiv preprint arXiv:1912.10481, 2019
1082019
On the relation between the sharpest directions of DNN loss and the SGD step length
S Jastrzębski, Z Kenton, N Ballas, A Fischer, Y Bengio, A Storkey
arXiv preprint arXiv:1807.05031, 2018
1052018
Specification gaming: the flip side of AI ingenuity
V Krakovna, J Uesato, V Mikulik, M Rahtz, T Everitt, R Kumar, Z Kenton, ...
DeepMind Blog 3, 2020
872020
Imitating interactive intelligence
J Abramson, A Ahuja, I Barr, A Brussee, F Carnevale, M Cassin, ...
arXiv preprint arXiv:2012.05672, 2020
712020
Goal misgeneralization: why correct specifications aren't enough for correct goals
R Shah, V Varma, R Kumar, M Phuong, V Krakovna, J Uesato, Z Kenton
arXiv preprint arXiv:2210.01790, 2022
452022
The squeezed limit of the bispectrum in multi-field inflation
Z Kenton, DJ Mulryne
Journal of Cosmology and Astroparticle Physics 2015 (10), 018, 2015
402015
D-brane potentials in the warped resolved conifold and natural inflation
Z Kenton, S Thomas
Journal of High Energy Physics 2015 (2), 1-42, 2015
362015
Finding flatter minima with sgd
S Jastrzębski, Z Kenton, D Arpit, N Ballas, A Fischer, Y Bengio, A Storkey
332018
Width of minima reached by stochastic gradient descent is influenced by learning rate to batch size ratio
S Jastrzębski, Z Kenton, D Arpit, N Ballas, A Fischer, Y Bengio, A Storkey
Artificial Neural Networks and Machine Learning–ICANN 2018: 27th …, 2018
282018
Generalizing from a few environments in safety-critical reinforcement learning
Z Kenton, A Filos, Y Gal, O Evans
Safe Machine Learning workshop at ICLR, 2019
24*2019
The separate universe approach to soft limits
Z Kenton, DJ Mulryne
Journal of Cosmology and Astroparticle Physics 2016 (10), 035, 2016
232016
Benchmarking Bayesian deep learning with diabetic retinopathy diagnosis
A Filos, S Farquhar, AN Gomez, TGJ Rudner, Z Kenton, L Smith, ...
Preprint at https://arxiv. org/abs/1912.10481, 2019
212019
Generating the cosmic microwave background power asymmetry with
Z Kenton, DJ Mulryne, S Thomas
Physical Review D 92 (2), 023505, 2015
202015
Predicting Human Deliberative Judgments with Machine Learning
O Evans, A Stuhlmüller, C Cundy, R Carey, Z Kenton, T McGrath, ...
https://zackenton.github.io/files/predicting_judgments_final.pdf, 2018
152018
Explaining grokking through circuit efficiency
V Varma, R Shah, Z Kenton, J Kramár, R Kumar
arXiv preprint arXiv:2309.02390, 2023
142023
Discovering agents
Z Kenton, R Kumar, S Farquhar, J Richens, M MacDermott, T Everitt
Artificial Intelligence 322, 103963, 2023
142023
The system can't perform the operation now. Try again later.
Articles 1–20