フォロー
Siddhant M. Jayakumar
Siddhant M. Jayakumar
Finster AI
確認したメール アドレス: finster.ai - ホームページ
タイトル
引用先
引用先
Scaling language models: Methods, analysis & insights from training gopher
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
arXiv preprint arXiv:2112.11446, 2021
7762021
Compressive transformers for long-range sequence modelling
JW Rae, A Potapenko, SM Jayakumar, TP Lillicrap
arXiv preprint arXiv:1911.05507, 2019
4632019
Stabilizing transformers for reinforcement learning
E Parisotto, F Song, J Rae, R Pascanu, C Gulcehre, S Jayakumar, ...
International conference on machine learning, 7487-7498, 2020
3472020
Adapting auxiliary losses using gradient similarity
Y Du, WM Czarnecki, SM Jayakumar, M Farajtabar, R Pascanu, ...
arXiv preprint arXiv:1812.02224, 2018
1452018
Distilling policy distillation
WM Czarnecki, R Pascanu, S Osindero, S Jayakumar, G Swirszcz, ...
The 22nd international conference on artificial intelligence and statistics …, 2019
1312019
Multiplicative interactions and where to find them
SM Jayakumar, WM Czarnecki, J Menick, J Schwarz, J Rae, S Osindero, ...
1212020
Memory-based parameter adaptation
P Sprechmann, SM Jayakumar, JW Rae, A Pritzel, AP Badia, B Uria, ...
arXiv preprint arXiv:1802.10542, 2018
1072018
Information asymmetry in KL-regularized RL
A Galashov, SM Jayakumar, L Hasenclever, D Tirumala, J Schwarz, ...
arXiv preprint arXiv:1905.01240, 2019
1022019
Been there, done that: Meta-learning with episodic recall
S Ritter, J Wang, Z Kurth-Nelson, S Jayakumar, C Blundell, R Pascanu, ...
International conference on machine learning, 4354-4363, 2018
982018
Mix & match agent curricula for reinforcement learning
W Czarnecki, S Jayakumar, M Jaderberg, L Hasenclever, YW Teh, ...
International Conference on Machine Learning, 1087-1095, 2018
922018
Top-kast: Top-k always sparse training
S Jayakumar, R Pascanu, J Rae, S Osindero, E Elsen
Advances in Neural Information Processing Systems 33, 20744-20754, 2020
852020
Cyprien de Masson d’Autume
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, F Song, J Aslanides, ...
842021
Meta-learning of sequential strategies
PA Ortega, JX Wang, M Rowland, T Genewein, Z Kurth-Nelson, ...
arXiv preprint arXiv:1905.03030, 2019
832019
Powerpropagation: A sparsity inducing weight reparameterisation
J Schwarz, S Jayakumar, R Pascanu, PE Latham, Y Teh
Advances in neural information processing systems 34, 28889-28903, 2021
512021
Cyprien de Masson d’Autume, Yujia Li, Tayfun Terzi, Vladimir Mikulik, Igor Babuschkin, Aidan Clark, Diego de Las Casas, Aurelia Guy, Chris Jones, James Bradbury, Matthew J
JW Rae, S Borgeaud, T Cai, K Millican, J Hoffmann, HF Song, J Aslanides, ...
Johnson, Blake A. Hechtman, Laura Weidinger, Iason Gabriel, William S. Isaac …, 2021
482021
Reinforcement learning using agent curricula
W Czarnecki, S Jayakumar
US Patent 11,113,605, 2021
82021
Low-pass recurrent neural networks-a memory architecture for longer-term correlation discovery
T Stepleton, R Pascanu, W Dabney, SM Jayakumar, H Soyer, R Munos
arXiv preprint arXiv:1805.04955, 2018
52018
Machine learning systems with memory based parameter adaptation for learning fast and slower
P Sprechmann, S Jayakumar, JW Rae, A Pritzel, AP Badia, O Vinyals, ...
US Patent App. 16/759,561, 2020
42020
Gated attention neural networks
E Parisotto, H Song, JW Rae, SM Jayakumar, ME Jaderberg, R Pascanu, ...
US Patent App. 17/763,984, 2022
22022
Perception-prediction-reaction agents for deep reinforcement learning
A Stooke, V Dalibard, SM Jayakumar, WM Czarnecki, M Jaderberg
arXiv preprint arXiv:2006.15223, 2020
22020
現在システムで処理を実行できません。しばらくしてからもう一度お試しください。
論文 1–20