Follow
Aidan Gomez
Aidan Gomez
Cohere
Verified email at cohere.ai - Homepage
Title
Cited by
Cited by
Year
Attention is all you need
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
Advances in neural information processing systems 30, 2017
1119192017
Advances in neural information processing systems
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
Attention is All you Need, 2017
7712017
Tensor2tensor for neural machine translation
A Vaswani, S Bengio, E Brevdo, F Chollet, AN Gomez, S Gouws, L Jones, ...
arXiv preprint arXiv:1803.07416, 2018
5992018
The reversible residual network: Backpropagation without storing activations
AN Gomez, M Ren, R Urtasun, RB Grosse
Advances in neural information processing systems 30, 2017
5232017
One model to learn them all
L Kaiser, AN Gomez, N Shazeer, A Vaswani, N Parmar, L Jones, ...
arXiv preprint arXiv:1706.05137, 2017
3782017
Disease variant prediction with deep generative models of evolutionary data
J Frazer, P Notin, M Dias, A Gomez, JK Min, K Brock, Y Gal, DS Marks
Nature 599 (7883), 91-95, 2021
3522021
Depthwise Separable Convolutions for Neural Machine Translation
L Kaiser, AN Gomez, F Chollet
International Conference on Learning Representations, 2018
3362018
A systematic comparison of Bayesian deep learning robustness in diabetic retinopathy tasks
A Filos, S Farquhar, AN Gomez, TGJ Rudner, Z Kenton, L Smith, ...
arXiv preprint arXiv:1912.10481, 2019
123*2019
Learning Sparse Networks Using Targeted Dropout
AN Gomez, I Zhang, S Rao Kamalakara, D Madaan, K Swersky, Y Gal, ...
arXiv preprint arXiv:1905.13678, 2019
1132019
Tranception: protein fitness prediction with autoregressive transformers and inference-time retrieval
P Notin, M Dias, J Frazer, JM Hurtado, AN Gomez, D Marks, Y Gal
International Conference on Machine Learning, 16990-17017, 2022
1012022
Attention is all you need. 2017. doi: 10.48550
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
arXiv preprint ARXIV.1706.03762, 0
87
The difficulty of training sparse neural networks
U Evci, F Pedregosa, A Gomez, E Elsen
arXiv preprint arXiv:1906.10732, 2019
852019
Unsupervised cipher cracking using discrete GANs
AN Gomez, S Huang, I Zhang, BM Li, M Osama, L Kaiser
arXiv preprint arXiv:1801.04883, 2018
812018
31st Conference on Neural Information Processing Systems (NIPS 2017)
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
Long Beach, CA, 2017
802017
Attention is all you need. arXiv, 2017. doi: 10.48550
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez, ...
arXiv preprint arXiv.1706.03762 1706, 0
79
Self-attention between datapoints: Going beyond individual input-output pairs in deep learning
J Kossen, N Band, C Lyle, AN Gomez, T Rainforth, Y Gal
Advances in Neural Information Processing Systems 34, 28742-28756, 2021
752021
Prioritized training on points that are learnable, worth learning, and not yet learnt
S Mindermann, JM Brauner, MT Razzak, M Sharma, A Kirsch, W Xu, ...
International Conference on Machine Learning, 15630-15649, 2022
632022
undefinedukasz Kaiser, and Illia Polosukhin. 2017
A Vaswani, N Shazeer, N Parmar, J Uszkoreit, L Jones, AN Gomez
Attention is All You Need (NIPS’17). Curran Associates Inc., Red Hook, NY …, 2017
512017
Wat zei je? detecting out-of-distribution translations with variational transformers
TZ Xiao, AN Gomez, Y Gal
arXiv preprint arXiv:2006.08344, 2020
30*2020
Attention-based sequence transduction neural networks
NM Shazeer, AN Gomez, LM Kaiser, JD Uszkoreit, LO Jones, NJ Parmar, ...
US Patent 10,452,978, 2019
292019
The system can't perform the operation now. Try again later.
Articles 1–20