Edward Hu
Edward Hu
Verified email at - Homepage
Cited by
Cited by
LoRA: Low-Rank Adaptation of Large Language Models
EJ Hu, Y Shen, P Wallis, Z Allen-Zhu, Y Li, S Wang, L Wang, W Chen
International Conference on Learning Representations, 2022
Tensor programs iv: Feature learning in infinite-width neural networks
G Yang, EJ Hu
International Conference on Machine Learning, 11727-11737, 2021
Randomized smoothing of all shapes and sizes
G Yang, T Duan, EJ Hu, H Salman, I Razenshteyn, J Li
International Conference on Machine Learning, 10693-10705, 2020
Collecting Diverse Natural Language Inference Problems for Sentence Representation Evaluation
A Poliak, A Haldar, R Rudinger, EJ Hu, E Pavlick, AS White, B Van Durme
Proceedings of the 2018 Conference on Empirical Methods in Natural Language …, 2018
Improved Lexically Constrained Decoding for Translation and Monolingual Rewriting
EJ Hu, H Khayrallah, R Culkin, P Xia, T Chen, M Post, B Van Durme
Proceedings of the 2019 Conference of the North American Chapter of the …, 2019
Tuning large neural networks via zero-shot hyperparameter transfer
G Yang, EJ Hu, I Babuschkin, S Sidor, X Liu, D Farhi, N Ryder, J Pachocki, ...
Advances in Neural Information Processing Systems 34, 17084-17097, 2021
Gflownet foundations
Y Bengio, S Lahlou, T Deleu, EJ Hu, M Tiwari, E Bengio
Journal of Machine Learning Research 24 (210), 1-55, 2023
ParaBank: Monolingual bitext generation and sentential paraphrasing via lexically-constrained neural machine translation
EJ Hu, R Rudinger, M Post, B Van Durme
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 6521-6528, 2019
Large-scale, Diverse, Paraphrastic Bitexts via Sampling and Clustering
EJ Hu, A Singh, N Holzenberger, M Post, B Van Durme
Proceedings of the 23rd Conference on Computational Natural Language …, 2019
GFlowNets and variational inference
N Malkin, S Lahlou, T Deleu, X Ji, EJ Hu, K Everett, D Zhang, Y Bengio
arXiv preprint arXiv:2210.00580, 2022
GFlowNet-EM for learning compositional latent variable models
EJ Hu, N Malkin, M Jain, KE Everett, A Graikos, Y Bengio
International Conference on Machine Learning, 13528-13549, 2023
Improved Image Wasserstein Attacks and Defenses
EJ Hu, A Swaminathan, H Salman, G Yang
arXiv preprint arXiv:2004.12478, 2020
Efficient computation of deep nonlinear infinite-width neural networks that learn features
G Yang, M Santacroce, EJ Hu
International Conference on Learning Representations, 2022
Iterative paraphrastic augmentation with discriminative span alignment
R Culkin, EJ Hu, E Stengel-Eskin, G Qin, BV Durme
Transactions of the Association for Computational Linguistics 9, 494-509, 2021
Amortizing intractable inference in large language models
EJ Hu, M Jain, E Elmoznino, Y Kaddar, G Lajoie, Y Bengio, N Malkin
arXiv preprint arXiv:2310.04363, 2023
Differentiable Tree Operations Promote Compositional Generalization
P Soulos, EJ Hu, K McCurdy, Y Chen, R Fernandez, P Smolensky, J Gao
International Conference on Machine Learning, 32499-32520, 2023
NIST TAC SM-KBP 2019 System Description: JHU/UR Framework.
Y Chen, S Ebner, T Chen, P Xia, E Stengel-Eskin, TR Su, EJ Hu, ...
TAC, 2019
GFlowNets for Causal Discovery: an Overview
DC Manta, EJ Hu, Y Bengio
ICML 2023 Workshop on Structured Probabilistic Inference {\&} Generative …, 2023
The system can't perform the operation now. Try again later.
Articles 1–18