StarCoder: may the source be with you! R Li, LB Allal, Y Zi, N Muennighoff, D Kocetkov, C Mou, M Marone, C Akiki, ... Transactions on Machine Learning Research (TMLR 2023), 2023 | 1081* | 2023 |
An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models N Meade, E Poole-Dayan, S Reddy Association for Computational Linguistics (ACL 2022), 2022 | 201 | 2022 |
Evaluating correctness and faithfulness of instruction-following models for question answering V Adlakha, P BehnamGhader, XH Lu, N Meade, S Reddy Transactions of the Association for Computational Linguistics (TACL 2024), 2023 | 128 | 2023 |
Evaluating the faithfulness of importance measures in nlp by recursively masking allegedly important tokens and retraining A Madsen, N Meade, V Adlakha, S Reddy Findings of Empirical Methods in Natural Language Processing (EMNLP 2022), 2022 | 41 | 2022 |
Using In-Context Learning to Improve Dialogue Safety N Meade, S Gella, D Hazarika, P Gupta, D Jin, S Reddy, Y Liu, ... Findings of Empirical Methods in Natural Language Processing (EMNLP 2023), 2023 | 34 | 2023 |
Exploring conditioning for generative music systems with human-interpretable controls N Meade, N Barreyre, SC Lowe, S Oore International Conference on Computational Creativity (ICCC 2019), 2019 | 23 | 2019 |
Universal adversarial triggers are not universal N Meade, A Patel, S Reddy arXiv preprint arXiv:2404.16020, 2024 | 7 | 2024 |
Exploiting Instruction-Following Retrievers for Malicious Information Retrieval P BehnamGhader, N Meade, S Reddy arXiv preprint arXiv:2503.08644, 2025 | | 2025 |
SafeArena: Evaluating the Safety of Autonomous Web Agents AD Tur, N Meade, XH Lù, A Zambrano, A Patel, E Durmus, S Gella, ... arXiv preprint arXiv:2503.04957, 2025 | | 2025 |
SafeArena: Evaluating the Safety of Autonomous Web Agents A Defne Tur, N Meade, XH Lù, A Zambrano, A Patel, E Durmus, S Gella, ... arXiv e-prints, arXiv: 2503.04957, 2025 | | 2025 |
Societal Alignment Frameworks Can Improve LLM Alignment K Stańczak, N Meade, M Bhatia, H Zhou, K Böttinger, J Barnes, J Stanley, ... arXiv preprint arXiv:2503.00069, 2025 | | 2025 |