CodeIt: Self-Improving Language Models with Prioritized Hindsight Replay N Butt, B Manczak, A Wiggers, C Rainone, D Zhang, M Defferrard, ... arXiv preprint arXiv:2402.04858, 2024 | 2 | 2024 |
Towards Self-Improving Language Models for Code Generation M Defferrard, C Rainone, DW Zhang, B Manczak, N Butt, T Cohen ICLR 2024 Workshop on Large Language Model (LLM) Agents, 0 | | |