A survey of zero-shot generalisation in deep reinforcement learning R Kirk, A Zhang, E Grefenstette, T Rocktäschel Journal of Artificial Intelligence Research 76, 201-264, 2023 | 276* | 2023 |
MiniHack the planet: A sandbox for open-ended reinforcement learning research M Samvelyan, R Kirk, V Kurin, J Parker-Holder, M Jiang, E Hambro, ... arXiv preprint arXiv:2109.13202, 2021 | 70 | 2021 |
Understanding the effects of RLHF on LLM generalisation and diversity R Kirk, I Mediratta, C Nalmpantis, J Luketina, E Hambro, E Grefenstette, ... arXiv preprint arXiv:2310.06452, 2023 | 22 | 2023 |
Reward model ensembles help mitigate overoptimization T Coste, U Anwar, R Kirk, D Krueger arXiv preprint arXiv:2310.02743, 2023 | 20 | 2023 |
Insights from the NeurIPS 2021 NetHack challenge E Hambro, S Mohanty, D Babaev, M Byeon, D Chakraborty, ... NeurIPS 2021 Competitions and Demonstrations Track, 41-52, 2022 | 17 | 2022 |
Mechanistically analyzing the effects of fine-tuning on procedurally defined tasks S Jain, R Kirk, ES Lubana, RP Dick, H Tanaka, E Grefenstette, ... arXiv preprint arXiv:2311.12786, 2023 | 15 | 2023 |
Generalization to new sequential decision making tasks with in-context learning SC Raparthy, E Hambro, R Kirk, M Henaff, R Raileanu arXiv preprint arXiv:2312.03801, 2023 | 4 | 2023 |
Graph backup: Data efficient backup exploiting Markovian transitions Z Jiang, T Zhang, R Kirk, T Rocktäschel, E Grefenstette arXiv preprint arXiv:2205.15824, 2022 | 4* | 2022 |
A study of off-policy learning in environments with procedural content generation A Ehrenberg, R Kirk, M Jiang, E Grefenstette, T Rocktäschel ICLR Workshop on Agent Learning in Open-Endedness, 2022 | 4 | 2022 |
Leading the Pack: N-player Opponent Shaping A Souly, T Willi, A Khan, R Kirk, C Lu, E Grefenstette, T Rocktäschel arXiv preprint arXiv:2312.12564, 2023 | 1 | 2023 |
Domain Generalization for Robust Model-Based Offline Reinforcement Learning A Clark, SA Siddiqui, R Kirk, U Anwar, S Chung, D Krueger arXiv preprint arXiv:2211.14827, 2022 | 1 | 2022 |
What Mechanisms Does Knowledge Distillation Distill? C Wu, ES Lubana, BK Mlodozeniec, R Kirk, D Krueger UniReps: the First Workshop on Unifying Representations in Neural Models, 2023 | | 2023 |