Distributional hamilton-jacobi-bellman equations for continuous-time reinforcement learning HE Wiltzer, D Meger, MG Bellemare International Conference on Machine Learning, 23832-23856, 2022 | 8 | 2022 |
Policy optimization in a noisy neighborhood: On return landscapes in continuous control N Rahn, P D'Oro, H Wiltzer, PL Bacon, M Bellemare Advances in Neural Information Processing Systems 36, 2024 | 3 | 2024 |
A Distributional Analogue to the Successor Representation H Wiltzer, J Farebrother, A Gretton, Y Tang, A Barreto, W Dabney, ... arXiv preprint arXiv:2402.08530, 2024 | | 2024 |
A Distributional Analogue to the Successor Representation J Farebrother, H Wiltzer, A Gretton, Y Tang, A Barreto, W Dabney, ... | | 2023 |
On the Evolution of Return Distributions in Continuous-Time Reinforcement Learning H Wiltzer McGill University (Canada), 2021 | | 2021 |