Follow
Miguel Suau
Title
Cited by
Cited by
Year
Influence-aware Memory Architectures for Deep Reinforcement Learning in POMDPs
M Suau, J He, E Congeduti, RAN Starre, A Czechowski, FA Oliehoek
Neural Computing and Applications 2022, 2019
17*2019
Diseño y construcción de una mini–turbina eólica
M Suau de Castro
Universidad Politécnica de Madrid, 2014
122014
Influence-Augmented Online Planning for Complex Environments
J He, M Suau, F Oliehoek
NeurIPS 2020, 2020
102020
Offline Contextual Bandits for Wireless Network Optimization
M Suau, A Agapitos, D Lynch, D Farrell, M Zhou, A Milenovic
NeurIPS 2021, Offline RL workshop, 2021
62021
Influence-Augmented Local Simulators: a Scalable Solution for Fast Deep RL in Large Networked Systems
M Suau, J He, MTJ Spaan, FA Oliehoek
ICML 2022, 2022
52022
Bad habits: Policy confounding and out-of-trajectory generalization in RL
M Suau, MTJ Spaan, FA Oliehoek
RLC 2024, 2023
42023
Distributed Influence-Augmented Local Simulators for Parallel MARL in Large Networked Systems
M Suau, J He, MM Çelikok, MTJ Spaan, FA Oliehoek
NeurIPS 2022, 2022
32022
Speeding up deep reinforcement learning through influence-augmented local simulators
M Suau, J He, MTJ Spaan, FA Oliehoek
AAMAS 2022, 2022
32022
Online Planning in POMDPs with Self-Improving Simulators
J He, M Suau, H Baier, M Kaisers, FA Oliehoek
IJCAI 2022, 2022
22022
Using Bisimulation Metrics to Analyze and Evaluate Latent State Representations
N Albers, M Suau, FA Oliehoek
BNAIC 2021, 2021
22021
Influence-Based Abstraction in Deep Reinforcement Learning
M Suau, E Congeduti, RAN Starre, AT Czechowski, FA Oliehoek
AAMAS Workshop on Adaptive Learning Agents (ALA), 2019
22019
Leveraging Factored State Representations for Enhanced Efficiency in Reinforcement Learning
M Suau
12024
Learning What to Attend to: Using Bisimulation Metrics to Explore and Improve Upon What a Deep Reinforcement Learning Agent Learns⋆
N Albers, M Suau, FA Oliehoek
TU Delft, 2020
12020
AboutInfluence'
FA Oliehoek, E Congeduti, A Czechowski, J He, A Mey, RAN Starre, ...
2022
Bad Habits: Policy Confounding and Out-of-Trajectory Generalization in Reinforcement Learning
M Suau, MTJ Spaan, FA Oliehoek
The system can't perform the operation now. Try again later.
Articles 1–15