R. Srikant – SILO

SILO: Finite‑Time Bounds for Robust Reinforcement Learning with Linear Function Approximation

Abstract Robust reinforcement learning (RL) focuses on designing optimal policies from data for MDPs with model uncertainties. Existing convergence guarantees for robust RL are either limited to tabular settings or use restrictive assumptions in the function approximation setting. We will present an RL algorithm for learning the optimal policy from …