Towards the theory of efficient planning in large MDPs with huge state spaces in the presence of linear function approximation

Csaba Szepesvari
University of Alberta