Towards Data Efficient Monte Carlo Estimates in Reinforcement Learning
Josiah Hanna
University of Wisconsin–Madison
Josiah Hanna
University of Wisconsin–Madison
Jeff Linderoth
University of Wisconsin–Madison
Greg Canal
University of Wisconsin–Madison
How does a 110-layer ResNet learn a high-complexity classifier using relatively few training examples and short training time? We present a theory towards explaining this learning process in terms of hierarchical learning. We refer to hierarchical learning as the learner learns to represent a complicated target function by decomposing it into a …
Recent advances of computational robust statistics have produced efficient estimators with provable near-optimal statistical guarantees for a variety of problems. These estimators often involve non-convex optimization, and it is not clear why these non-convex problems are efficiently solvable, but many classical non-convex formulations are not. We make an attempt to …