SILO: Invariant Low-Dimensional Subspaces in Gradient Descent for Learning Deep Networks Qing Qu, UMich