- Presence of linear inequality constraints,
- Compression of gradients in order to minimize communication during distributed computation.
In both cases for convergence to a second-order stationary point linear dependence on the dimension is currently required in otherwise (almost) dimension-free methods. It remains open whether such dependence is necessary.
Joint work with Dmitrii Avdiukhin (Indiana University, Bloomington) and Chi Jin (Princeton).