Systems | Information | Learning | Optimization
 

SILO: Forgetting sensitive data on open-weight models with guarantees

Abstract:  The proliferation of open-weight models trained on vast, public datasets (that often include sensitive data) introduces a critical privacy challenge: how do we erase the influence of sensitive incorrect, or obsolete data after a model is pre-trained? While machine unlearning offers a promising direction, we argue that current methods …

SILO: Characterizing the power of MCMC methods for sparse estimation

Abstract: Markov Chain Monte Carlo (MCMC) and local-search optimization methods have been extensively used in the practice of statistical research for many decades now. However, their exact theoretical performance has been strikingly eluding even for simple parametric estimation tasks. This is in stark contrast to other classes of estimators such …