Spring 2025 – Page 2

SILO: Towards Principled AI-Agents with Decentralized and Asymmetric Information

Abstract: AI Models have been increasingly deployed to develop “Autonomous Agents” for decision-making, with prominent application examples including playing Go and video games, robotics, autonomous driving, healthcare, human-assistant, etc. Most such success stories naturally involve multiple AI-agents interacting dynamically with each other and humans. More importantly, these agents oftentimes operate with asymmetric …

SILO: Minimizing quadratics over integers

Abstract: Mixed integer quadratic programming is the problem of minimizing a quadratic polynomial over points in a polyhedral region with some integer components. It is a natural extension of mixed integer linear programming, and it has a wide array of applications. In this talk, I will survey some recent theoretical …

SILO: Neural Operators for Scientific Applications: Learning on Function Spaces

Abstract: Applying AI to scientific problems like weather forecasting and aerodynamics is an active research area, promising to accelerate model development and enable faster scientific discovery and engineering design. In practice, these applications require learning spatiotemporal processes and solutions to partial differential equations on continuous domains at multiple scales – …

SILO: Self-Improving Transformers: Overcoming Length Generalization Challenges

Abstract: Large language models can perform algorithmic tasks through test-time computation but struggle to generalize far beyond the task difficulty of the training distribution. These limitations manifest across even simple tasks like arithmetic, string manipulation, and maze solving, where transformers learn shortcuts rather than the underlying algorithms. While prior solutions …

SILO: Efficiently Searching for Distributions

Abstract: How efficiently can we search distributions? The problem is modeled as follows: we are given knowledge of k discrete distributions v_i for 1 <= i <= k over the domain [n] = {1,…,n} which we can preprocess. Then we get samples from an unknown discrete distribution p, also over …