Weijie Su – SILO

SILO: Do Large Language Models Need Statistical Foundations?

Abstract: In this talk, we advocate for the development of rigorous statistical foundations for large language models (LLMs). We begin by elaborating two key features that motivate statistical perspectives for LLMs: (1) the probabilistic, autoregressive nature of next-token prediction, and (2) the complexity and black box nature of Transformer architectures. …

Instructor: Weijie Su

SILO: Do Large Language Models Need Statistical Foundations?

Local Elasticity: A Phenomenological Approach Toward Understanding Deep Learning