Views Navigation

Event Views Navigation

Calendar of Events

S Sun

M Mon

T Tue

W Wed

T Thu

F Fri

S Sat

0 events,

0 events,

0 events,

0 events,

0 events,

1 event,

Statistics and Data Science Seminar Series Robert Nowak

0 events,

0 events,

0 events,

0 events,

0 events,

0 events,

1 event,

Statistics and Data Science Seminar Series Shay Moran

0 events,

0 events,

0 events,

0 events,

0 events,

0 events,

1 event,

Statistics and Data Science Seminar Series Benjamin Recht

0 events,

0 events,

0 events,

0 events,

0 events,

0 events,

0 events,

0 events,

0 events,

0 events,

0 events,

0 events,

0 events,

1 event,

Statistics and Data Science Seminar Series Yuejie Chi

0 events,

Function Space Perspectives on Neural Networks

Robert Nowak (University of Wisconsin-Madison)
E18-304

Abstract: This talk reviews a theory of the functions learned by ReLU neural networks. At its core is the observation that deep ReLU networks can be characterized as solutions to data-fitting problems in certain infinite dimensional function spaces. The solutions are compositions of functions from Banach spaces of second-order bounded variation, defined in the Radon…

Find out more »

Characterizations of Uniform Learnability: Vapnik–Chervonenkis, Natarajan, and Daniely–Shalev Shwartz Dimensions

Shay Moran (Technion and Google Research)
E18-304

Abstract: One of the central goals in statistical learning theory is to understand which hypothesis classes can be learned from data. A landmark result in this direction is the Fundamental Theorem of PAC Learning, which characterizes the learnability of binary classification problems via the VC dimension, which was introduced by Vapnik and Chervonenkis in the…

Find out more »

The Irrational Decision

Benjamin Recht (University of California, Berkeley)
E18-304

Abstract: This talk traces the intellectual history of automated decision-making to its origins in the post-World War II development of computers. Mathematicians of the 1940s set out to design machines that could act as ideal rational agents in the face of uncertainty. In this pursuit, a cluster of foundational mathematical technologies—including linear programming, game theory,…

Find out more »

Transformers Learn Generalizable Chain-of-Thought Reasoning via Gradient Descent

Yuejie Chi (Yale University)
E18-304

Abstract: Transformers have demonstrated remarkable chain-of-thought reasoning capabilities, yet, the underlying mechanisms by which they acquire and extrapolate these capabilities remain limited. This talk presents a theoretical analysis of transformers trained via gradient descent for symbolic reasoning and state tracking tasks with increasing problem complexity. Our analysis reveals the coordination of multi-head attention to solve…

Find out more »


MIT Institute for Data, Systems, and Society
Massachusetts Institute of Technology
77 Massachusetts Avenue
Cambridge, MA 02139-4307
617-253-1764