Benya Fallenstein

Benya Fallenstein works on basic theoretical questions raised by the challenge of aligning advanced AI systems with human goals. These include decision- and game-theoretic problems that arise when artificial agents reason about future versions of themselves or about other, similarly powerful agents in their environment. Since joining the research team in 2014, she has worked on models of logical uncertainty (uncertainty about which mathematical statements are true), self-reference in higher-order theorem-proving systems, and the specification of safe AI goals. Benya holds a bachelor’s degree in mathematics from the University of Vienna.

