3 — Stuart Russell

Apr 21

Human Compatible AI

April 21, 2021
_____________

Today’s guest is Stuart Russell, and when it comes to AI you might just say, “he wrote the book.” In fact, Stuart is a co-author of the standard textbook that is used to teach AI at universities across the world. He has also written multiple books for general audiences, testifying to both his range and prolific body of work.

Stuart is currently a Professor of Computer Science at UC Berkeley (where he is also Director of the Center for Human Compatible AI) and has been a renowned voice in the AI field for years.

In his latest book — “Human Compatible: AI and the Problem of Control,” Stuart asserts that if we continue to design AI based on optimizing for fixed objectives (the standard approach), it will evolve in the context of superhuman AI to create disastrous consequences that unfold outside of our control. Stuart also explains this as "the King Midas Problem” of AI.

Thankfully, he proposes a new approach — derived from inverse reinforcement learning and designated “provably beneficial AI”— that just might save us from this fate. In this model, AI is designed to 1) optimize for human preferences yet 2) is inherently uncertain about those preferences and 3) deferential to human behavior in figuring those out over time.

“In this new model, the machine will allow itself to be switched off. The incentive to allow yourself to be switched off comes directly from the uncertainty about the objective — and the fact that the human is ‘the owner’ of the objective.”

So how do we get to a place where this model becomes the industry standard? Stuart walks us through the practical mechanics of standing this up. We’ll discuss the behavioral and collective challenge of identifying human preferences and steps that must first happen through research to change the industry’s foundation for building AI.

We also couldn’t end the conversation without briefly touching on the opportunity to promote human thriving in a new paradigm for the future of work. Whether you’re a casual observer or have been working in the field for years, my guess is you will come away from this conversation with a better understanding of how we should — how we must — think about controlling systems with capabilities that exceed our own.

Your Host,

listen to Full Episodes

Aaina Agarwal

3 — Stuart Russell

Human Compatible AI

listen to Full Episodes

4 — Eileen Donahoe

2 — Mark Surman

Indivisible