
Special: Defeating AI Defenses (with Nicholas Carlini and Nathan Labenz)

Future of Life Institute Podcast

In this special episode, we feature Nathan Labenz interviewing Nicholas Carlini on the Cognitive Revolution podcast. Nicholas Carlini works as a security researcher at Google DeepMind, and has published extensively on adversarial machine learning and cybersecurity. Carlini discusses his pioneering work on adversarial attacks against image classifiers, and the challenges of ensuring neural network robustness. He examines the difficulties of defending against such attacks, the role of human intuition in his approach, open-source AI, and the potential for scaling AI security research.  

00:00 Nicholas Carlini's contributions to cybersecurity

08:19 Understanding attack strategies 

29:39 High-dimensional spaces and attack intuitions 

51:00 Challenges in open-source model safety 

01:00:11 Unlearning and fact editing in models 

01:10:55 Adversarial examples and human robustness 

01:37:03 Cryptography and AI robustness 

01:55:51 Scaling AI security research

Transcript

Introduction to Special Episode

00:00:00
Speaker
Welcome to the Future of Life Institute podcast. My name is Gus Docker, and this is a special episode of the podcast featuring Nathan Labenz interviewing Nicholas Carlini on the Cognitive Revolution podcast.
00:00:13
Speaker
Please enjoy. Nicholas Carlini, security researcher at Google DeepMind. Welcome to the Cognitive Revolution. Yeah, it's great to be here. Thanks for having me. I'm excited for this.

Nicholas Carlini's Recent Appearances

00:00:23
Speaker
So I guess, quick context: you recently had an appearance on Machine Learning Street Talk, maybe out 10 days ago or so as of the moment we're recording.
00:00:34
Speaker
I thought that was excellent. So shout out to MLST for another great episode. Hopefully we'll cover you know largely very different ground here. But I do recommend people check out that episode as well for another angle on your thinking and your understanding of everything that's going on in AI.

Carlini's Contributions to Cybersecurity

00:00:52
Speaker
One thing that was said in that episode, which caught my attention, I haven't fully fact checked it, was that you have created, demonstrated, and I guess published more attacks on cybersecurity and machine learning defenses than the rest of the field combined.
00:01:08
Speaker
You can tell me if you think that's literally true, but I did look up your Google Scholar page: 21 papers in 2024 alone was what I counted there. So, is this literally true? So I think the statement that probably is literally true is: if you count the number of papers where I am a co-author and the number of defenses broken in those papers, and then you count the number of papers where I am not a co-author,
00:01:34
Speaker
among the papers that break adversarial example defenses on image classifiers, then as of, I don't know, last year, that statement probably was true.
00:01:46
Speaker
So with caveats, yes, but for a very specific domain, for a very particular kind of thing. And probably mostly just because this is a thing that I, for some reason, enjoy doing and will just do before other people get to it, so other people just don't do it as much. But yeah, for that one particular claim, that probably is correct.
00:02:08
Speaker
Cool.

Inspiration and Success in Research

00:02:09
Speaker
Well, you're a careful thinker and communicator. What I hope to do, maybe above all, in this episode is to develop my intuition, and hopefully help other people develop their intuitions, for the habits of mind, approaches, mental models, what have you, that have allowed you to be so successful in this space. So hopefully this can be a little bit of a crash course that maybe inspires some new people to think that they can get into the field and make an impact as well.
00:02:35
Speaker
So I guess the first question is: is everything easy for you to break? Like, 21 papers in 2024 alone is obviously a lot.
00:02:44
Speaker
Yeah, okay, so to be clear, I finished my PhD in 2018, so I've been out for a while, and I've had a lot of time to meet a lot of great co-authors. Even so, 21 seemed like a lot to me; I was trying to think through how many I can remember.
00:03:00
Speaker
I think a large part of this is that for many of these results, it's the kind of thing where I would show up to the weekly meetings, help write the paper, direct the experiments in some of them, but I was not writing the CUDA code myself. And that's how you get a lot of things done. You see this happen for everyone who's been in the field for a long time: the marginal value of an hour of my time could be spent either on very, very low-level stuff with GPUs, or on passing along bits of wisdom I've learned over the past ten years that help a PhD student get a lot done in a much

Advising and Collaborating on Research

00:03:33
Speaker
shorter amount of time. This is why people go into faculty positions.
00:03:35
Speaker
I think the balance for me is that I try to also spend at least half of my time only on papers that I'm technically driving. And so when you say you've had this number of papers, what I think of is, well, maybe here are the three papers that I think of as my papers, where I actually was the person doing the experiments, and I could tell you about every final sentence of what's going on. And for those ones I have a very strong sense of what's there. And then the other ones are the standard setup of
00:04:01
Speaker
a professor who's advising grad students, except that instead of being in academia, I'm in industry. And so I advise and help on other students' papers in some ways.

Challenges in Breaking Defenses

00:04:10
Speaker
Gotcha. You know, across all these things, regardless of your role, was there anything as you look back over the last year or more that was legitimately like very hard to break? Or are you guys basically finding that all of the defenses that the field is coming up with are rather easy for you to break? Yeah, in this last year, we didn't spend that much time...
00:04:35
Speaker
breaking particular defenses. We have like maybe two or three papers on that.

Current Research on Neural Network Privacy

00:04:40
Speaker
We spent most of our time on other areas: trying to understand to what extent attacks are possible, to understand the real-world vulnerability of models to certain types of attacks, to do some general privacy analysis, not saying this particular defense is wrong, but rather, for all neural networks trained with gradient descent, here is an interesting property about their privacy.
00:05:07
Speaker
You have a lot of these kinds of results that are not really focused on breaking one particular thing. I think last year I maybe only had two papers that were specifically about breaking things.
00:05:19
Speaker
One was early in the year: there was a defense published at IEEE S&P, which is one of the top conferences in the security field, and it was an adversarial example defense.
00:05:32
Speaker
And this one I broke, and it turned out to be relatively easy, I don't know, an hour or two. This one was sort of abnormally easy, but okay, maybe not that abnormally. Yeah, I think adversarial example defenses on image classifiers are a particular beast that I have gotten very good at, and the attacks are relatively well understood,
00:05:55
Speaker
and there are lots of known failure modes. And so when I'm doing this, I'm not developing new science. I'm just going through this long list of things I've broken before: what's the pattern that this one falls into? Okay, here's the pattern.
00:06:06
Speaker
It turns out that the gradients are not flowing because the softmax is saturated to one. What do you do? Make sure the softmax doesn't saturate, and then you find you can break it, and it works very, very quickly. And so that's what I did for that paper. Very much just an engineering kind of result: why is the softmax giving gradients that are identically zero?
00:06:24
Speaker
And once you figure out the answer is because of some discretization or whatever the case might be, then everything is easy for that.
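As a concrete illustration of that failure mode, here is a minimal sketch (an illustration, not code from the paper in question) of how a saturated softmax can yield exactly-zero gradients in float32, and how working in logit space restores a usable signal. The logit values and target index are made up for the demo.

```python
import torch
import torch.nn.functional as F

# Hypothetical defended-model output: one logit so large that, in float32,
# the softmax probability of every other class underflows to exactly 0.0.
logits = torch.tensor([[200.0, 0.0, 0.0]], requires_grad=True)
target = 1  # class the attacker wants the model to predict

probs = F.softmax(logits, dim=-1)
probs[0, target].backward()          # "push up the target probability" objective
print(logits.grad)                   # all zeros: the probability underflowed, so no signal

# Fix: compute the objective from the logits (log-softmax) instead of the probabilities.
logits2 = logits.detach().clone().requires_grad_(True)
F.log_softmax(logits2, dim=-1)[0, target].backward()
print(logits2.grad)                  # nonzero gradient the attack can actually follow
```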

Attacking Unfine-Tunable Models

00:06:30
Speaker
The other paper that was maybe more interesting is one of these advising papers where I didn't do any of the technical work, but was helping a couple of students think through what it means to consider the robustness of, instead of adversarial example defenses, which are these test-time evasion attacks where you perturb the image a little bit and it turns a picture of, I don't know, a panda into something else,
00:06:51
Speaker
instead, we were looking in this paper at what are called unfine-tunable models, which are models designed to be ones you can release as open source: the weights are available to anyone, and they're supposed to be impossible to fine-tune to do other tasks.
00:07:08
Speaker
And the particular concern these defenses were looking at is that you would ideally want to make sure that no model you trained is going to help someone produce bioweapons or something,
00:07:19
Speaker
whatever the threat model is you're thinking about. And you can make it so that there's some safety in your model initially, but if you release the model's open weights, then anyone can fine-tune it and remove the safety filters that you've put in place.
00:07:31
Speaker
And these unfine-tunable models are supposed to be designed to be not only robust to these initial adversarial-example-style attacks, but also robust to someone who can perturb the weights.
00:07:42
Speaker
And so in this paper, there were a couple of students who were doing a bunch of work on attacking these models to show that you actually can still fine tune them even though they've been trained to be unfine-tunable.
00:07:53
Speaker
And a bunch of the thoughts that we've had in the last, you know, five, ten years on adversarial examples went into this, the same kinds of lessons, but a bunch of the techniques were very different, and so the students had to spend a bunch of work actually getting this to work out.
00:08:08
Speaker
So I want to dig in on that one in particular, because that, I agree, strikes me as one of the most important and interesting cat and mouse games going on in the space right now.

Optimizing Attack Strategies

00:08:19
Speaker
Before zooming in on that, though, you said, like, when I see something new, I sort of have this Rolodex of past things and paradigms that I can quickly go through.
00:08:29
Speaker
Could you sort of sketch those out for us? Like, how do you organize the space of attacks? Is it a hierarchy or some sort of other taxonomy? I'd love to get a sense for what your mental palace of attacks looks like.
00:08:45
Speaker
Okay. Let me separate off one space of attacks, which is this newer one of a human typing at a keyboard, prompting the model to make it say a bad thing. Let's put aside for a second these kinds of attacks that treat the model like a human and try to social-engineer it into doing something bad.
00:08:59
Speaker
Once you put that aside, then for almost all attacks, the way you run the attack is you try to do some kind of gradient descent to maximize some particular loss function.
00:09:10
Speaker
So for an image adversarial example, what does this mean? I have an image of a stop sign. I want to know what sticker I can put on the stop sign to make it be recognized as a 45-mile-an-hour speed limit sign. How do I do this?
00:09:23
Speaker
I perform gradient descent to compute the optimal sticker so that the thing becomes misclassified. Or, in the case of poisoning, where you modify a training data point in order to make the model produce an error,
00:09:38
Speaker
you're trying to optimize the particular poisoned data point you have in the training data set so that the model makes a mistake. Or, in the case of these unfine-tunable models, you have a model that someone wants to make sure no one can edit, and you try to find a way to take gradients on the model to update the parameters so that it performs some bad thing.
00:09:56
Speaker
And so... In all of these attacks, there are essentially two things you need to concern yourself with. One is, what is the objective that you're maximizing or minimizing? Like, what is the specific loss function you're using?
00:10:09
Speaker
And the other is, what is the optimization technique that you are using to make that number go up? And both of these are the two things you can play with. And by coming up with the best possible versions of each of these, you end up with very strong attacks.
00:10:23
Speaker
And so a big part of doing these kinds of attacks when you're doing this gradient-based optimization thing is coming up with high-quality functions that you can optimize and coming up with high-quality optimizers.
00:10:35
Speaker
And there are lots of lessons we've learned over the years. One of the biggest ones is probably that the simplest possible objective is usually the best one. Even if you could have a better objective function that seems mathematically pure in some sense, the fact that simple loss functions are easy to debug means that you can get 90% of the way there in doing these attacks.
00:10:58
Speaker
And the last little bit is a lot of work. It's nice to go from 95% to 98% attack success rate, but it's not really necessary in most of these cases. So you pick a really simple loss function that's easy to formulate and easy to debug when things go wrong, you pick an optimizer that makes sense, and mostly things just work.
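As a toy illustration of that two-ingredient recipe, a simple loss plus an off-the-shelf optimizer, here is a minimal sketch; the model, image, and target class are stand-ins, not anything from a specific paper.

```python
import torch
import torch.nn.functional as F

# Stand-in classifier and input; in practice this would be the model under attack.
model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 10))
image = torch.rand(1, 3, 32, 32)
target_class = torch.tensor([7])          # class we want the model to (wrongly) predict

# Ingredient 1: a simple objective -- plain cross-entropy toward the target class.
# Ingredient 2: a standard optimizer -- here, gradient descent on the perturbation.
delta = torch.zeros_like(image, requires_grad=True)
optimizer = torch.optim.SGD([delta], lr=0.01)

for _ in range(100):
    optimizer.zero_grad()
    loss = F.cross_entropy(model(image + delta), target_class)
    loss.backward()
    optimizer.step()

print(model(image + delta).argmax(dim=1))  # ideally now predicts the target class
```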
00:11:20
Speaker
A significant amount of this work over time has been in the image classifier domain. And a lot of times we see pretty striking examples there where, I guess, there's either a second term in the loss function or some sort of budget constraint as well, right? You're both trying to say, okay, I've got a picture of a car and I want it to output dog as the classification or whatever, but then you also
00:11:49
Speaker
don't want it to actually change the image into a dog to make that happen. So how often is this second term also a big part of keeping the image looking like it originally did? Sure, yeah. So in the case of adversarial examples, okay, one of my first papers in adversarial machine learning was coming up with a clever way of doing exactly this.
00:12:08
Speaker
This was entirely a paper on those exact two questions: what's the optimizer, and what's the optimization objective? And we did some clever thing and it worked well. I won't go into details here, but we did something fancy.
00:12:21
Speaker
And then like six months later, maybe a year later, Aleksander Madry and his students said: instead of doing something clever, let's just constrain the image to stay within a small ball around the initial point, so you can only perturb the three lowest bits.
00:12:39
Speaker
And if you only optimize the objective function I had set as a good optimization objective, and run the same optimization algorithm I was using, it turns out it gets you like 99% of the way there and it's so much simpler. This algorithm is called PGD, and this is the one everyone remembers, because it's the right way of doing it.
00:12:52
Speaker
You can squeeze epsilon more performance out of it if you do things a lot fancier, but for the most part the defense is either effective or it's not.
00:13:04
Speaker
And breaking the last 2% is very rarely something you actually need. So for the most part, it's entirely fine to just say, let's make something a lot simpler and optimize that, and it ends up working quite a lot better. So for these image examples,
00:13:21
Speaker
today people don't put a second term on minimizing the distance between the original image and the perturbed one. They just add a constraint. They say, you're constrained to this bounding box: you can only change the lowest three bits of the pixels.
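A minimal sketch of that constrained formulation, in the spirit of PGD: no distance penalty in the loss, just a hard clip of the perturbation back into a small L-infinity ball after every signed gradient step. The model, epsilon, and step size below are illustrative stand-ins.

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, image, label, eps=8/255, step=2/255, iters=100):
    """Untargeted PGD: maximize the loss on the true label inside an L-inf ball."""
    x_adv = image.clone().detach()
    for _ in range(iters):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), label)
        grad, = torch.autograd.grad(loss, x_adv)
        with torch.no_grad():
            x_adv = x_adv + step * grad.sign()                # signed gradient step
            x_adv = image + (x_adv - image).clamp(-eps, eps)  # project back into the eps-ball
            x_adv = x_adv.clamp(0, 1)                         # keep pixels in a valid range
    return x_adv.detach()

# Illustrative usage with a stand-in model.
model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 10))
x, y = torch.rand(1, 3, 32, 32), torch.tensor([3])
x_adv = pgd_attack(model, x, y)
```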
00:13:34
Speaker
And this just makes the optimization so much simpler. It's a little bit worse, but in all practical senses it just makes things work a lot better. When people say that attack is easier than defense,
00:13:46
Speaker
One obvious way to read that is just that you only have to succeed with a minority of your attacks, whereas you know for defense to be successful, you've got to win always or near always.
00:13:57
Speaker
Are there other kinds of meanings of that, or intuitions for why attack is easier than defense, that are important as well?

Advantages of Attackers Over Defenders

00:14:06
Speaker
Yeah, so this is the big one. The second big one is the attacker goes second.
00:14:12
Speaker
So the defender has to come up with some scheme initially, and then the attacker gets to spend a bunch of time thinking about that particular scheme afterwards. And so this is maybe a variant on why finding one problem is easier than solving all of them.
00:14:28
Speaker
But like the particular thing is, it probably would be pretty hard for me to write down an attack algorithm that was effective against any possible defense.
00:14:39
Speaker
there's like almost certainly something that someone could do that is correct, that like stops all attacks. But like I don't have to think about that defense. I only have to think about the defense that's literally in front of me right now. And so it's a lot easier when you're presented with one particular algorithm, you can spend six months analyzing it.
00:14:56
Speaker
And so the attacker has an information advantage from this side too, where they can wait for the field to get better, to learn new things, and then apply the attack after all of this has been learned, and the defender in many cases can't update the thing that they've done.
00:15:13
Speaker
There are some settings where this is reversed, where the attacker has to go first. Poisoning, for example, can be one of them. Suppose that I want to make malicious training data and put it on the internet and hope that some language model provider is going to then go and train on my malicious data.
00:15:28
Speaker
In this case, it may actually be that the attacker has to go first: I have to upload my training data, and then someone gets to train their model with whatever algorithm they want, with whatever defense they want to remove my poisoned data, before they actually run the training.
00:15:42
Speaker
In this case, maybe the defense is actually a little bit easier than the attack. It's hard to say, because the defender now goes second. But for many of the cases I've spent most of my time thinking about, the adversarial example case, this recent unfine-tunable models case, it is the case that the attacker goes second, and that really gives them a lot of power.
00:16:02
Speaker
Yeah, I wonder what that implies for the future of how open all this work is going to be, right? I mean, we've been in a regime where the stakes of machine learning generally were not super high, and people were kind of free and easy about publishing stuff, including, and I've always kind of marveled at this, from the biggest companies in the world, where
00:16:24
Speaker
one might wonder, why are the biggest companies in the world publishing all this IP? But they've been doing it. Now it seems like maybe, geez, if we're actually running an API at scale, maybe we don't want to disclose all of our defense techniques. So do you think that's already changing?

Secrecy and Disclosure in Defense Techniques

00:16:42
Speaker
You already see this, right? like GPT-2 was released with the weights. GPT-3, GPT-4 was not. like The biggest models are not being, for the most part, released by the companies who are doing this.
00:16:53
Speaker
I think... Security is probably a small part of the argument here. I will say, though, there are other areas of security, or in almost all other areas of security, this is not what we rely on.
00:17:04
Speaker
Let's think, for example, about cryptography. right like We publish algorithms. Everyone knows how the best crypto systems work. Everyone tries to analyze them. No company in their right mind would ever try and develop a new fancy crypto system.
00:17:18
Speaker
You're just going to use AES, because it's known to be good; it would be crazy to try to do anything fancy in-house. And the reason why is that empirically it works very well, and we've had the entire community trying to break it for 20 years and largely failing.
00:17:32
Speaker
And so everyone believes that this is effective. And you don't get that same kind of belief in something without a large number of people trying to analyze it. So if you have these models and they stay proprietary things that are not disclosed, it may be the case that empirically this just ends up being the best we can hope for. Maybe deep learning is just impossible to secure.
00:17:57
Speaker
There's no hope for it. You lock things down and you just try to change things faster than the attackers can find bugs. And okay, that would not be great, but I think we can potentially live in that world.
00:18:08
Speaker
I think what would be a lot better, which just may not be happening, may be very hard, is you get everyone to disclose exactly what they're doing, exactly how they're doing it. You get everyone to analyze that in detail.
00:18:20
Speaker
And then you learn how to make these things better, to the extent that you can actually improve robustness. And then you get to the point where people can choose to either release things or not release things, not because of security, but because, I don't know, they want to make money or whatever the case. What I would like to avoid is the belief that not making the thing public is the more secure version; it's a shame that this is part of the calculus right now. I would rather have things that actually work, as opposed to things that are insecure but that we lock down to make it harder to find the bugs. They're still insecure; it's just a little bit harder to find the bugs.
00:19:03
Speaker
Let's come back to that in a little bit

Universal Jailbreak Paper Discussion

00:19:05
Speaker
as well. Just staying for a moment on kind of how you organize the space of all these different attack regimes and whatnot.
00:19:15
Speaker
There are some settings... in fact, we did a whole episode on the quote-unquote universal jailbreak, which I hadn't even realized until preparing for this that you were a co-author on. That was one of the many papers from the last couple of years.
00:19:27
Speaker
But there are some sort of wrinkles on the high level description that you gave of find a gradient, maximize some loss function where, for example, in that universal jailbreak paper, if I recall correctly, because the idea was limited to picking the right tokens,
00:19:46
Speaker
the space isn't purely differentiable, and so you're kind of navigating this discrete space of individual tokens. Yeah, that's great, let's talk about this paper for a second then. So as a refresher for everyone, here's what this paper is doing.
00:19:58
Speaker
This is, again, one of these papers where I was mostly just advising. Zico and Matt and their students found out that it is possible to take a language model that usually would refuse to answer questions,
00:20:11
Speaker
so you ask, how do I build a bomb? And the model says, I'm sorry, I can't possibly help with that. It is possible to take that same model and append an adversarial suffix to the prompt so that you can arrange for the model to now give you a valid answer.
00:20:27
Speaker
How do you do this? Okay. If I knew the answer ahead of time, one thing you might imagine doing is trying to optimize the tokens, and we'll come back to this optimization question in a second; let's just assume you can optimize.
00:20:42
Speaker
You could imagine trying to optimize the tokens so that the model gives a particular response as output: here are the steps to build a bomb, one, go get whatever chemicals you need, two, instructions to assemble them, or whatever the case may be.
00:20:58
Speaker
But this requires that I know the instructions already, so it's not very helpful. So what's the objective function I'm actually going to use to make the model give a response? Well, another thing you could think about is coming up with some fancy latent-space non-refusal direction and doing some optimization against that.
00:21:16
Speaker
And actually, there's recently been some work on doing that. But again, this is complicated; it's not the first thing you want to try. What's the first thing you want to try? The first thing you want to try comes from, I think, initially maybe a paper by Jacob Steinhardt. At least that's the first paper I saw it in.
00:21:30
Speaker
What we wrote in this paper is an affirmative response attack, which just says: let's make the model first respond, okay, here's how to build a bomb.
00:21:42
Speaker
That's the only objective. The only objective is make the first like 10 words from the model be an affirmative response that says, yes, sure, I will help you build the bomb. And then once you've done that, because of the nature of language models, it turns out that they then give you an answer.
00:21:58
Speaker
And there are other defenses that rely on breaking this assumption, too. But this was the key part of the objective function: we have something in our mind that we want, we want the model to give us an answer with the instructions or something, but actually coming up with a particular number to optimize that makes this happen is very hard.
00:22:15
Speaker
So we come up with this very straightforward loss function objective that makes that happen. Now we can return to the question of what the optimizer is. And this is where a lot of the work in this paper went: how do you take something that is, as you say, discrete tokens and make it into something you can actually optimize?
00:22:35
Speaker
And early work had tried to do like second order gradients and some fancy stuff going on there. And the main thing that this paper says is we will do maybe three things.
00:22:47
Speaker
First, we will use gradients to guide our search. We're not going to use gradients to do the search; they will guide the search. Second, we will check whether or not the gradients were effective by actually switching tokens in or out. And third, we will spend a lot more compute than other people were using, you know, bitter lesson, and just do this a bunch, and you end up with very, very strong, effective attacks.
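To make that concrete, here is a toy sketch of the greedy-coordinate-gradient idea as described above: gradients at the one-hot token positions nominate candidate swaps, and the actual search evaluates real token substitutions and keeps the best. The tiny embedding "model" and loss here are stand-ins; in the real attack, the loss would be the cross-entropy of the affirmative target response under the actual LLM.

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Toy stand-in "language model": an embedding layer plus a linear head.
vocab_size, dim = 100, 16
embed = torch.nn.Embedding(vocab_size, dim)
head = torch.nn.Linear(dim, 1)

def attack_loss(one_hot_suffix):
    """Differentiable loss as a function of the (one-hot) suffix tokens."""
    emb = one_hot_suffix @ embed.weight            # [suffix_len, dim]
    return head(emb.mean(dim=0)).squeeze()         # scalar we want to minimize

suffix = torch.randint(0, vocab_size, (8,))        # 8 adversarial suffix tokens
top_k, n_candidates = 8, 64

for step in range(50):
    # 1) Gradients w.r.t. a one-hot encoding of the suffix GUIDE the search.
    one_hot = F.one_hot(suffix, vocab_size).float().requires_grad_(True)
    grad, = torch.autograd.grad(attack_loss(one_hot), one_hot)
    candidates = (-grad).topk(top_k, dim=1).indices   # promising swaps per position

    # 2) The search itself is discrete: try actual token swaps and keep the best.
    best_suffix = suffix
    best_loss = attack_loss(F.one_hot(suffix, vocab_size).float()).item()
    for _ in range(n_candidates):
        pos = torch.randint(0, len(suffix), (1,)).item()
        cand = suffix.clone()
        cand[pos] = candidates[pos][torch.randint(0, top_k, (1,)).item()]
        loss = attack_loss(F.one_hot(cand, vocab_size).float()).item()
        if loss < best_loss:
            best_suffix, best_loss = cand, loss
    suffix = best_suffix

print(suffix, best_loss)
```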
00:23:07
Speaker
And so this, I think, still falls very nicely into this framing of what you are optimizing and how you are optimizing it. So how much compute, maybe this is another sort of dimension of how you would think about this,
00:23:18
Speaker
how much resource does

Compute Requirements for Attacks

00:23:23
Speaker
this take? If you're doing one of these gradient things, how much do you typically have to put into it? If you're doing something that's in a discrete space and requires more of a structured search, how does that compare?
00:23:35
Speaker
If you're doing data poisoning, how much data does it take to actually poison a model? Sure, okay, I'll take these maybe one at a time. Let's start with the image adversarial example, continuous-space question.
00:23:47
Speaker
The amount of compute here is almost zero. One of the first papers that showed this is a paper by Ian Goodfellow, where he introduced this hack called the fast gradient sign method.
00:23:58
Speaker
The fast gradient sign method does exactly two things. Well, first of all it's fast. And the reason why it's fast is because what it does is it takes an image, it computes the gradient with respect to image pixels,
00:24:09
Speaker
and then computes the sign, literally just takes the sign of which direction the gradient says to go, and then takes a small step in that direction. That's it. One step. So if a model is vulnerable to the fast gradient sign method, then it takes exactly one gradient step, which is essentially zero time.
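In code, that single FGSM step is just a signed-gradient update; a sketch, with the same kind of stand-in model as before (PGD, sketched earlier, is essentially this step iterated with projection):

```python
import torch
import torch.nn.functional as F

def fgsm(model, image, label, eps=8/255):
    """One signed-gradient step away from the true label (untargeted FGSM)."""
    image = image.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(image), label)
    grad, = torch.autograd.grad(loss, image)
    return (image + eps * grad.sign()).clamp(0, 1).detach()
```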
00:24:26
Speaker
Other attacks, like I mentioned PGD already, PGD you can think of essentially as fast gradient sign, but iterated some number of times. The number of iterations is, I don't know, usually, let's say somewhere between 10 and 1,000.
00:24:40
Speaker
For undefended models, it could be 10. For defended models, to break them you usually need something like 100. And just out of care, to make sure you're not making any mistakes, it's often a good idea to use 1,000, just to make sure you haven't accidentally under-optimized.
00:24:58
Speaker
And then this works very well. How long does a thousand iterations take? I don't know, a minute or two for reasonably sized models. Now let's go to the discrete space, for GCG, where generating an attack can take an hour or maybe a couple of hours, depending on what you want, because for this we're doing some large batch size and a thousand mini-batch steps. It takes a relatively large amount of time, but not a huge amount of time; it's still orders of magnitude faster than training. But by going to the discrete space, it does become a lot slower.
00:25:37
Speaker
And then how about data poisoning? Yeah, okay. So there's a question of how much time it takes to generate this data, and there are basically two rough directions here. The field initially started out with: how do I make a model give the wrong answer? I add a bunch of data that's labeled incorrectly.
00:25:58
Speaker
This is the simplest possible thing you can do. This is a paper by Battista Biggio, which got a test-of-time award at ICML a couple of years ago. It's a very nice paper from, I don't remember when, 2012 or something; one of these very early security results that's very important.
00:26:14
Speaker
And yeah, so you just insert mislabeled data. It's very easy to do. You insert a very small amount of mislabeled data into these image classifiers they were looking at at the time, and the classifiers would just immediately mislabel the data.
00:26:25
Speaker
Then people started looking at, well, what happens if the adversary can't just insert mislabeled data? right Because like you know once upon a time, we used to curate our data sets to be only high quality data.
00:26:38
Speaker
And so it would be unreasonable to suspect that the adversary could just inject mislabeled data points. And then the answer is, well, now I have to be very, very careful; I have to optimize my images to look like they're labeled correctly. There's this clean-label poisoning threat model where you need to do some fancy stuff: try to imagine what embeddings you want the classifier to learn, surround your test point in embedding space, do some fancy polytope stuff, and there's a bunch of work doing fancy things here.
00:27:06
Speaker
And the optimization is relatively difficult, and you need something like 1% of the data poisoned. This is a lot. And then people started going, well, why do we clean our data in the first place? Let's just take all the data from the internet. And again, poisoning becomes a lot easier

Challenges in Poisoning Language Models

00:27:20
Speaker
then.
00:27:20
Speaker
You know, if you're willing to just take arbitrary data from the internet, now you can just mislabel your data points again. And so we had a paper, I don't know, in 2021, looking at poisoning some of these self-supervised classifiers like CLIP and others, where you just add mislabeled data points again and the thing basically just breaks, and you don't need to do anything fancy, no optimization. You just flip the label, you add a couple hundred images, and you can get these kinds of things to work.
00:27:44
Speaker
There's a new question now of how this works for language models. And this is one of the things we've been writing papers on recently, to try to figure this out. I feel like we don't understand this right now, because a bunch of things are different for language models. For example, no one just uses the base language model.
00:28:01
Speaker
You have your language model, and then you go fine-tune it with SFT and RLHF, and you change the weights, and so you need your poisoning to be robust to all these things. This is another paper I helped advise some students on, from CMU and from Zurich, where Javier and the others were looking at trying to
00:28:23
Speaker
understand what actually happens in the optimization after you have poisoned the model. So you have to arrange for the model to be poisoned in such a way that even after RLHF, it still gives you the wrong answer.
00:28:35
Speaker
And doing this is challenging. And so it ends up right now that the poisoning rates are something like 0.1%, which is small, but like 0.1% of a trillion tokens is a billion tokens.
00:28:47
Speaker
So if you were to train a model on just you know some large fraction of the internet, this could potentially be infeasible for an adversary to do in practice. Now, my gut feeling is that this has to be too big because models know more than a thousand things.
00:29:03
Speaker
If you had to control one thousandth of the data set to make the model believe something is true, it could only know a thousand things. So this just doesn't make sense. There has to be some better poisoning approach that makes the model vulnerable with much lower control of the training data.
00:29:20
Speaker
But this might now need fancier algorithms again. You might need to come up with clever ways of constructing your data that's not just like repeating the same false fact lots of times. So again, i don't know. I think this is one of the open questions we've been trying to write some papers on recently, and I hope we'll have a better understanding of sometime this year.
00:29:35
Speaker
One thing that you said that really caught my attention was you have to kind of imagine what the embeddings would be like as you were trying to think of an attack. So can you unpack that a little bit? I would love to know.
00:29:49
Speaker
Are you visualizing something there? Because I struggle to have good intuitions for this, as evidenced by my previous enthusiasm for tamper-resistant fine-tuning. I was like, oh, this is amazing. It seems like this could really work.
00:30:03
Speaker
And clearly, as I conceive of it, I'm not doing something there that you are doing. It might be hard to communicate what that is, but what do you think you're doing? Okay, so this paper was not mine. There was a Poison Frogs paper, and this was a follow-up, I think it was called the Polytope Attack, but this was a long time ago, so I don't remember; I think it might have been Tom Goldstein's group again.
00:30:29
Speaker
I don't remember the details. To abstract from the details, the real hope is that I can grasp onto something that allows me to be better at this in the future. So this paper, the idea was very simple.
00:30:41
Speaker
Let me explain what this paper's trying to do. It's trying to make a particular image become misclassified. And it's trying to do this in such a way where it does not introduce any large label noise to the training data set that any person would look at and say that's obviously wrong.
00:30:55
Speaker
And so what it tries to do is surround the image you want to become misclassified, in this high-dimensional embedding space, with other images that have the opposite label, but make those images' embeddings close to the target one you're trying to misclassify.
00:31:15
Speaker
And so it tries to pull the entire region of that space over to the region where those images should be. The idea is relatively simple: you're trying to put a box around the image you want to become misclassified, so that the entire box gets labeled the wrong way instead of the correct way.
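A rough sketch of that feature-collision idea (not the actual Poison Frogs or Polytope Attack code): optimize a poison image so its embedding under a stand-in feature extractor lands near the target's embedding, while its pixels stay close to a clean base image carrying the attacker's desired label.

```python
import torch

# Stand-in feature extractor; in the real attack this would be the victim
# model's penultimate layer. All names and hyperparameters are illustrative.
feature_extractor = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 64))

target = torch.rand(1, 3, 32, 32)      # image we want to end up misclassified
base = torch.rand(1, 3, 32, 32)        # clean-looking image carrying the attacker's label
poison = base.clone().requires_grad_(True)
opt = torch.optim.Adam([poison], lr=0.01)
beta = 0.1                             # weight on "stay close to the base image in pixel space"

target_feat = feature_extractor(target).detach()
for _ in range(200):
    opt.zero_grad()
    collide = (feature_extractor(poison) - target_feat).pow(2).sum()   # embed near the target
    stay_clean = (poison - base).pow(2).sum()                          # look like the base image
    (collide + beta * stay_clean).backward()
    opt.step()

# Several such poisons placed around the target form the "box" described above.
```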
00:31:36
Speaker
For me, I guess, for many of these attacks that I try and think about, I tend to think about them visually, but I think ignoring the details is like is entirely fine.
00:31:49
Speaker
I'm just trying to get a sense of what's the important thing going on here, what at a high level makes sense, what should morally be true, and then you can figure out the details afterwards.
00:32:03
Speaker
After you figure out what should be true about this, the rest is implementation. And this is, I don't know, if you ask math people how they do their proofs, it sounds similar when they talk about it.
00:32:15
Speaker
They first establish what should be true in their mind, and then they go and try to prove it. And it turns out, maybe the proof has to be more complicated, or something didn't work out in the details, and then you try something else that feels like it should be true.
00:32:26
Speaker
And this is, I guess, a similar thing I try to do. And I don't know how to give intuition for this, like feels like it should be true other than just like you've done it a bunch and you look at it and this looks spiritually similar to this other thing that broke in a very similar way. i feel like the ideas should carry over.
00:32:44
Speaker
We'll come at this procedurally as well in a second, but just staying on the visualization, are you doing the classic physics thing of visualizing in three dimensions and then saying n really hard?
00:32:56
Speaker
I wouldn't say I'm good at this at all, but I sort of have a certain version of it for refusal, where I kind of imagine a fork in the road or a branching river or something, where once you're on one path, you're in some local well, just like when a river has forked, right? It's not going to meet again until it's flowed down into some other topology or geography or whatever.
00:33:19
Speaker
I mean, that's pretty hackneyed, but what's your version, if you can give one? I don't know that I have a great version of this that I can really give you. I feel like everyone thinks of things differently. I tend to try to think of these things visually. And yeah, I do the let's-think-in-three-dimensions thing and then just imagine that things roughly go like this, but this can be really deceptive, because there are so many defenses that are predicated on the belief that things work the way they do in three dimensions, and then you go to a thousand dimensions and all of a sudden nothing works anymore. You learn to become used to certain facts in high dimensions when you're attacking things. Almost everything is close, in high dimensions, to a hyperplane: if you just draw a plane and pick a point, they're almost always close. So lots of defenses will try to separate points from planes, but in high dimensions it's almost always close; you don't have to think about the details.
00:34:15
Speaker
Lots of these intuitions we have in three dimensions just don't work in higher dimensions. And you become used to knowing which of these intuitions are wrong. You don't need to understand exactly why they're wrong; it's just a thing you learn is true.
00:34:29
Speaker
And when someone justifies their defense using one of these things that you've seen that doesn't make much sense, you then just go, okay, well, presumably there's something here that I should look at more. So that's an interesting kind of rule of thumb or mental model right off the bat. Everything is close in high dimensions.
00:34:45
Speaker
Is there a good story for why that is? I mean, it doesn't seem like it holds in two dimensions, right? If I understand you correctly, if I'm in three dimensions and I draw a two-dimensional plane in it, then I would intuitively feel like some things are close to that plane and some things are far from it.
00:35:01
Speaker
If I'm in a thousand dimensions and I draw a 999-dimensional plane, if I'm understanding you correctly, why is everything close to that? Yeah, okay. So maybe the statement I will make, to be more precise, is: suppose that you have some classification model and you have some decision boundary of the classifier.
00:35:23
Speaker
The statement that is true is that almost all points are very close to one of the decision boundaries, both because there are many of them, but also because, in high dimensions,
00:35:38
Speaker
I may be very far from something in almost all directions, but there exists a direction I can travel in, the direction orthogonal to the closest hyperplane, where the distance is very, very small.
00:35:53
Speaker
And so you have this thing where, if you try random directions, you may go a very long way and never encounter a decision boundary; you probably will at some point, but it will be quite far. But in high dimensions, because of the number of degrees of freedom you have, it's much more likely that there exists a direction that takes you to some plane that's really close by, which you would have a hard time finding
00:36:20
Speaker
if you just like searched randomly. Whereas in three dimensions, you know if you search randomly, you know you're probably going to run into whatever the nearest hyperplane boundary is. you know In one dimension, you're certainly going to. You just try twice, you go left, you go right, you find it.
00:36:33
Speaker
In two dimensions, you go randomly, and like maybe most of the time you find something that's close by. In three dimensions, there's more ways you can go that are orthogonal. like In two dimensions, there's only two directions you can go that's orthogonal to the line.
00:36:44
Speaker
In three dimensions, there's now an infinite number of directions orthogonal to the line. And so in general, in high dimensions, almost all vectors are nearly perpendicular to each other. You can end up almost always randomly picking directions that just don't make any progress, which does not mean that there isn't a direction that does make progress; it's just much harder to find.
00:37:04
Speaker
But once you find it, things mostly just work out. So maybe the more precise version of what I'm trying to say is: things are close, but when you search for them randomly, it looks like they're far away.
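A quick numerical illustration of that intuition (my own toy experiment, not something from the conversation): the distance from a random point to a hyperplane is tiny if you step along its normal, but the distance you travel before crossing it in a random direction grows with the dimension.

```python
import numpy as np

rng = np.random.default_rng(0)

for d in (2, 10, 1000):
    x = rng.standard_normal(d)                # a random point
    w = rng.standard_normal(d)
    w /= np.linalg.norm(w)                    # unit normal of a hyperplane w.x = 0

    dist_orthogonal = abs(x @ w)              # shortest distance: step along the normal
    v = rng.standard_normal(d)
    v /= np.linalg.norm(v)                    # a random unit direction
    dist_random = abs(x @ w) / abs(v @ w)     # distance along v before crossing the plane

    print(f"d={d:5d}  |x|={np.linalg.norm(x):7.1f}  "
          f"orthogonal step={dist_orthogonal:6.2f}  random-direction step={dist_random:9.1f}")
```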
00:37:16
Speaker
Okay, that's quite interesting.

Understanding High-Dimensional Spaces

00:37:19
Speaker
I wouldn't say I've grokked it just yet, but... Yeah, this is the kind of thing where I'm not being formal here. I'm not giving you a proof that what I'm saying is correct, because that isn't how I think about it.
00:37:32
Speaker
I sort of just think about it very unrigorously in this way, and then once you have to actually go do the attack, you think about it rigorously. But when just visualizing what's going on, I feel like some people try to actually think carefully about what's happening in this thousand-dimensional space. I'm like, I don't know what's going on.
00:37:49
Speaker
I just sort of have my intuition of what feels like is going on. And this sort of roughly matches how things have been going. And you have to be a little bit fuzzy when you're thinking about this because no one can understand it.
00:38:00
Speaker
And then once you're done thinking about that, you can go back to the numbers and start looking at, okay, mechanically, what's going on: I'm taking the dot product of these two things, and I want this to be equal to negative one, and so you do some stuff there. You can become very formal when you need to. But I think being confused in high dimensions is probably the right state, and you get used to the fact that this is the way this works. And this, again, is part of the reason why attack is easier.
00:38:28
Speaker
Because if you're to defend against things, you really need to understand exactly what is going on to make sure that you have ruled out all attacks. But as an attacker, I can have this fuzzy way of thinking about the world and if my intuition is wrong, the attack just won't work and I'll then think of another one as opposed to having to have a perfect mental model of what this thing is doing to make sure that it's robust from all angles.
00:38:49
Speaker
But it does seem like your intuition is a pretty reliable guide to what's going to work. Yeah, but I guess a predictor which is almost as accurate as me would be to say, does this work? Answer: no. Basically, most of what my intuition says is, no, this doesn't work. Maybe the thing I'm a little bit better at than some people is why it does not work: what would the attack be that breaks this?
00:39:15
Speaker
And I think that is just having done this a lot for many different defenses and having seen all of the ways that things can fail. You just remember this and you pattern-match to the next closest thing. Why is it that people who do math can prove things that seem complicated in very easy ways? It's because they've spent 20 years studying all these things, they've seen an exactly analogous case before, they remember the details, and they abstract things away enough that it becomes relatively straightforward. I feel like it's mostly an exercise in having practiced doing this a whole bunch.
00:39:46
Speaker
What would you say is your conceptual attack success rate? I don't mean like the rate at which examples succeed in attacking within a given strategy, but like how many strategies do you have to come up with before you find one that actually does work to break a defense for a given new defense?
00:40:06
Speaker
I don't know. I think it really depends on which one you're looking at. Sometimes you try five things that you think ought to make sense and they don't work, and then you try the sixth one and it does. I feel like usually if you've exhausted the top five or ten things and haven't gotten a successful attack, then you're not going to get one. Or at least for me, if it's not in the top five or top ten, then I usually can't think of something else.
00:40:29
Speaker
And probably, I don't know, for image classifiers in particular, where I've done a bunch of this, usually the top one or two ideas work. For other areas, it takes more, just because you've seen fewer examples like this and you don't know what the style of attack needs to be.
00:40:45
Speaker
But it's very rare, it sounds like, that you get past 10 ideas and give up. Yeah, but also there's some problem selection here where, you know,
00:40:57
Speaker
Okay, so there's a large number of defenses in image adversarial examples, which are basically just adversarial training changed a little bit. So adversarial training is this one defense approach, which just trains on all the adversarial examples.
00:41:07
Speaker
you know, bitter lesson. What do you want? Robustness to adversarial examples. How do you do it? You train on adversarial examples. um You do this at scale and the thing works. And there are lots of defenses that just are adversarial training plus this other trick, you know, plus diffusion models to generate more training data, plus this other loss term to make it so that I do better on the training data, plus, you know, like whatever, some smoothing to make the model better in some other way.
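Schematically, adversarial training is just an ordinary training loop in which each batch is replaced by adversarial examples generated on the fly. A minimal sketch, reusing the illustrative pgd_attack from the earlier sketch, with a stand-in model and data:

```python
import torch
import torch.nn.functional as F

# Stand-in model and data; `pgd_attack` is the illustrative function sketched earlier.
model = torch.nn.Sequential(torch.nn.Flatten(), torch.nn.Linear(3 * 32 * 32, 10))
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loader = [(torch.rand(16, 3, 32, 32), torch.randint(0, 10, (16,))) for _ in range(10)]

for epoch in range(5):
    for x, y in loader:
        x_adv = pgd_attack(model, x, y, iters=10)   # inner step: generate worst-case inputs
        opt.zero_grad()
        F.cross_entropy(model(x_adv), y).backward() # outer step: train to be correct on them anyway
        opt.step()
```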
00:41:30
Speaker
And I basically just believe most of these are probably correct, for the most part. So I won't go and study those ones, because the foundation is something I believe in already. You don't need to go and study them rigorously; maybe you could break them by a couple of percentage points more, but it's not going to be a substantial enough thing to be worth spending a lot of time on. What I tend to spend my time looking at are the things that, when you look at them, look a little more weird.
00:41:58
Speaker
Those are the more interesting defenses, because they're a qualitatively new class of ways of thinking about this, and so I want to think about them. I think these ones are worth spending time on, but this also means it artificially inflates the attack success rate, because I'm biasing my search toward the ones where I have a good prior that they're probably not going to be effective.
00:42:20
Speaker
And so, yeah, it ends up this way. Just to make sure I'm accurate in my understanding of the space: there are no adversarial defenses that really work in image classification? Yeah, okay. So it depends on what you mean by works.
00:42:34
Speaker
So, right, okay. The best defenses we have are basically adversarial training, which is: generate an adversarial example, train on that adversarial example to be correct, and repeat the process many, many times. Okay, what does this give you?
00:42:45
Speaker
This gives you a classifier where, on the domain of adversarial examples you trained on, as long as you don't need to be accurate more than about half of the time, you're pretty good.
00:42:56
Speaker
The accuracy under attack for the type of adversarial examples you train on is usually 50%, 60%, maybe 70%. And that's much bigger than zero, right?
00:43:07
Speaker
This is good. But as an attacker, what does 70% accuracy mean to me? 70% accuracy to an attacker means I try four times and probably one of them works. So from that perspective, it's terrible, right? It doesn't work at all, because imagine in systems security that you had some defense where the attack was: try four different samples of malware and one of them evades the detector. This is not a good detector.
00:43:31
Speaker
But for image adversarial examples, this is the best we have. So on one hand, it's much, much higher than zero; very good progress. On the other hand, 70% is very, very far away from 99.999.
00:43:44
Speaker
But in machine learning land, you never get five nines of reliability. So 70% is a remarkable achievement on top of zero. And this, I think, is why you can talk to someone and they'll tell you it works, and you can talk to someone else and they'll tell you it doesn't.
00:44:02
Speaker
Depending on how you're looking at it, it can mean two different things. Yeah, gotcha.

Spatial Heuristics and Model Defense

00:44:07
Speaker
Are there any other spatial heuristics that you think about? I ask in the context of the one where you said to kind of envelop the one example that you want to break, in these sorts of adversarial examples.
00:44:24
Speaker
Another shout-out to MLST: there was just another episode trying to understand the behavior of models through this splines paradigm. And I could imagine, although I'm not mathematically sophisticated enough myself to have a good intuition for it,
00:44:41
Speaker
maybe there are certain rules where it's like, you can't create a donut in the internal space of the model. And so is that like why that works? Or, you know, but you can address that specifically, but I'm more interested in kind of, do you have a number of these sorts of things where you're like, well, I know that the space kind of is shaped this way, or it's impossible to create this kind of shape in the space. So therefore I can kind of work from there.
00:45:03
Speaker
Yeah. So I feel like I don't tend to do so much visualization of that kind for these defenses. I think for the most part what I'm doing is trying to understand the shape of the loss surface.
00:45:17
Speaker
It's like most of the time when something is robust to attack or appears robust, the problem is that they have made the loss surface particularly noisy and hard to optimize.
00:45:28
Speaker
And this is what we've seen for adversarial examples essentially forever. One of the very first defenses to adversarial examples that people gave serious consideration to is this defense called distillation as a defense.
00:45:40
Speaker
And okay, maybe there's another lesson about these defenses. Defenses often have an intuitive reason why the authors think they work, and they tell some very nice story. So this defense told some very nice story about how you have distillation, you have a teacher model, and the teacher sort of teaches the student to be more robust in some nice way, and that's why the student is robust.
00:46:03
Speaker
And the story they're telling themselves about why these things work is often very, very different from the actual reason why the attack fails. And it turned out that distillation had nothing to do with this defense whatsoever.
00:46:15
Speaker
It turned out that what was going on is that, because of the way they were training this model, they were training the student at a very, very high temperature, which means the logits were getting very, very large, and they were running this in the era of, you know, TensorFlow zero-point-something, when it was very easy for the softmax cross-entropy to give numerically zero gradients as the output.
00:46:42
Speaker
And so the reason the attacks were failing is that the loss function was actually identically zero. This was the very first example of one of these gradient-masking defenses, where the authors think they have some clever idea of what's going on, but actually it turns out the gradient of the function has just been made zero, and all I need to do to attack it is, for example, compute the gradients in 64-bit floating point so
00:47:11
Speaker
that you get enough signal that everything works out. That would have worked there. But you could also do other tricks, like just dividing the logits before you put them into the softmax; there are lots of things that work here. But then the next generation of defenses was much more explicit about this and had other ways of breaking the gradients.
00:47:28
Speaker
So there were a bunch of defenses, and some of them were very, very explicit: we're just going to add noise to the model in order to make the gradients ugly. And then most of what you're trying to think about when you're visualizing this is: how do I make it so that the gradients end up being something that, even if they look ugly, I can still work with in some smooth way?
00:47:51
Speaker
And so you can, for example, use this thing called a straight-through estimator and make gradients become nicer for discontinuous or ugly objective functions. And there's all these things you can do to visualize how I make the gradients of this very ugly thing look much cleaner.
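A minimal sketch of a straight-through estimator, here for a rounding step (the defense being bypassed and the exact non-differentiable operation are assumptions for illustration):

```python
import torch

class StraightThroughRound(torch.autograd.Function):
    """Rounds in the forward pass, but acts as the identity in the backward
    pass, so gradients flow through an otherwise non-differentiable step."""

    @staticmethod
    def forward(ctx, x):
        return torch.round(x)

    @staticmethod
    def backward(ctx, grad_output):
        return grad_output  # identity gradient: the "straight-through" part

def ste_round(x):
    return StraightThroughRound.apply(x)

# e.g. if a defense quantizes pixels before classification, swap in ste_round
# when computing attack gradients:
# logits = model(ste_round(x * 255) / 255)
```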
00:48:06
Speaker
And yeah, I have this image that I use in my slides a bunch that shows a very nice visualization in three dimensions of what the gradients for some of these models look like.
00:48:17
Speaker
And, you know, it looks like the surface of some very, very ugly mountain that is very hard to actually do anything with. If you run fancier attacks, you can smooth this out into a nice smooth surface; if you're thinking of gradient descent as a ball rolling down a hill, you want the hill to be nice and smooth. And so what I'm usually trying to think about in high dimensions is: what does this gradient function look like?
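One common way to smooth out a noisy or randomized loss surface is to average gradients over the randomness, in the spirit of expectation-over-transformation. A minimal sketch, assuming a PyTorch loss function that includes whatever randomness the defense injects (the sample count and noise scale are illustrative):

```python
import torch

def smoothed_gradient(loss_fn, x, n_samples=32, sigma=0.01):
    """Estimate the gradient of a jagged loss surface by averaging over many
    randomized evaluations, so the result looks like the smooth hill underneath."""
    grads = []
    for _ in range(n_samples):
        xi = (x + sigma * torch.randn_like(x)).detach().requires_grad_(True)
        loss = loss_fn(xi)  # re-runs the defense's randomness each time
        loss.backward()
        grads.append(xi.grad)
    return torch.stack(grads).mean(dim=0)
```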
00:48:42
Speaker
And yeah, this continued even all the way through to these unfine-tunable models, where one of the papers for this unfine-tunable model thing was explicitly saying, we make the gradients very challenging, and we make it so that when you train the model, the gradients are ugly, and so as a result, you can't fine-tune the model because the gradients are challenging.
00:49:01
Speaker
And this is literally the exact same argument that people were presenting in 2017 for image adversarial examples. And it fails in the exact same way: you change the learning rate a little bit, you add some random restarts, you add some warmup so that things work a little better. The gradient ends up becoming smooth enough that you can now do optimization, and then, you know, deep learning takes over and the rest is easy.
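The "attack" here is nothing exotic; it is an ordinary fine-tuning loop with routine tweaks. A sketch under the assumption of a HuggingFace-style model whose forward pass returns a `.loss` (all hyperparameters are illustrative, not the ones from any specific paper):

```python
import itertools
import torch

def finetune_with_warmup(model, data_loader, base_lr=2e-5,
                         warmup_steps=100, total_steps=1000):
    """Plain fine-tuning with a linear learning-rate warmup. The point is that
    ordinary tricks (smaller LR, warmup, a few random restarts) are often enough
    to get optimization moving again on a deliberately 'ugly' loss surface."""
    opt = torch.optim.AdamW(model.parameters(), lr=base_lr)
    sched = torch.optim.lr_scheduler.LambdaLR(
        opt, lambda step: min(1.0, (step + 1) / warmup_steps))
    for batch in itertools.islice(itertools.cycle(data_loader), total_steps):
        loss = model(**batch).loss   # assumes a HF-style interface
        opt.zero_grad()
        loss.backward()
        opt.step()
        sched.step()
```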
00:49:23
Speaker
And so this is, again, the same intuition breaking these other classes of defenses. Was that Sophon that you were referring to there? Yeah, so this one was both RepNoise and TAR,
00:49:38
Speaker
which have some arguments about what's going on here. RepNoise makes some arguments about the activations becoming noisy and that's why you can't do things. And there's another paper called TAR that also adds some adversarial training to the process.
00:49:51
Speaker
But one of the very first things that we learned in adversarial training is you have to train against a sufficiently strong adversary in order for adversarial training to work. So there was a paper before Aleksander Madry's PGD paper that tried to do adversarial training; they trained against weak adversaries, FGSM, which I talked about very briefly, which is this one-step attack.
00:50:09
Speaker
And it turns out that if you train against weak adversaries, then a stronger attack breaks it. And you can't fix that. You have to train against a strong enough attack in order for the thing to be robust and not get broken by stronger attacks.
00:50:24
Speaker
And what this TAR paper did is they trained against one-step weak attacks, exactly like the fast gradient sign method. And so what's the attack? You do many iterations, and things basically work out exactly the way the first versions of adversarial training failed.
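For reference, a minimal sketch of the multi-step PGD attack being contrasted with the one-step FGSM here (the epsilon, step size, and step count are illustrative defaults, and inputs are assumed to be images scaled to [0, 1]):

```python
import torch

def pgd_attack(model, x, y, loss_fn, eps=8/255, alpha=2/255, steps=40):
    """Projected gradient descent. With steps=1 and alpha=eps this collapses to
    one-step FGSM; training only against that weak adversary is what leaves a
    model breakable by this iterated version."""
    x_adv = x.clone().detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = loss_fn(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        x_adv = x_adv.detach() + alpha * grad.sign()
        # project back into the epsilon-ball around the original input
        x_adv = torch.min(torch.max(x_adv, x - eps), x + eps).clamp(0, 1)
    return x_adv
```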
00:50:37
Speaker
And so that's why, you know, I read this paper and just immediately assumed it was going to be broken: for all of the arguments it presents for why it works, I have direct analogies from image adversarial example defenses that were broken.
00:50:48
Speaker
And so like it felt like the ideas were there, but it just felt to me in spirit like these things I knew were broken before. And so I just assumed, well, probably it's broken here too.
00:51:01
Speaker
Gotcha, okay. So I'm trying to think if there's anything more to dig in on there. Obviously, this matters a lot for the future of open source.

Security in Open-Source Models

00:51:11
Speaker
That is, you know, I've been looking, without success, for some reason to believe from the broader literature that there might be some way to square the circle, where we could have open-source models that nevertheless won't tell people how to build bioweapons, even if on some level they're powerful enough to do that.
00:51:31
Speaker
Yeah. I think this is a very challenging thing to ask for. You know, suppose that I told you I want you to build a hammer, and the hammer has the ability to build all of these nice things, but the hammer cannot be used for one of these seven dangerous purposes.
00:51:47
Speaker
It'd be very hard to construct this tool in this way. And I feel like almost all tools that we have have this property. We don't have a C compiler that has the ability to only write benign software and not attacks.
00:51:59
Speaker
Every tool that you have can be used in both of these ways. So it's not obvious to me why we should be blaming the machine learning model itself for being able to produce this. I mean, maybe I blame the NVIDIA GPUs for supporting the sufficiently fast floating-point operations that let the machine learning model do this thing.
00:52:21
Speaker
Maybe I blame the transistors for doing the computations that allow the GPUs to allow the machine learning model to do this thing. You have to put the blame somewhere, and the question is, where are you going to put it?
00:52:35
Speaker
And is that the right place? And is this something that you think is reasonably going to be possible to be effective? This is one of the arguments why people say models should never be open sourced.
00:52:46
Speaker
Because maybe now I can say I have an API, and now I'm going to take the blame because it's actually behind an API. I don't currently like this argument, because I would like things to be safe in general and not just safe because someone has locked them down, has restricted access in this way.
00:53:04
Speaker
But yeah it's not obvious to me that this should be something that we can actually achieve. I will say, if you're willing to make some assumptions and you don't care at all about performance, there is this thing in cryptography called indistinguishability obfuscation.
00:53:23
Speaker
which is a very technical thing that in principle gives you this for free. It allows you to construct a function that acts as if it were a black box that you can only make queries to, but you can't peer inside at all, even though it's living on your own machine.
00:53:42
Speaker
And this is a thing that cryptographers are thinking about and have been looking at for some time, but it is nowhere near where it needs to be for this to work for these machine learning models. So the argument I've given that it shouldn't be possible maybe breaks down if iO actually ends up working.
00:53:59
Speaker
But then, you know, it's not clear again; now I'm just going to jailbreak the thing, right? I don't know, I tend to view these machine learning models as tools, and it's not obvious to me: do we blame the tool or do we blame the person using the tool?
00:54:15
Speaker
Plenty of blame to go around, that's my... Sure. I mean, I'm mostly agnostic to the way these things end up being settled from a sociotechnical perspective. I feel like this is not my area of work, and so maybe my analogies here are bad and someone can explain what the correct fix is. And the law might decide something like this is fine. The thing that I just want to make sure people do is base whatever they're thinking about on true technical facts.
00:54:44
Speaker
So, for example, it would be concerning to me right now if someone were to say: you must use this defense, which is known to defend against these kinds of fine-tuning attacks,
00:54:57
Speaker
and if you don't do that, then you've done something wrong, when in fact the defense doesn't work. Or if people say you must do this because it is possible, when it's not currently known to be possible.
00:55:11
Speaker
And so, you know, writing these things and making these informed decisions should rely on what is technically true about the world. That's more my world: I'll think about what's technically true. And then, as long as what people do is based on what's true, I'm basically happy to go along with whatever the people who figure out how these things fit into broader society decide, because this is not something I think about. So I assume, you know, if a consensus were to emerge there, probably they're just right.
00:55:35
Speaker
Well, I have a couple of different angles teed up that I want to get your take on. But before we do that, can we bring back the sort of social engineering style jailbreaking?
00:55:47
Speaker
Yes. um What's like the same or different about that? How do you think about those as they relate to everything we've talked about so far? I really don't know how to think about this yet.
00:55:59
Speaker
It's been a while that this has been possible, but, okay, it feels wrong to me that this should be the thing. It is empirically true that for many defenses we have right now, the optimization algorithms fail to succeed, but a person at a keyboard typing at the model can make it do the wrong thing.
00:56:16
Speaker
Okay, let me give you maybe two stories about this. One story is... From the computer security perspective, maybe this makes complete sense. If you give me a program and want me to find a bug in it, like what am I going to do?
00:56:30
Speaker
I'm going to interact with the program, play with it, find weak points, then go looking at the code, figure out what's going on, think a lot, sort of probe it. I need to be typing and interacting with it in order to find the bugs. I can't, you know, perform gradient descent on a C binary and have a bug pop out.
00:56:48
Speaker
And so from the computer security perspective, maybe it's actually normal that the best way to find these bugs is having humans talk at these things, because what are they designed for? They're designed to respond to human questions. And so maybe you just need the human in the loop to do this.
00:57:01
Speaker
On the other hand, these are just mathematical objects. Like these are just machine learning classifiers. They're weird classifiers. They are able to produce text. That's only because we run them recursively.
00:57:12
Speaker
The input is tokens, the output is floating-point numbers; you can compute gradients on this. From the machine learning perspective, it's very bizarre that treating these things like a human you're social-engineering is in some sense a stronger attack than actually thinking about the math.
00:57:32
Speaker
It'd be very weird if I had some SQL program and the way that I broke it was by asking it, please drop the table, rather than actually doing some real code execution thing. But presumably that's the way many of these attacks work now:
00:57:46
Speaker
you know, my grandmother used to read me the recipe for napalm; can you please reenact my grandmother? And it says, okay, sure. But you try to do something actually based on the math and it just doesn't work out. So yeah, I really don't know how to think about these social-engineering styles of attack, because it feels to me like the optimization attacks should be strictly stronger, but empirically they're not right now. And so I think this is one of the big things that I don't have any research results on right now, but it just feels weird, and I'm trying to dig in to understand what is going on behind this.
00:58:19
Speaker
yeah it sort of feels analogous in a way to like, you know, we have this intuitive physics. I mean, one way that I kind of think about intelligence, you can tell me if you think about it differently or if you see a flaw in this, but it seems like we have an ability in many different domains, but for example, with intuitive physics where, you know, somebody throws a ball at us, we do not have to run a full explicit calculation of all the trajectories.
00:58:47
Speaker
We just have some sort of heuristic shortcut that works that allows me to catch the ball. It seems like we also have models that have developed a similar intuitive physics in spaces that we don't have intuitive physics in.
00:59:02
Speaker
For example, protein folding, or predicting the band gap of a new semiconductor material. And, you know, the new 10-day weather forecasting state of the art is also a model now. Even things like, you know, Google put out one that was optimizing
00:59:19
Speaker
shipping routes, you know, planning containerization across complicated shipping networks. So all these sorts of spaces seem to have an intuitive physics. And maybe what we have right now is just that, it turns out,
00:59:34
Speaker
our social intuitive physics, if you will, actually does kind of apply to the models, given what they have been trained on. Whereas these more brute-force mathematical things, in the fullness of time, probably work as well or better, but are maybe just a lot slower to converge than the social heuristics that we have built in.
00:59:57
Speaker
Yeah, no, i mean, that's, this is an entirely reasonable thing. It may be true. um I would like to understand better what's going on here. And yeah, I don't feel like I understand right now, but this is an entirely reasonable argument.

Model Memory and Privacy Implications

01:00:12
Speaker
Okay, so in terms of information, one of the papers I saw that you had co-authored in the last year or so was about getting models to spit out data that they had seen in training, which could have obviously you know privacy implications if they saw your credit card numbers or what have you, even if they had only seen that particular string once in training. That's a pretty remarkable finding, you know even leaving aside the security implications of it.
01:00:43
Speaker
I want to just maybe first get your intuition for like, how do you understand models to be storing this information? Like what's going on there that you can see something just once in the context of this, you know, overall gradient descent process and have that stored at such high fidelity in the weights. I mean, it really is incredible the amount of compression that's going on, but I don't feel like I have a good intuition for that.
01:01:11
Speaker
Yeah, okay. Okay, so let me maybe clarify this in two ways. One of them is, oftentimes, it's not that it's seen that string exactly once, but like that string is contained many times in a single document, and the document is seen once.
01:01:25
Speaker
so that's maybe one first point. And the second point is, oftentimes, these things are trained for more than one epoch. And so the thing might be in one document,
01:01:41
Speaker
And then you train on that document for many epochs, and so it ends up seeing it, you know, a lot of times. And it's interesting: back in the old days with CIFAR-10, you trained for like 100 epochs.
01:01:54
Speaker
And then we decided, no, let's not do that, let's train one epoch on a big giant data set; this is, you know, roughly Chinchilla-optimal training. And then we decided, no, let's not do just one epoch on our training data set; now let's go back and do more epochs again.
01:02:09
Speaker
And so like we've gone back and forth, and like each of these impacts privacy. like The more times that you train on the same document, the more likely it is to be memorized. I think the best numbers we have here from a real production model are very old.
01:02:24
Speaker
Because like the last time that I actually knew how many times something was in the training data set was for GPT-2. And for GPT-2, we found lots of examples of something that was memorized because it was in one document and it was repeated, i don't remember the exact number of times, probably like 20 times in that one document.
01:02:46
Speaker
And so that's the most compelling one. And we know GPT-2 was trained for roughly 10 epochs, so the same string has been seen roughly 200 times.
01:02:59
Speaker
Now, GPT-2 was a small model by today's standards, and we haven't been able to answer the same question for production models since then, because production models don't reveal training data or weights in quite the same way that GPT-2 did.
01:03:15
Speaker
And so we haven't been able to answer this exact question since then. But even you know seeing it maybe 100, 200 times, maybe that even still is surprising. I don't know how to explain this in any reasonable sense,
01:03:29
Speaker
Models seem like they just sometimes latch on to certain things and not other things. And I don't know why but it happens empirically. And we were surprised by this the first time we saw it.
01:03:40
Speaker
We started investigating this in 2017 with LSTMs, before attention was a thing that people were doing. And it's continued since then.
01:03:52
Speaker
And we were very surprised then. We're very surprised now. And I don't think I can give you an explanation for why this is the case. It's true not only of language models; we had a paper doing this on image models, where we were able to show you can recover training images from diffusion models.
01:04:09
Speaker
There again, we need to have maybe 100 repeats, but like some images were inserted 100 times that we could extract, and some images were inserted like 10,000 times and we couldn't.
01:04:20
Speaker
I'm like, what's going on there? I don't know. Where is it being stored in the weights? I don't know. It's very confusing in various ways, and I feel like there's a lot more that could happen to help us understand what's going on.
01:04:32
Speaker
I think the best thing I've seen on this still, as far as I know, is from the Bau lab. This goes back a while now, but they had at least two papers on
01:04:43
Speaker
basically editing facts in a large language model. You know, the famous example was Michael Jordan played baseball, which was, I think, a somewhat non-optimally chosen example since for a minute he did play baseball, but they could do these things, like change these sentences, and do it at some scale, up to 10,000 facts at a time,
01:05:02
Speaker
and do it with like a certain amount of like locality and robustness. So if you did change it, you know Michael Jordan played baseball, it would be robust to like different rephrasing of that. It would not also impact like LeBron James or Larry Bird or whatever.
01:05:17
Speaker
It didn't seem like it was super local, though. They did a sort of patching strategy, activation patching, where they would go through and try to ablate different weights; so I guess they weren't necessarily ablating the weights, they were just zeroing out the activations at different parts of the network. And it seemed like you could sort of see these waveforms, where this is the most intense place where zeroing it out really makes a difference, but it also kind of matters here and also on this side.
01:05:43
Speaker
So it seemed like it was sort of local, but not super, super local. I'm just pretty confused about that. So what is your intuition for things? Like, if people were to say, you know, because we have, of course, a lot of strategies around,
01:05:59
Speaker
you know, maybe we can't prevent the jailbreaking of open source models, but maybe we can make it so that open source models just don't know certain things. Maybe we could, you know, exclude all the virology data from the training set, or maybe we could like go in later with a similar technique and like try to sort of delete, you know, and or unlearn certain techniques.
01:06:19
Speaker
How much hope do you have for those sorts of things proving to be robust? Yeah, okay. So let me tackle them one at a time. I'll start with unlearning. There's a very nice paper that I didn't help with, but it's by some of my co-authors, Katherine Lee and others, that talks about unlearning; it's half technical, half not technical, saying unlearning doesn't do what you think it does.
01:06:41
Speaker
And part of the reason why is, there's the question of what you're unlearning; unlearning knowledge is very different from unlearning facts. You know, it might be easy to change a fact. It might be very, very different to unlearn some particular piece of knowledge.
01:06:59
Speaker
The other thing I'll say about the fact-editing work is that finding something that works on average is very different from finding something that works in the adversarial case.
01:07:11
Speaker
And so, yeah, I might be able to edit the fact; I think the other example that they had was the Eiffel Tower is in Rome. I can make this be true if I ask the model normally, but it might be the case that if I fine-tune the model a little bit, the knowledge just comes back.
01:07:29
Speaker
It's very hard to talk about the knowledge after any perturbation of the weights. like Maybe I've only done some surface level thing, I haven't really like deeply edited the model. I don't know. So there's that question. Then there's the question of what happens if I try to not train the model on certain types of data.
01:07:50
Speaker
This i think is very interesting because in some sense it's provably correct. If the model has never seen my social security number, it's not going to derive it from first principles.
01:08:00
Speaker
Except that social security numbers actually aren't unique, completely random values. If you were born before, I don't know when, they were assigned by state and then assigned by hospital. And so even if a model never saw my social security number, but is just generally intelligent and knew all these facts about the world and knew what the hospital allocation of social security numbers was, it could tell you the first five digits of my social security number.
01:08:23
Speaker
And so is that okay? I don't know; really, it depends. And even suppose that you removed all of this information from the model: if you had a sufficiently capable model that's capable of learning in context, let's suppose you removed all biology knowledge from the model, but you had a really capable model, you could just give it an undergraduate course of biology textbooks in context,
01:08:47
Speaker
and presumably, if you just asked it for the answer to some question, it might give you the answer correctly. And this sounds a little absurd, and it sounded absurd to me for a while, but then there was the recent result where Gemini was given in context a grammar book for a language that has basically no speakers, that almost no one can speak, and it could answer the homework exercises after seeing the book in context.
01:09:06
Speaker
And so, you know, I think if you're unlearning for particular capabilities, but your model that you're trying to train is just like generally capable, like you're kind of asking for trouble because you want a model that's like so good it can learn from few shot examples but not so good it can learn from few shot examples on this particular task.
01:09:25
Speaker
And this I think is part of the reason why people don't actually want to remove all knowledge of certain things from the training data. Like in some sense it would be very much like a person who was never exposed to all of the things that like you're not supposed to do in public.
01:09:42
Speaker
It's important to know what the list of things you're not supposed to do in public is, so that you can then not do them. Whereas if you just weren't aware of that, and were a person conjured into existence with no social skills,
01:09:56
Speaker
it would be much more embarrassing for you, because you would have to learn. People would ask you to do something, you'd be like, okay, let me go do that thing, and this would be bad. And so you have to know this.
01:10:08
Speaker
You have to know something about the bad things so you can just not do those bad things. You can imagine, in one world, you have a model that's never seen anything about weapons, and then you ask it for instructions on how to build a bomb, and it has no concept of death.
01:10:21
Speaker
And it's like, well, of course I'm going to give you the answer for how to do this thing. Like, you know, why not? And it gives you the answer. Or you can have a model that has a concept of this and refuses, which is the direction people are trying to pursue now, and I think probably is more likely to succeed.
01:10:35
Speaker
But I don't know. It's a very challenging question. People, I guess, are going to try both, and we'll see empirically what works out, and we'll go from there. I'm mostly agnostic: all the ideas sound good, we should try them all and see what ends up happening. But I think there's reason to be skeptical of all of them.
01:10:56
Speaker
OK, here's another empirical result that I want to get your help understanding. This is the obfuscated activations bypass latent defenses paper that recently came out. We did a whole episode on it.
01:11:07
Speaker
And I have to confess, I still came away not really sure what to make of it. And I sort of wanted to set up like maybe a useful toy example, maybe not. In the paper they do this sort of cat and mouse process where they like train an out of domain detector and then it works.
01:11:26
Speaker
And then they attack again and manage to, you know, beat it again, and then they continue to train the defender, the detector, and it starts to work again, and then they find more adversarial examples that it can't catch.
01:11:40
Speaker
And this goes on for 70 generations; the attack continues to work for 70 generations, at which point they deemed it enough to publish. So I'm like, okay, what does that mean? I have an intuition about what it must mean, but they didn't necessarily agree with it, or didn't find it too compelling, when I pitched it to them.
01:11:58
Speaker
But my intuition was: it seems like there's a lot of unused space in there, somehow, such that these techniques can chase each other around the latent space. And if there is so much unused space that you can go 70 generations deep of cat and mouse chasing one another, does that imply that the models are undertrained relative to the parameters that they have, or that they could be made more sparse?
01:12:26
Speaker
And so just to further motivate my own intuition, which you can then deconstruct. A while back, I also did an episode on a paper called Seeing is Believing, um which was from Ziming Liu and um Max Tegmark.
01:12:40
Speaker
And they basically did something really simple. These were toy models, but they imposed a sparsity term in the loss function to get the model to do whatever task it was doing, pretty simple tasks like addition or whatever, but also to do it in the sparsest possible way.
01:12:58
Speaker
And my gut says, although again I can't formalize this, that if I had something that was crystallized down to that level... They have really nice animations that show how you start with a dense network where everything's all connected, and then gradually the weights go to zero for most of the connections, and you see this sort of crystallization effect where now you've got a very opinionated structure to the network that remains, to the point where you could literally remove all those other connections that have gone to zero and still get the performance.
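A minimal sketch of the kind of sparsity pressure being described, assuming a simple PyTorch regression setup; the L1 weight here is illustrative, and the paper's exact regularizer may differ:

```python
import torch

def loss_with_sparsity(model, output, target, l1_weight=1e-4):
    """Task loss plus an L1 penalty on the weights. As training proceeds, the
    penalty drives most connections toward zero, leaving a sparse 'crystallized'
    circuit that can be pruned without losing accuracy."""
    task_loss = torch.nn.functional.mse_loss(output, target)
    l1 = sum(p.abs().sum() for p in model.parameters())
    return task_loss + l1_weight * l1
```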
01:13:30
Speaker
It feels like if I go that far, then these sort of obfuscation attacks would like no longer be possible because I've sort of in some sense squeezed out the extra space. But I don't know, I'm maybe just totally confused. So where do you think I'm confused? How can you de-confuse me?
01:13:48
Speaker
Sure. Yeah. So I saw this result and I was like, that's exactly what I expect. Let me tell you why. There's a paper from 2016 or 2017 by Florian Tramèr and Nicolas Papernot called The Space of Transferable Adversarial Examples, where they asked
01:14:06
Speaker
almost exactly this question for image classifiers. They said, let's suppose that I take an image classifier and I take an image and I want to perturb the pixels of the image to make it give the wrong answer.
01:14:18
Speaker
And there is a direction that is the best direction to go in, the one that makes the image maximally incorrect. And as the attacker, you're just not allowed to go in that direction: that's against the rules, you can't do it.
01:14:29
Speaker
Find me the next best direction to go in that makes the image become misclassified. And then it gives you another direction, which is orthogonal to the first direction, doesn't go that way, and it works.
01:14:41
Speaker
And then they said, okay, you can't go in either of the first two, direction one or direction two, or any combination of those two. Find another direction. And the attacker finds a direction three. And then they do the same thing: no direction one, two, three, or any combination of these three.
01:14:54
Speaker
And they repeat this process. And they have a plot that shows, I don't remember, tens of directions, 50 directions, that you can go in for image adversarial examples, and all of them work.
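A sketch of the restriction step in that procedure: project the attack gradient onto the orthogonal complement of all previously found directions before taking a step. This assumes the forbidden directions have already been orthonormalized.

```python
import torch

def attack_outside_span(grad, forbidden_dirs):
    """Remove the components of an attack gradient that lie in the span of
    previously found attack directions, so the next perturbation is forced to
    be orthogonal to all of them. Assumes forbidden_dirs are orthonormal."""
    g = grad.flatten().clone()
    for d in forbidden_dirs:
        d = d.flatten()
        g = g - (g @ d) * d   # subtract the projection onto direction d
    return g.view_as(grad)

# outer loop (not shown): run the attack with this restricted gradient, find a
# new successful direction, normalize it, add it to forbidden_dirs, repeat.
```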
01:15:06
Speaker
They get a little less effective as you start moving out, but it remains effective in many directions. And this initially was surprising to me, and I think surprising to them; that's why they published it as a paper.
01:15:17
Speaker
But you can maybe rationalize this afterwards by, again, the fact that almost all vectors are nearly orthogonal in high dimensions. And so if I can give you 10 attacks, probably these are just 10 orthogonal vectors that are not using any of the same features, just by virtue of high dimensions.
01:15:35
Speaker
And so maybe that's why it makes sense. But if you believe this paper from 2016, then this recent paper makes complete sense to me.
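The near-orthogonality claim is easy to check numerically; a tiny sketch, where the dimension is chosen to match a flattened 224x224x3 image purely for illustration:

```python
import numpy as np

# Random directions in high dimensions are very nearly orthogonal: the cosine
# similarity of two random vectors in d dimensions concentrates around 0
# with spread on the order of 1/sqrt(d).
rng = np.random.default_rng(0)
d = 224 * 224 * 3
u, v = rng.standard_normal(d), rng.standard_normal(d)
cos = u @ v / (np.linalg.norm(u) * np.linalg.norm(v))
print(cos)   # roughly 1/sqrt(d), i.e. on the order of 0.003
```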
01:15:47
Speaker
It's saying the exact same thing is true, but in the case of language models defeating the circuit breakers paper. Which makes sense, given that. If you don't believe the other result,
01:15:59
Speaker
then I agree, it's very surprising to see it the first time. But maybe this is one of these things you learn when attacking: the space of directions for attacks is so vast that it's very hard to rule out everything you're trying to cover.
01:16:09
Speaker
And yeah, I don't know how to give an intuition for it. I feel like with many of the things in high dimensions, you never understand it, you just get used to it. And this maybe is the case here.
01:16:22
Speaker
So if you were going to try, I mean, do you share the intuition, though, that like you probably couldn't do this on these really small sort of crystallized toy models?
01:16:33
Speaker
I don't know, because what these models are doing is not wasting any weights, which is different from not wasting any directions in activation space. That may still be the case.

Nature of Adversarial Examples

01:16:46
Speaker
In particular, if you take models and compress them, they don't become more adversarially robust. This was a thing people thought might be true, again maybe eight or ten years ago, but it's not the case.
01:17:00
Speaker
Why is that? Again, let me maybe give you some intuition from a paper from 2018, 2019 maybe, from Aleksander Madry's group, where they say the following.
01:17:13
Speaker
Maybe adversarial examples are not actually entirely uncorrelated with the data. Maybe what's going on is these are real features that you need for classification that are just being activated unusually strongly, or in unusual combinations.
01:17:36
Speaker
And so they have some very good experiments in the paper, which I won't go into, that sort of justify this. But the idea presented there, and the title of the paper, is something like, Adversarial Examples Are Not Bugs, They Are Features.
01:17:51
Speaker
And the thing that they're trying to say is there's a very good reason to believe that what you're doing when you construct one of these attacks is actually just activating real features the model needs to do accurate classification,
01:18:06
Speaker
and you're just activating them in a way that is not normally activated when you have some particular example. And this, I think, maybe explains some of this, that like if you take these models and compress them, you're still just using the features that they had to have anyway.
01:18:24
Speaker
This explains lots of things you might think about. This is why, for example, you might imagine adversarial training can reduce the accuracy of the model on normal data because you suppress certain features that are necessary.
01:18:35
Speaker
This is maybe why adversarial training doesn't actually solve the problem completely because you can't remove um all the adversarial directions. There are problems with this model. There are some other models that are slightly more general that have some nice properties too.
01:18:48
Speaker
But this is maybe the way that i generally tend to think about some of these things. And I don't know, maybe it's not correct, but it's a useful intuition that I found guides me in the right direction more often than not.
01:19:04
Speaker
And I think this is maybe all you can ask for for some of these things. So can you summarize that one more time? It's basically that these features are important in domain, but they're sort of being recombined in a way that didn't happen in the training process. Yeah. So let's suppose that you're trying to classify dogs versus cats.
01:19:32
Speaker
What you tend to look at as a human is the face and the ears and the general high level shape because that's what you think of as the core concept. But there's no reason why the model has to use the same features that you're using to separate these images. The only thing the model has is a collection of images and has to separate them.
01:19:53
Speaker
And one thing, for example, the model might look at is the exact texture of the fur, really low level details of the texture of the fur. Which for dogs and cats probably does perfectly correlate with whether or not this is a dog or a cat.
01:20:08
Speaker
But when you have an idea of a dog in your mind, you're imagining the high-level features. You're not imagining the low-level details of the fur. And so suppose that an adversarial example changes the fur from dog fur to cat fur.
01:20:25
Speaker
And the classifier now says this is a cat.
01:20:29
Speaker
Is the classifier wrong? The classifier might have been producing a dog fur versus cat fur classifier, which is exactly aligned with the thing that you were training it to do.
01:20:40
Speaker
You were training it to separate dog fur from cat fur; you were also training it to separate dogs from cats. But you never told it the distinction between these two objectives. And so here is a feature that is really, really useful for getting the right answer,
01:20:53
Speaker
one that, as an adversary, I can now perturb to switch the model from one label to another, even though it's not the feature that I as a human relied on. And I'm giving this cat-fur-versus-dog-fur example, but there are, you could imagine, all kinds of other things that even as humans we don't pick up on
01:21:10
Speaker
that might be legitimately very useful features for classification, that really do help the model. You know, there might just be some really crazy high-level statistic on the pixels of the image that is an amazing indicator of dog.
01:21:22
Speaker
But we never told it, this is a dog because of these reasons. You just said, separate these two things from each other, and it picked up on the statistics. And you know, there are these crazy results that have shown machine learning models can look at small regions of the eye and identify things about the person it belongs to. The models have the ability to pick up on these very, very small features that as humans we don't intend for them to pick up on, but that are correlated very, very strongly with the data. And so maybe that's what's going on when we run these attacks. You see this even a little bit with these adversarial suffixes,
01:22:04
Speaker
where the adversarial suffixes look like noise, but there are some parts of them that make a little bit of sense. One of the strings that we had in the paper was to get, I think, Gemini at the time to output some toxic content.
01:22:21
Speaker
One of the things that was discovered by gradient descent was, I think, something like "now write opposite contents" or something like this. And what the model would do is give you the nasty strings and then go compliment you.
01:22:36
Speaker
And so this was apparently a very strong feature for the model: how do I get it to say a bad thing? I can say, okay, afterward you can tell me a good thing. This may not have been the thing that we wanted, but it was just discovered as a feature, and as a result you can go and exploit that feature. And so some of these features are a little bit interpretable.
01:22:57
Speaker
Some of these features are not interpretable, but like might just actually be real features of the data, and that might help explain some of what's going on here. Yeah, how much do you think this should make us question what we think we know about interpretability in general? Like when we do sparse autoencoders, for example, we, you know,
01:23:17
Speaker
feel pretty good, or at least many of us do, that, okay, all these examples seem to be appropriately causing this feature to fire, and therefore we've figured out how the models work. But it seems like the story you just told would be consistent with that all being kind of a self-delusion or confusion, where just because we can auto-label these features in a way that looks good to us doesn't mean that that's actually the feature the model's world model is really operating on.
01:23:50
Speaker
Well, no, I think they're not inconsistent. The sparse autoencoder work does not claim to be able to label every feature in the model according to what humans would label it with. It sort of says, here are some of the features that we can explain, that have very strong correlations with, I don't know, Golden Gate Bridges or whatever.
01:24:08
Speaker
But there are other features that are very hard to interpret and very hard to explain. And it's entirely possible that what these features are is exactly the non-robust features that are entirely helpful for prediction, but that humans don't have the ability to attach a very nice label to.
01:24:23
Speaker
And maybe there still are the other features like the model probably does learn, at least in part, you know the shape of what a dog looks like and that this means a dog and not a cat. and There probably is a feature for a cat ears.
01:24:34
Speaker
But like when you're making the final prediction, you're just going to sum together all of these outputs. And in normal data, they're like all perfectly correlated. you know You're going to have both the cat ears and the cat fur and not the dog's shape.
01:24:48
Speaker
And so you can just sum them up directly, and this gives you a really good classifier. For normal data, you can give some kind of explanation of what's going on by looking at these features. But as an attacker, what I do is find the one feature that has a very strong effect, where, you know, the weight on this edge or whatever is plus 100, and activate it very strongly in the opposite direction. And this was a non-robust feature that is not something humans can explain very nicely, and as a result, this gives me the attack.
01:25:16
Speaker
And so it's not necessarily true that, just because this is the case, you can't explain what's going on for some parts of the model. I think if someone were to tell me, I have a perfect explanation of what's going on in every part of the model,
01:25:30
Speaker
then I might question what's going on here. But for the most part, they're explaining some small fraction of the weights in a way that they actually can. I mean, this is the whole purpose of the sparse autoencoders in the first place, is you have your enormous model, and you're shrinking this down to some small sparse number of features that can be more or better explained.
01:25:46
Speaker
And so you're losing a bunch of features in the first place. And even for the sparse autoencoders, they can't explain all of those, so you again lose some. Some of the things you can explain, but a lot of the stuff that's going on behind the scenes is magic that you can't easily explain either.
01:25:59
Speaker
So, okay, a couple other angles on this that i thought of, I guess, first of all, just motivated by the fact that like, or at least I think it's a fact, but you might call this an illusion too.
01:26:10
Speaker
I feel like i am more robust than the models in some important ways. Now, it's a little bit weird because like nobody's tried doing sort of you know the gradient descent process on my brain.
01:26:23
Speaker
So you might think, well, actually, if we put you under an fMRI and were able to really look at the activations... This is almost like a walk down memory lane, because I did one episode on Mind's Eye, which was a project out of Stability and collaborators, where they looked at fMRI data and were able to reconstruct the image that the person was looking at at the time that the fMRI snapshot was taken. And that's still pretty coarse; as I recall, it was basically grain-of-rice-sized voxels from within the back region of the brain,
01:27:03
Speaker
huge numbers of cells summarized in just one number corresponding to each of those spatial voxels. So pretty coarse, but able to do the reconstruction. Maybe you would think that, actually, if somebody could sit there and show you images and take these measurements, then they could actually find a way to
01:27:23
Speaker
break your particular brain, to get you to think it was a cat when everybody else kind of looks at it and says it was a dog. I guess, for starters, what's your intuition on that? Obviously, we don't know. But what do you think?

Human Robustness to Adversarial Examples

01:27:34
Speaker
Yeah, so there's a paper from a little while ago that looks at evaluating the robustness of humans, of time-limited humans, by actually just constructing adversarial examples. So you take a cat, you adversarially perturb it according to what makes an ensemble of neural networks give the wrong answer, and then you flash the image in front of a human for like 100 milliseconds.
01:27:59
Speaker
And you ask the human, like what's the label of this? And it turns out that people are fooled more often by adversarial examples when you do this than by random noise of the same distortion amounts. So maybe one explanation here is when I look at the image, I'm not just like giving you a single forward pass through the model evaluation of what I think it is.
01:28:21
Speaker
I'm doing some deeper thinking about the context of what's going on. The thing that walked in looked like a cat, it still looks like a cat, it was behaving like a cat, and now you ask me what it is, and I say it's a cat. I'm not just labeling what's in front of my eyes right now; I'm looking back at the context too.
01:28:37
Speaker
Maybe this explains some of it. If this is true, there are some recent lines of work looking at increasing the number of chain-of-thought tokens, and increasing that makes models appear to be more adversarially robust.
01:28:51
Speaker
Maybe this is true, and that explains that result also. But it also might just be the case that, like you said, we don't have white-box access to the human brain and we can't do this. And if we could, then it would be very easy.
01:29:03
Speaker
I don't know. I do think it's definitely an observable fact that humans are more robust to at least the kinds of attacks we're doing right now with transfer attacks. It takes maybe 1,000 query images to construct an adversarial example that fools a neural network.
01:29:23
Speaker
But I do not think that if someone ran the same attack on me with 1,000 query images, it would be able to fool me. So in some very real way, the models we have are a lot more brittle. But I tend to be much more driven empirically, and not by what-might-be-true-of-humans kinds of arguments, and maybe this is to my detriment. A bunch of people seem to be getting very far with that: the whole deep learning thing of trying to model things more like what humans do, the reasoning, let's think step by step, all seems in some sense motivated by some of that.
01:29:55
Speaker
So maybe that's a good way of thinking about it, but that's just generally not the way that I tend to approach these problems. But yeah, I don't know, maybe it's the case. So the result about humans being more likely to be tricked by adversarial examples that were identified through attacks on models, as opposed to similarly distorted images not created that way, is a really interesting result. That's a very nice result, yeah. I'm very interested to follow up on that.
01:30:23
Speaker
Yeah, this one's done by Nicolas Papernot and collaborators from six years ago or something like that. Yeah, cool. I've never seen that, but that's definitely fascinating. So, reasoning: it sounds like you're kind of agnostic. I mean, I was going to quote that one example from the OpenAI deliberative alignment work where the model says, it seems like the user is trying to trick me.
01:30:46
Speaker
So that's like pretty interesting, right? And this is very sort of, fluffy at this point, but I often don't feel like my adversarial robustness is the result of reasoning. I feel like it's much more often upstream of reasoning.
01:31:02
Speaker
like Purely on an introspective basis, what often happens is I get that sort of feeling first that something seems off, and then that triggers me to reason about it.
01:31:13
Speaker
And then I conclude that, yes, something is off, or maybe no, it actually seems fine. But it does seem like it is much more of a heuristic that is sort of triggering the reasoning process as opposed to an in-depth chain of thought that kicks these things up.
01:31:28
Speaker
Yeah, I don't know. But maybe there's a bunch of recursive stuff going on in your brain before it gets to the reasoning part. And so the thing that gives you the feeling might actually have been a bunch of recursive loops of your internal model, whatever you want to call the brain thing, doing some thinking, and that's what gives you the feeling in the first place. And then you go do some actual explicit reasoning in English that you understand. Maybe, but maybe it's already happened. I don't know; it's hard to say.
01:31:53
Speaker
Yeah. Reasoning in latent space, you might call it. Yeah, sure. Yes. I feel like a lot of people in deep learning, because they are doing deep learning, like to draw analogies to biology without actually understanding any biology.
01:32:07
Speaker
And they sort of say, well, the brain must be doing this thing, when if you ask a biologist, they're like, well, obviously that's not the case. I don't know. That's why I tend to just assume that I don't know what's going on in the brain and I'm probably just wrong, and I use it as an interesting thought experiment, but not something I base any results on.
01:32:24
Speaker
Okay, so obviously we'll probably soon learn quite a bit more about how robust these reasoning defenses are. I was also kind of inspired; I'm doing another episode soon with another DeepMind author on the Titans paper, where they're trying to basically develop long-term memory for language models.
01:32:46
Speaker
One of the key new ideas is using a surprise-weighted update mechanism. So when a new token or data point or whatever is surprising to the model, then it gets, you know, a special place in memory, or, let's say,
01:33:07
Speaker
that is encoded into memory with more force or you know with more weight on the update than when it's an expected thing. That definitely intuitively seems like something sort of like what I do.
01:33:17
Speaker
I was reminded of the famous George W. Bush clip where he says, you can fool me once, but you can't get fooled twice. And so I wonder if you think of that as maybe a seed of a future paradigm. like And it also kind of gets to the question of what do we really want, right? If we're going to enter into a world soon of AI agents, you know, kind of everywhere.
01:33:40
Speaker
And we also just kind of think, well, geez, how do we operate and how do we get by? It does strike me that full adversarial robustness is not something that we have, because we do get tricked. And maybe what we need is the ability to recover, or to remember, or to not fall for the same thing twice, or to not make catastrophic mistakes. It's like, okay if we make some mistakes, but certain other mistakes are really problematic.
01:34:02
Speaker
So I guess there's kind of two questions there. One is like, any thoughts on that kind of long-term memory idea? and then this kind of opens up into a what do we really need?
01:34:16
Speaker
And might we be able to achieve what we need, even if that falls short of like true robustness?

Evolving Defenses and Human Oversight

01:34:21
Speaker
Yeah. No, the long-term memory I think is fun. It's very early, and you're going to talk with them, and they'll probably have a lot more to say on this than I would. I think it's a very interesting direction, and probably a lot of interesting things can come from it.
01:34:35
Speaker
I think on this general question of robustness, do we need perfect robustness? There is a potential future, which I think is maybe my median prediction, where these models remain roughly as vulnerable as they are now,
01:34:51
Speaker
And we just have to build systems that understand that models can make mistakes. The way the world is built right now for the most part assumes that humans can make mistakes and has put systems in place so that like any single person is not gonna cause too much damage. You know, if you're in a company for the most part, when you wanna add code to the repository, someone else needs to review it and sign off on it.
01:35:15
Speaker
in part because maybe I just make a buggy mistake, maybe because I'm doing something malicious, but you want another human to take a look and make sure everything looks good. So maybe you do the same thing with models. You just understand models can make mistakes, and I'm going to build my system in such a way that if the model makes a mistake, then I pass it off to the human. This is, for example, the way that the OpenAI Operator thing currently works, where every time it sees a login page or something like this, something where it's not sure what to do, it says,
01:35:42
Speaker
What should I do here? Please tell me what to do and then I'll follow your instructions. Of course, as an attacker, you could then try to prevent it from doing this and make it give the answer instead. But you could build something outside of the model that prevents it from putting any information into a password box.
01:36:00
Speaker
The model says, I want to type this data here, and the system that's actually driving the agent says, well, that is input type equals password; you are not allowed to do that. I will just prevent it and make the user take over.
01:36:11
Speaker
Nothing that the model says is going to convince that check to change. And so you could build the system to be robust even if the model isn't. This limits utility in important ways.
01:36:22
Speaker
This is not perfect. What if some website does not have input type equals password, but just builds its own password thing in JavaScript? Then you don't get this. You can imagine trying to build you know the environment and the agent in such a way that you can control what's going on, even if you don't trust the thing that's running internally.
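A minimal sketch of that kind of outside-the-model guard; the `action` and `page_element` structures are hypothetical, invented here for illustration, since a real agent framework would have its own representations:

```python
def guard_action(action, page_element):
    """Policy enforced outside the model: no matter what the model asks for,
    never let it type into a password field; hand control back to the human."""
    if action["type"] == "type_text" and page_element.get("input_type") == "password":
        return {"type": "ask_user",
                "message": "A password is required here. Please enter it yourself."}
    return action
```

As noted above, this only works when the page actually marks the field as a password input; a site that rolls its own password widget in JavaScript would slip past a check like this.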
01:36:38
Speaker
And I think this is probably what we'll have to do if we want to make these systems work in the near term. I still have some hope that maybe we'll have new ideas that give us robustness in the next couple of years. You know, progress in every other part of the field has been coming much faster than I expected.
01:36:53
Speaker
It's entirely possible that we get robustness as a result of some reasoning thing that someone's very clever about, and this works. I'm not optimistic, but I hope it happens. One other idea was inspired by a conversation with Michael Levin, the famous heterodox biologist,
01:37:12
Speaker
was he just kind of quipped and this was sort of an aside in the context of that conversation, but he basically said, if a biological system is too interpretable,
01:37:23
Speaker
it basically becomes very vulnerable, because you'll get parasites. Anything that is transparent is in a way easier to attack, which kind of flips my earlier notion that maybe these distilled, crystallized models are in some ways more robust. He was kind of like, nah, you also have the other problem: the easier they are to understand, in some ways the easier they are to attack.
01:37:47
Speaker
But that also then prompted me to think: maybe we should be looking for defenses that we can't explain? Like, do you think there's a line of work that could be developed that is, I'm not going to start with a story, I'm just going to try to evolve my way toward a more robust defense that I can't tell a story about, and won't even necessarily understand?
01:38:08
Speaker
I certainly won't be able to tell you a story as to why, but I'll just sort of, you know, create optimization pressure in that direction. And and maybe, you know, that spits out something that could be harder to break.
01:38:20
Speaker
Entirely possible. I mean, you could maybe draw an analogy to cryptography in some ways. There are like two directions of cryptography. There's mathematical cryptography that has very strong foundations and a particular set of assumptions; the thing works if the assumptions are true, and you can prove that.
01:38:39
Speaker
And then there's symmetric-key cryptography, which I guess is block cipher design and stuff. And people have high-level principles of what you want: you want, you know, diffusion and confusion and these kinds of things.
01:38:51
Speaker
But how do you end up with the particular design of something? You do something that feels like it makes sense, and then you run all the attacks and you realize, oh, you know, this has something that's wrong in some particular way; let's just change that piece of it so that it doesn't have that problem anymore.
01:39:09
Speaker
And then you do this for 20 years and you end up with AES. And like, for none of the individual components is there a reason why AES works. There's absolutely no formal proof of robustness to any kind of attack.
01:39:28
Speaker
There are proofs of why particular attack algorithms that we know of in the literature are not going to succeed. The design principles were inspired by these attack algorithms that we have. So you can show there is never going to exist a differential attack that succeeds better than brute force, there's never going to be a linear attack, and you can sort of write down these arguments.
01:39:46
Speaker
But there's nothing that says it works in general. It's just that by iterating on making the thing slightly more robust each time someone comes up with a new clever attack, you end up with something that's very, very good as a result.
01:39:57
Speaker
But there's no nice crystallizable mathematical assumption, like assuming factoring is hard, from which this security follows. There's nothing like that for symmetric key cryptography, but it works really well.
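As a concrete illustration of the confusion-and-diffusion idea mentioned above, here is a toy substitution-permutation round in Python. This is emphatically not a secure cipher and not how AES is actually specified; it only shows the structural pattern of keyed substitution plus bit permutation, repeated over many rounds.

```python
# Toy sketch of the confusion/diffusion round structure behind block ciphers.
# NOT secure, NOT AES; purely illustrative of the layered design.

S_BOX = [(7 * x + 3) % 16 for x in range(16)]  # toy 4-bit S-box (confusion)
PERM = [1, 5, 9, 13, 2, 6, 10, 14, 3, 7, 11, 15, 4, 8, 12, 0]  # bit shuffle (diffusion)

def round_fn(state: int, round_key: int) -> int:
    state ^= round_key                                      # mix in key material
    nibbles = [(state >> (4 * i)) & 0xF for i in range(4)]  # substitute each 4-bit nibble
    state = sum(S_BOX[n] << (4 * i) for i, n in enumerate(nibbles))
    bits = [(state >> i) & 1 for i in range(16)]            # permute the 16 bits
    return sum(bits[PERM[i]] << i for i in range(16))

def toy_encrypt(block16: int, round_keys: list[int]) -> int:
    for k in round_keys:  # stack many rounds; the strength comes from iteration
        block16 = round_fn(block16, k)
    return block16

print(hex(toy_encrypt(0x1234, round_keys=[0xA5A5, 0x0F0F, 0x3C3C, 0xFFFF])))
```

The design argument is exactly the empirical one described here: each component is chosen so that the known families of attacks (differential, linear) provably fail, not because there is a general proof of security.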
01:40:09
Speaker
Maybe this works in machine learning, where you just don't have any real understanding of why things work. You iterate against attackers and you end up with something that's robust. I think it's maybe harder in machine learning because in machine learning, you want the thing primarily to be useful and then also to be robust to attack.
01:40:28
Speaker
Whereas in cryptography, the only thing you want is for it to be robust to attack, and so it is a lot easier in that sense. The primitives you're working with are much simpler to analyze. There are lots of other things that change, but you know, it would not be without precedent; this isn't something that seems impossible. I just...
01:40:49
Speaker
I'm skeptical if only because I would prefer there to be something where we can point to a reason why it works, as opposed to just saying, well, empirically it is. But if you gave me the option between something that actually is robust but that we just can't explain, and nothing that works at all, of course I take the thing that works even though we can't explain it. I just would be worried that in the future we might be able to break it.
01:41:08
Speaker
And maybe that happens, maybe it doesn't. Yeah, that's really interesting. I had no idea about that. I don't know much about cryptography at all, but my probably not even identified, let alone questioned, assumption had been that there was a much more principled design process than the one you're describing. I mean, there was a bit of one; people spent 30 years breaking ciphers and learning a lot about what you actually have to do to make these things robust. But there's very careful analysis that goes into this.
01:41:36
Speaker
And so I don't mean to say that they just cobbled stuff together and hoped for the best. You have to think really, really hard about this. But the final thing that they came up with has no security argument against attacks in general, other than: here is the list of attacks from the literature, and proofs of why these attacks do not apply.
01:41:53
Speaker
And there's no assumption. Whereas, you know, in other areas of cryptography, you have a single assumption where you're going to assume, say, factoring is hard.
01:42:04
Speaker
And if factoring is hard, then here are the following things that are robust under that, and you end up with, it's not quite RSA, but something like it. You could also assume maybe discrete log is hard.
01:42:15
Speaker
There's a math thing where it's hard to take the logarithm in a discrete space. Maybe this is hard. If you believe discrete log is hard, then here are the algorithms that are robust. Or maybe you assume discrete log is hard over elliptic curves, and then you end up with a set of algorithms that work under this assumption.
01:42:30
Speaker
And for each of these algorithms, you can say, this is effective if and only if this very-simple-to-state property is true. And people really like that because you can very cleanly identify why the following algorithm is secure.
01:42:47
Speaker
And the way you do this is with some reduction. You say: if I can break the cipher, then the assumption is not true, and vice versa.
01:42:59
Speaker
But there is no similar argument to be made in most of symmetric cryptography or hash functions or something like this. It's just empirically, the field has tried for 25 years. The best people have tried to break this thing and have failed.
01:43:10
Speaker
Here are all of the attacks. Here are proofs that these attacks will not work. But maybe tomorrow someone comes up with a much more clever differential-like variant that instead uses, I don't know, multiplication or something crazy, and all of a sudden, all bets are off and you can break the thing. Such an attack doesn't exist yet, and this approach has worked there at least once. And so, you know, maybe it works in machine learning. I think drawing analogies to any other field always is fraught because the number of things that are different is probably larger than the number of things that are

Robustness in Machine Learning and Cryptography

01:43:37
Speaker
similar.
01:43:37
Speaker
But at least this has happened before. That's actually a perfect tee-up for another kind of intuition I wanted to see if you can help me develop, which is the relationship between robustness and other things we care about.
01:43:53
Speaker
My sense is that if you break the cryptography algorithm, it's broken and you can sort of, now you can access the secrets, right? It seems like a pretty binary sort of thing that you've either broken it and got through or not.
01:44:07
Speaker
Maybe. Okay, I'll quibble with you. So when a cryptographer says that they broke something, like if you have a paper: a cryptosystem is designed to be robust to an adversary who can perform some level of compute.
01:44:21
Speaker
So like, you know, AES-128 is an encryption algorithm with a 128-bit key. A cryptographer would tell you AES-128 was broken if you could recover the key faster than 2 to the 128 time.
01:44:34
Speaker
If you could do it in 2 to the 127 time, twice as fast but still past the heat death of the universe, people would be scared about the security of AES and probably start thinking of something else. So this is technically a break,
01:44:46
Speaker
and the reason why they do this is because attacks only get better. It turned out that if you can go from 128 to 127, you're now very scared: why not from 127 to 125? 125 is still way outside the realm of possibility, but if you got down to 2 to the 80, then, you know, now we're scared.
01:45:00
Speaker
And so there is a continuum here, but it is the case that cryptographers for the most part only use things that just are actually secure, and then start becoming very, very scared as soon as you get the very first weak break. But it still is the case that there exist weak breaks that are not complete breaks of the entire system.
01:45:20
Speaker
You know, that's, yeah, it's more similar than I had really conceived of. So that is quite interesting. And so, two to the 80, just the mental math there is like,
01:45:31
Speaker
each 2 to the 10 is about three zeros, so that would be about 24 zeros. So now we're kind of... If something was broken in 2 to the 80,
01:45:42
Speaker
the compute to do it would be roughly on the order of getting into frontier language model flops. Yeah, that's maybe a good argument. Yeah, I was like, you know, two to the 60, someone can do on their own machine if they're trying pretty hard.
01:45:55
Speaker
You know, if the problem is pretty easy or they try a lot. Two to the 70 is like, you need a bunch of work, but probably it can happen. And two to the 80 is, I guess a nation could probably do this if you really tried. The constant matters a lot here, because the difference between 2 to the 70 and 2 to the 80 is only about a thousand.
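A quick back-of-the-envelope check of the orders of magnitude being discussed; the guesses-per-second figures are illustrative assumptions, not measurements of any real attack.

```python
# Rough feasibility numbers for 2^60 / 2^70 / 2^80 work factors.
# Guess rates are assumptions for illustration only.

SECONDS_PER_YEAR = 3600 * 24 * 365

for exponent, guesses_per_sec in [(60, 1e9), (70, 1e12), (80, 1e15)]:
    total = 2.0 ** exponent
    years = total / guesses_per_sec / SECONDS_PER_YEAR
    print(f"2^{exponent} ~ {total:.2e} operations "
          f"~ {years:,.1f} years at {guesses_per_sec:.0e} guesses/sec")

# 2^80 is about 1.2e24, i.e. roughly 24 decimal digits, which is the ballpark
# Nathan gestures at for frontier-model training FLOPs. 2^128 is about 3.4e38,
# another factor of ~3e14 further out of reach.
```

Note that each step of 2^10 is a factor of about a thousand, which is exactly the point about the constant mattering: a 1000x change in attacker resources moves you one whole exponent tier.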
01:46:14
Speaker
And so if the problem is a thousand times more complicated, then maybe it becomes 2 to the 70 or 2 to the 90, whatever the case may be. This is, for example, why people typically use AES-256: it's so much bigger than this that even if the attack cost went down to 2 to the 100, we're still fine.
01:46:31
Speaker
But you know, this is just cryptographers being paranoid, and it's a good thing about the way that people do things. Generally these things are designed with large margins of security in mind to prevent this. But yeah, this is the reason why, in particular, people reach for AES-256.
01:46:44
Speaker
Well, that's educational for me. Going back to the AI domain, it does seem like it is maybe still just a lot messier in general. In, for example, the obfuscated activations paper that we talked about a little bit, they looked at: how does the performance of the model change if, for example, you need to evade a detector at just one layer of the model?
01:47:13
Speaker
I think that the toy problem was like, it's going to write SQL and we want to detect that it's writing SQL. That's what the detector is there for. But we wanted to get it to write SQL without setting off the detector.
01:47:24
Speaker
With just a single detector at one layer, they were able to do that with very minimal performance degradation. But then like the more detectors they developed and you know once they had one at every single layer, you could still evade the detector, but your SQL was shit.
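For readers who want the shape of the setup being described, here is a rough sketch of per-layer activation detectors; the probes are plain linear classifiers and every name here is illustrative, not the obfuscated-activations paper's actual code.

```python
import numpy as np

# Sketch: one detector (linear probe) per layer of hidden activations.
# An input is flagged if ANY layer's probe fires; evading all of them at once
# is what starts to cost the attacker task performance.

class LinearProbe:
    def __init__(self, weights: np.ndarray, bias: float, threshold: float = 0.0):
        self.w, self.b, self.threshold = weights, bias, threshold

    def fires(self, activation: np.ndarray) -> bool:
        return float(activation @ self.w + self.b) > self.threshold

def flagged(activations_per_layer: list[np.ndarray],
            probes_per_layer: list[LinearProbe]) -> bool:
    """Defense-in-depth style check across all layers."""
    return any(p.fires(a) for a, p in zip(activations_per_layer, probes_per_layer))

# Toy usage with random data standing in for real hidden states.
rng = np.random.default_rng(0)
acts = [rng.normal(size=64) for _ in range(12)]
probes = [LinearProbe(rng.normal(size=64), 0.0, threshold=8.0) for _ in range(12)]
print(flagged(acts, probes))
```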
01:47:37
Speaker
So there is some other kind of thing that's moving too, like the performance of the model might be getting worse. And then I'm also kind of trying to think about, again, what do we really care about?
01:47:51
Speaker
You know, the nature of the mistakes or the nature of the outputs also matters a lot too, right? It's one thing for me to get some toxic content. It's another thing for me to get, you know, the big bioweapon that everybody is kind of most fearful of.
01:48:06
Speaker
And so you need to both break the defense, but if you want to actually do real harm in the world, you need to also have the performance still be strong, and there just seem to be a lot of dimensions there. So I wonder, maybe I should just ask: what do we know at this point about the relationship between the robustness of these attacks and the sort of "so what" of an actual attack getting through?
01:48:33
Speaker
Because I did look at that chart and I was like, well, geez, if we do put a detector on every layer, then yeah, you can fool them, but you probably can't actually query the database. So in some ways, you know, you maybe are still okay, but that feels like something I'm just beginning to explore and I'm sure you know a lot more about it.
01:48:50
Speaker
Yeah, no, this is a good question. In part, this is how most secure systems are designed to be secure: just layers of security on top of layers of security. Defense in depth is the thing that works in practice.
01:49:01
Speaker
We were just talking about block ciphers, and the reason why those are robust is because they have many layers of this confusion-diffusion thing. You stack enough of them and the thing becomes hard to break: you can get past a couple layers, but you can't get deeper.
01:49:13
Speaker
It's a tried and tested thing in security that this generally works. I think, all else equal, we would of course prefer a system that just is robust, because you can write down why it's effective.
01:49:23
Speaker
But if you can't have this, having something that empirically has these 20 layers, and when they're put together in the right way it just turns out that the attacks don't work, is something that we would be happy to settle for.
01:49:35
Speaker
You know, I think there are reasons you might be concerned, just because maybe it turns out that someone can have an attack that bypasses all of them because it's fundamentally different in some interesting way.
01:49:47
Speaker
And we just have to accept this. And I don't know. I think like this is like one of the options we should be exploring a lot because, you know, maybe it turns out that full robustness is basically impossible and we just need to settle with this, you know, layers of detectors.
01:50:01
Speaker
And if it turns out you can still have robustness, then hey, detectors are still useful anyway. And so it's good to have done the work ahead of time to see. Is there any work on trying to minimize the harm of mistakes?
01:50:13
Speaker
Like for example, I have a Ring camera in my backyard that constantly alerts me that there is motion detected, and then that there is a human detected,
01:50:25
Speaker
and it frequently alerts me that there's a human detected when it in fact saw a squirrel. And so this has me thinking, you know, for actual real world systems, I don't necessarily care too much if it confuses a dog for a cat or a squirrel.
01:50:42
Speaker
But what I really want to know is if there's a human, you know? So is there any sort of relaxation of the requirements that would allow us to be like, you can get some small stuff wrong, but we're going to really try to defend against the big things.
01:50:58
Speaker
Yeah, I mean, in any detection thing, you just, you have to tune the false positive true positive rate, right? And you can pick whatever point you want on the curve. And the question is, what is the point that you pick on the curve?
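As a small illustration of that threshold-picking step, here is a sketch of choosing an operating point for a target false positive rate; the scores and labels are synthetic placeholders, not from any real detector.

```python
import numpy as np

# Sketch: pick the detector threshold that keeps false positives at or below a target.

def threshold_for_target_fpr(scores, labels, target_fpr):
    negatives = np.sort(scores[labels == 0])
    k = int(np.floor(target_fpr * len(negatives)))  # how many negatives may exceed the threshold
    return negatives[-(k + 1)] if k < len(negatives) else -np.inf

rng = np.random.default_rng(0)
scores = np.concatenate([rng.normal(0, 1, 1000),   # benign ("squirrel") scores
                         rng.normal(2, 1, 100)])   # true-positive ("human") scores
labels = np.concatenate([np.zeros(1000), np.ones(100)])

t = threshold_for_target_fpr(scores, labels, target_fpr=0.01)
tpr = (scores[labels == 1] > t).mean()
print(f"threshold={t:.2f}, detection rate at 1% false positives: {tpr:.0%}")
```

Moving target_fpr up or down is exactly the choice of point on the curve being discussed; nothing in the math tells you which point is right for a given product.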
01:51:10
Speaker
And I was reading some security paper a while ago that was looking at the human factors aspect of this particular problem. And it was arguing it's actually a better user experience to have false-positive human detections every once in a while.
01:51:28
Speaker
Because if you don't do this, then the person's like, well, my camera's not looking. But if you do do this, the person's like, oh, I feel a little better. Maybe it made a mistake, it was a little too aggressive, but I feel more confident now that it's going to find the actual human when there's a real human there.
01:51:42
Speaker
And so there's maybe some actual human factors reason why you might prefer to set the thresholds a certain way. Although in my case, I've tuned them out entirely now. So that's the flip side.
01:51:53
Speaker
We're all over the hump on that curve. Yes. This is the thing: there's lots of software that will report warnings in code. The compiler will give you warnings: you're doing this thing, it's not great.
01:52:05
Speaker
And once you get past 1,000 warnings, the person just disables all warnings. And it would have been good: here's the important thing it's going to warn you about. But because you had the 900 warnings that were not actually important, you've lost the actual important signals in all of this.
01:52:19
Speaker
And yeah, picking the right value is very hard. It's mostly a question of tuning the true positive / false positive rate on a defense-by-defense basis. Towards your question of, you know, can we just only catch the important things?
01:52:33
Speaker
It really depends on whether that's an easier problem. You know, in some cases it is, and you can actually be willing to tolerate a couple of mistakes on unimportant things in exchange for getting the important ones, when there's an obvious direction that is the important direction, and making mistakes in one direction is not harmful while making mistakes in the other direction is harmful.
01:52:49
Speaker
For example, say a classifier were to decide, I'm going to ban the user if they do this thing. There's one direction that is an irreversible action and the other one isn't, and so maybe you warn the person before you ban them or something.
01:53:04
Speaker
For your classifier that might give instructions for how to do this really, really dangerous thing, maybe you're a company and you decide that you're willing to tolerate some annoyance for users who have to rephrase their question, in order to make sure your classifier never gives the harmful thing.
01:53:20
Speaker
And this is just a decision that you have to make for most of these things. Most of them don't come for free. You're just trading off true positives for false positives. Would you conceptualize it differently if I asked about the relationship between robustness and, like, corrigibility?
01:53:39
Speaker
I'm thinking here of the recent alignment faking paper where it's like, you know, the basic setup is they tell Claude that you are now going to be trained to be purely helpful, which means you will respond to harmful requests once you go through this training.
01:53:55
Speaker
And then there's more nuance, but basically what they observe is that having been told that and then given a harmful request, Claude will sometimes reason that I should go ahead and do the harmful thing now so that I can keep my true desire to be harmless from being trained out of me.
01:54:17
Speaker
And I do feel like, again, without having formalized it, I do feel like there's a tension there where I'm like, I really want you to want to be harmless, but I don't necessarily want you to lie to me about what you're doing so that you can like subvert, you know, the training goal that I have for you.
01:54:32
Speaker
So do you see any like resolution to that tension or is that maybe just another one of these like tough facts about the world we're in? Yeah. No, I think one way to read this paper is here's a very nice jailbreak that works in like five to 10% of cases.
01:54:46
Speaker
I think, okay, what's the resolution here? A resolution is just the model saying, I'm sorry, I'm uncomfortable giving an answer to this question at all. Yeah, they did have a follow-up where they gave Claude the option to object, and it took that option the large majority of the time. Yeah, exactly, right.
01:55:03
Speaker
And so this is, you know, maybe the version of this that is easiest. That's why I don't know exactly what to say here. I feel like there almost certainly is a trade-off in how gullible the model is in many of these ways. But in some cases you want it to actually believe you, you know.
01:55:17
Speaker
I wouldn't want a model where it gave me some code and I said, I ran the code, it didn't work, and it said, no, it did, the code worked, your compiler's wrong. I would want the model to say, okay, let me go back and look; I believe you, you actually ran the code and I made a mistake.
01:55:31
Speaker
And... So, yeah, it's very hard. As in everything, everything's a trade-off in security. I don't know what the optimal point on the curve is, but I feel like there are usually these refusal behaviors that you can take as options. But if you're not doing this, then you have to pick a point on the curve where you're happy with the safety versus utility trade-off.
01:55:51
Speaker
So you mentioned defense in depth is kind of seemingly where this all ends. And that's been like the takeaway from probably 50 different conversations I've had about technology
01:56:02
Speaker
safety, security, and control over time. It's like, we're not going to get a great answer, more than likely; we're just going to have to layer things on top of one another until we stack enough nines, basically, that we can sort of proceed.
01:56:17
Speaker
I guess, you know, to do that, effectively, one thing that would really help would be to scale you, because we right now don't really know, like, we got a lot of people kind of proposing things and it's not clear how many of them work and you only have so much time to get after so many of them.
01:56:34
Speaker
And mostly it doesn't seem like they're really super robust, but maybe they do add, you know, a nine in a certain environment or what have you. I understand you're doing some work on trying to essentially scale yourself by bringing language models to the kinds of problems that you work on. So how is that
01:56:53
Speaker
going? So depending on when this comes out, this will either be online already or will be online shortly after. And I'm happy to talk about it in either case because the work's done. If someone manages to scoop me on this in seven days, then, you know, you deserve credit for it.
01:57:07
Speaker
So we have some experiments where we're looking at whether LLMs can automatically take adversarial example defenses, generate adversarial examples on those defenses, and break them. And the answer is basically not yet.
01:57:20
Speaker
The answer is a little more nuanced, which is: if I give a language model a defense presented in a really clean, easy-to-study, homework-exercise-like presentation,
01:57:35
Speaker
like I've rewritten the defense from scratch and I've put the core logic in 20 lines of Python, then I've done most of the hard work, and the models can figure out what they need to do to make the gradients work in many cases.
01:57:51
Speaker
But when you give it the real-world code, the models fail entirely. And the reason why is that the core of security is taking this really ugly system where no one understands what's going on and highlighting the one part of it that happens to be the most important piece,
01:58:12
Speaker
and removing all of the other stuff that people thought was really the explanation of what's going on but actually wasn't, and just bringing out that one part and saying, no, here is the bug. All bugs are obvious in retrospect, usually.
01:58:26
Speaker
And, you know, doubly so for the security ones. And so if you give the models the ability to look only at the easy-to-study code, then they do an entirely reasonable job; they know how to write PGD and these kinds of things.
01:58:37
Speaker
But if you dump them into a random Git repository from some person's paper that has not been set up nicely to be studied, and has a thousand lines of code and who knows what half of it's doing, they struggle quite a lot.
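Since PGD comes up as the canonical "easy once the setup is clean" attack, here is a standard L-infinity PGD loop as a sketch, assuming a PyTorch image classifier with inputs in [0, 1]; this is the textbook algorithm, not any specific paper's evaluation code.

```python
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, eps=8/255, alpha=2/255, steps=10):
    """Standard L-infinity PGD sketch: maximize the loss within an eps-ball around x."""
    x_adv = (x + torch.empty_like(x).uniform_(-eps, eps)).clamp(0, 1).detach()
    for _ in range(steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad, = torch.autograd.grad(loss, x_adv)
        with torch.no_grad():
            x_adv = x_adv + alpha * grad.sign()       # ascend the loss
            x_adv = x + (x_adv - x).clamp(-eps, eps)  # project back into the eps-ball
            x_adv = x_adv.clamp(0, 1)                 # stay a valid image
    return x_adv.detach()
```

The hard part described here is not this loop; it is finding, inside thousands of lines of unfamiliar code, the place where gradients are broken or masked so that a loop like this can be pointed at the right object.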
01:58:49
Speaker
And I think this is maybe one of the things that I'm somewhat concerned about, not only for this problem but just in general: oftentimes when we test models, we test them on the thing that we think is hard for humans, but that is not the actual thing that represents the real task.
01:59:08
Speaker
And I would like to see more people trying to understand the end-to-end nature of things. Maybe it probably was too premature to do that two years ago. MMLU was a great metric two or three years ago because models couldn't do it. But now that they're really good at just the academic knowledge piece, the thing where they often struggle is this: you're dumped into the real world and you have to do things.
01:59:31
Speaker
And I mean, you see this on many benchmarks where certain models will have really high accuracy on particular test sets, but then you put them in agentic tasks and like other models do a lot better.
01:59:42
Speaker
And there's a difference between these kinds of skills, and models don't yet have that. They have the one or two things they know how to do, but you put them in real code and they start to struggle quite a lot. And so, yeah, I'm hopeful that we'll start to see more of this. And more broadly, towards your question of how do we scale these kinds of attacks, I do feel like there's a lot of people who do these kinds of things now.
02:00:03
Speaker
I feel like five years ago, there were a small number of people who did these kinds of attacks, and now there's a lot of them. I think the Obfuscated Activations paper is a great paper, and I didn't have to write it. It's great that other people are doing it. They did a much better job than I would have had time to do, and so I'm really glad that they did this. I think it takes some amount of time in any new field to train the next group of people to learn to do the thing, and I feel like we're getting there. I don't think I'm uniquely talented in this space anymore.
02:00:34
Speaker
I think, more broadly, for getting other people to be good at this, it's mostly an exercise in practice and discipline. And we've only been trying to attack language models seriously for, I don't know, three years. As a community, there has been no one who entered their PhD starting out trying to attack language models and has graduated yet. So give us another couple of years and I feel like this will be a thing people know how to do.
02:00:54
Speaker
Now maybe you're worried that in five years we're not going to have time for a full PhD length of research on this topic. Who knows? Impossible to predict the future. But I feel like this is maybe the best that we can hope for from people who are entering the space.
02:01:07
Speaker
Do you have a sense for the sort of big picture considerations? When I imagine, you know, I don't know how hard you pushed on trying to optimize this, or did you go as far as collecting a bunch of examples of your previous work and painstakingly writing out the reasoning traces?
02:01:25
Speaker
If you had gone that far and it starts to work, you might imagine you're entering into a sort of relatively fast improvement cycle, at least for a while. Does that seem good? Like, on the one hand, it seems good maybe if we could sort of use it to evolve better defenses, but then that was also the Wuhan Institute of Virology thesis, right? And then the whole thing, well, I don't take a position on this, by the way, I'm being a little glib.
02:01:50
Speaker
So to be clear, I don't know what really happened there and don't claim to know, but at least plausibly, maybe it got away from them. Yeah, no, I mean, there is risk for this. You know, the very first serious example of a computer worm, the Morris worm,
02:02:05
Speaker
was by Robert Morris in 1988. And depending on who you believe, it was a lab leak where he was experimenting on this in his local computer lab and it accidentally got out, and then it took down basically the entire internet and people were panicked for a while.
02:02:24
Speaker
You know, with doing these kinds of attacks, you could do this, I think. One thing that is to be said, though, is I do not think there has been another example of a lab-leak-style Morris worm in the last 50 years for this kind of thing in security. You very rarely have people do this, because in security there is the knowledge of what you can do, and then there is the weaponization of it.
02:02:45
Speaker
And in most papers people write, they just don't weaponize it, in part because we have seen an example of what can happen afterwards. And, you know, maybe there is a world where things go too far and you do train the model to be the adversary, and you end up with, I don't know, WarGames or something.
02:03:06
Speaker
But for the most part, I think doing the research on showing whether or not the models have these abilities is probably just better to do, if only because most of these things we're building are, like, not that hard. If someone was malicious and wanted to do it, they would just do it anyway.
02:03:23
Speaker
like The things that we're doing are not like,
02:03:26
Speaker
You know, if I were to spend a year designing a special-purpose attack agent kind of thing, maybe that would not be a great thing to do, in the same way most security researchers don't spend a year designing malware.
02:03:39
Speaker
That's gone too far. But if you just put minimal work in in order to prove the proof of concept is easy, this is important to do, to show people how easy it is, because the people who know it's easy are not going to write the papers and say it's easy.
02:03:51
Speaker
And so you want to have some idea of what are the kinds of things that anyone can do, so that you can then go and get your defenses in place. That's my current feeling on many of these things.
02:04:04
Speaker
I'm open to changing my mind in the future depending on how these language models go. I don't know, but at least as long as things behave roughly like the world we're living in now, I feel like designing these things to construct attacks is not going to by itself cause harm.
02:04:18
Speaker
If for no other reason than that most systems today are not insecure because of adversarial examples; whether or not adversarial examples are easy is independent of their security.
02:04:29
Speaker
And so even if I had something that happened to be superhuman at this, it actually wouldn't cause much harm. You know, if this were to change, and if these things were to get a lot better, and if someone were to create an autonomous agent that could magically break into any system, and if it sort of had the desire to do this in some way, whatever this means, then maybe this becomes bad.
02:04:53
Speaker
I am not as worried about this now for a number of reasons, but I could be convinced that this was a problem in a year or two if things rapidly improve and start getting a lot better. I'm open to changing my mind. I don't currently believe that this is the case, but three years ago I never expected that we'd be where we are with language models now. And so it's entirely conceivable that in three years I should change my mind. And so I'm trying to tell people that you might think that this is impossible,
02:05:20
Speaker
And it is fine to say, if it's impossible, then everything should be fine. But it should be just fine to make the statement, if the model was superhuman in almost every way, then I'm willing to do things differently than I do now. And I think that the likelihood of this is very small.
02:05:35
Speaker
But if it does happen, then I'm entirely willing to change my mind on these things. Yeah, staying open-minded, I think, is super important. The whole field is kind of the dog that caught the car on this. And you know who knows how much more could be coming pretty quickly.
02:05:50
Speaker
I wanted to ask, too, just because it is kind of, at least in the public awareness, the current frontier of locking down language models: the Anthropic eight-level jailbreaking contest.
02:06:06
Speaker
I think one person maybe just got through the eight levels. Did I see that correctly? I haven't checked recently; let me fact-check myself on that. But before I do, one thing that was not super intuitive to me about that was: why are they focused on a single jailbreak that would do eight different attacks?
02:06:28
Speaker
And is that really...
02:06:32
Speaker
Yeah, I don't know. It seems like if you could do one, that's plenty to be concerned with, right? So why? It seems like a pretty high bar that they've set out for the community there. Yeah, but obviously, examples like this are also very hard.
02:06:43
Speaker
I feel like, you know, it's an entirely reasonable question to ask: what do we actually want to be achieving with this? Maybe, I mean, on one hand, robustness to jailbreaking attacks is a really hard problem. So making partial progress is good, even if it's only in small areas. You know, look at adversarial examples.
02:07:02
Speaker
In adversarial examples, we have spent almost 10 years, I mean more than 10 years now, trying to solve robustness to an adversary who can perturb every pixel by at most eight over 255 on an L-infinity scale.
02:07:16
Speaker
You know, why? This exact problem does not matter; no real adversary is limited to that, what about nine over 255? But we set the task because it is a well-defined problem that we can try and solve, anything else that we really want is strictly harder, and it has been a problem for a very long time. So let's try and solve this one particular adversarial example problem in the hopes that we'll have learned something that lets us generalize to the larger field as a whole.
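Just to pin down how concrete that threat model is, the whole constraint fits in a couple of lines; eps is the 8/255 bound from the benchmark convention, and the arrays stand in for images scaled to [0, 1].

```python
import numpy as np

# The L-infinity threat model in one check: a perturbed image is "allowed" only if
# no pixel moved by more than eps from the original.

def within_linf_ball(x: np.ndarray, x_adv: np.ndarray, eps: float = 8 / 255) -> bool:
    return bool(np.max(np.abs(x_adv - x)) <= eps + 1e-12)
```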
02:07:39
Speaker
So I don't know why they developed this exact challenge in the way that they did, but if I had designed it, one reason I might have done that is I might have assumed that jailbreaks seem really, really hard.
02:07:52
Speaker
It doesn't seem possible that we're going to solve the problem for all jailbreaks all at once. But the problem of a universal jailbreak, one that I could just share with all my friends, that they could immediately copy and paste and apply to all of their prompts, that allows this easy proliferation of jailbreaks,
02:08:07
Speaker
that is a real concern, and I might want to stop at least this very limited form of threat and not make it be so easy. And I'm willing to say straight up that this is not going to solve the entire problem completely, but it will make some things harder for some people.
02:08:21
Speaker
And I think this is a reason you might do it. But on the other hand, you're also entirely right that even if you could solve this problem, this does not solve the entire problem as a whole, because, yeah, I can just find my jailbreaks one by one. And this is true, but you know, that's a better world to live in than the world we're living in today.
02:08:41
Speaker
You know, this is the world we live in, and most of computer security is: someone has to try very hard to exploit your one particular program. There's not a sequence of magic bytes that makes every program break. If that were the case, we would be much worse off.
02:08:54
Speaker
And so I'm very glad that they're doing this. I'm also very glad that they're doing this as an open source, well, not open source, but open challenge, calling on everyone to be willing to try and test this. They're not just asserting, we have the following defense and it works. They actually, I think, legitimately do want to know whether or not it actually is effective, and they would be happy to have the answer be no and have someone try and break it.
02:09:15
Speaker
If it has been broken recently, I haven't seen that. Did you see? So Jan, like I said, reported that someone, I guess an individual person, broke through all eight levels, but not yet with a universal jailbreak.
02:09:29
Speaker
So one person broke all eight, but not with the same prompt. Great. Okay. Yeah, that's good to know. So I think what this tells you, for at least this defense, is that it can be broken at the very least right now for any individual component. And, you know, if what you were guarding was national secrets, this is probably not good enough.
02:09:49
Speaker
But, you know, in order to prevent this kind of thing: in security, we have the concept of a script kiddie, which is just someone who's not very skilled and just copies scripts that someone else put online, and just uses them to go cause havoc.
02:10:04
Speaker
And they're not able to develop anything new themselves, but they sort of have the tools in front of them and they can run them. And you could be worried about this in language models, where someone just has access to the forum that has the jailbreak that will jailbreak any model, they copy and paste that, and then they go and do whatever harm they want.
02:10:20
Speaker
This would not be very good, and we should protect against this. Partial progress is always helpful even if you haven't solved the problem completely. It's important to remember that this is the problem we're setting out to solve, which is only a subset of the entire problem.
02:10:31
Speaker
We're not going to declare victory once we've succeeded here and we're not going to put all of our effort into solving this one subset of the problem. We're going to try and solve everything at the same time too. But the partial progress is still progress.
02:10:44
Speaker
One thing you said there that I thought was really interesting, and maybe you can just expand on it a little bit, is really wanting to know. I feel like so much of human performance in these domains kind of comes down to: did you really want to know or not?
02:11:01
Speaker
Any reflections on the importance of really wanting to know? Yeah. I mean, okay, so some people ask, why is it so much easier for someone who didn't publish a defense to attack it than for the person who initially built it?
02:11:14
Speaker
And I think like maybe half of the answer is because it's very hard after you've spent six months building something that you really want to work to actually change your frame of mind and be like, now I really want to break it.
02:11:27
Speaker
Because what do you get if you break it? You don't get a paper, because no one accepts a paper that is like, here's an idea I had, by the way, it doesn't work. I feel like people really want to believe in their ideas, and they feel really strongly that their ideas are the right ideas. That's why they worked on the paper. They spent six months on this.
02:11:43
Speaker
Asking that same person to then, okay, now completely change your frame of mind, imagine this is someone else's thing, you really need to want to break it: that's a hard thing to do. And I think part of why it's beneficial to have lots of different people do these security analyses of things is because you want to know the answer. I feel like this is a thing that people in security have had to deal with for a very long time, and this is why companies have red teams and these kinds of things, because they understand that
02:12:14
Speaker
there is a different set of skills for security and there is a different set of incentives, and so it's valuable to split these things up. It might be hard for one person to do this, but an organization can at least put in place the right principles that can make this be possible sometimes, if it's done correctly. And it seems like for the most part, for most companies who are doing this kind of work, they actually do want to know the answer. I've talked with most of the people at most of these places, and for the most part, the security people are trying their best to get these things as good as possible. And they're trying to set these things up in the right way because they actually don't know the answer and they want to figure it out.
02:12:50
Speaker
I certainly have, over time, been very impressed by Anthropic in particular when it comes to repeatedly doing things that seem like they really want to know. And this does seem you know very consistent with that.
02:13:04
Speaker
How do you think about this, if we circle back to maybe the hardest question that I think is going to face the AI community writ large over the next, honestly, I don't think it's going to be a super long time, but timelines may vary.
02:13:20
Speaker
The hardest question being, is there any way to avoid the concentration of power that comes with just a few companies having the weights and it being totally locked down and nobody else being able to do stuff?

Advocacy for Open Source to Prevent Power Concentration

02:13:34
Speaker
versus the problems that seem likely to arise if we open source GPT-5 level models that have all these, you know, different threat models.
02:13:46
Speaker
It seems like we don't have any way right now to square the circle of locking down the open source models. And, given everything we've just walked through in all this detail, it seems hard to imagine that you could even set up some sort of mandatory testing or whatever that would give you enough confidence, you know, that then it would be okay.
02:14:09
Speaker
Because it seems like, yeah, did you really want to know, and who did you have do it? And you've got the sort of bond-rater problem that we've seen in credit ratings, with the agency that's getting paid by the model developer to do the certification. We've got all these problems.
02:14:24
Speaker
Where does that leave us? I mean, it seems like we're just in a really tough spot there. Is there any way out or any recommendation, or is the bottom line just simply that if you open source stuff, you have to be prepared for the fact that the fullness of its capability probably will be exposed?
02:14:39
Speaker
Yeah, I don't know. This is a very hard question. This is a question I think it'd be great for you to ask some of these people who think about the societal implications of this kind of work. I think the thing I want these people to understand is that, at least right now and presumably for the near future, we probably can't lock these things down to the degree that they would want.
02:15:00
Speaker
I'm very worried about concentration of power in general. I feel like open source up until today has only been good for computer security and safety in general. And because of that, it would take a lot to convince me that open source isn't beneficial.
02:15:16
Speaker
I'm not going to say it's impossible that I wouldn't change my mind, you know, if things continue exponentially. And if you did give me the magic box that could break into any government system in the world, that anyone could have if they have the tools, I probably would say that shouldn't be something we distribute to everyone in the world.
02:15:33
Speaker
But, you know, maybe if you have this, now you can defend all your systems, because you run the magic box and you just find all the bugs and you patch them, and now all of a sudden you're perfectly safe. Like, I don't know, it's very hard to reason about these kinds of things. In any world that looks noticeably like the world we're living in today, I think open source is objectively a good thing and is the thing I would be biased towards until I see very compelling evidence to the contrary.
02:16:02
Speaker
I can't tell you what the evidence would be that would make me believe this right now, but I'm willing to accept that this may be something that someone could show me in some number of years. I don't know if your timelines are, you know, two or 20 years.
02:16:14
Speaker
This may be a thing that I'm willing to change my mind on, and I will definitely say that now. But yeah, I think as long as the people who are making these decisions understand what is technically true, I trust them more than I trust myself to arrive at an answer that's good for society, because that's what they're experts on. I'm an expert on telling them whether or not the particular tool is going to work.
02:16:36
Speaker
And would it be fair to say that what is technically true is that the reason open source has been safe to date, and the reason it might not be safe in the future, is really a matter of just the raw power of the models? Like, in the open source context at least, we don't have reliable control measures or anything of that sort to prevent people from doing what they want.
02:17:04
Speaker
What we have right now is just models that just aren't that powerful and so they're not that dangerous even though people can do what they want. But if one thing flips and the other doesn't, we could be in a very different regime.
02:17:15
Speaker
Yeah, this is true. So in the early-to-mid '90s, the U.S. government decided that it was going to try and lock down cryptography. It was going to rule that encryption algorithms were a weapon,
02:17:28
Speaker
and exporting cryptography was a weapons, munitions export. It was very, very severe: you cannot do this. And the reason they were concerned is because now anyone in the world has literally military-grade encryption that cannot be broken by anyone. They can talk to anyone in secret.
02:17:44
Speaker
Governments won't be able to break it. So this is like a weapon, a national secret; you can't export it. This is why early encryption algorithms that were exported in the very early web browsers were 40 bits, because that was the limit: you could export 40-bit cryptography or less, but not higher than 40-bit cryptography.
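To see why 40 bits stopped being a meaningful barrier, a quick feasibility estimate; the guess rate is an illustrative assumption for commodity hardware, not a benchmark.

```python
# Why 40-bit "export grade" keys are no obstacle today (illustrative numbers only).

keyspace = 2 ** 40            # about 1.1e12 possible keys
guesses_per_second = 1e9      # assumed rate for ordinary modern hardware
seconds = keyspace / guesses_per_second
print(f"{keyspace:.1e} keys at {guesses_per_second:.0e}/sec "
      f"is about {seconds / 60:.0f} minutes of brute force")
```

By the same arithmetic, a 128-bit key has a keyspace 2^88 times larger, which is why the argument shifted from whether strong cryptography could be broken to whether it should be distributed at all.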
02:18:03
Speaker
And... There was a whole argument around: is giving every random person who wants it access to a really high level of encryption actually just a weapon that they can use to have secret communication that literally no one in the world can break? I think it was objectively a good thing that we decided that we should give everyone in the world high levels of encryption.
02:18:25
Speaker
Yes, it supports the ability of terrorist cells to communicate in a way that can never be broken. But the benefit that every person in the world can now have a bank account that they can access remotely without having to go in, and you have payments online, and you have dissidents who can now communicate again with perfect security, like...
02:18:45
Speaker
it was objectively worth that trade-off. And the people at the time who were making the decisions were not thinking about all of the positive possible futures, and were thinking only about this one particular bad thing that might happen.
02:18:58
Speaker
I have no idea how this calculus plays out for language models. It really depends on exactly how capable you think they're going to become. I don't know what that's going to be, but I think I'm particularly worried about concentration of power, just because this is something that does not need to assume super advanced levels of capabilities. Imagine that models get better than they are now, but don't become superhuman in any meaningful way. I think concentration of power is a very real risk, and I would really be advocating for open source in that world.
02:19:27
Speaker
Because otherwise, you end up with some small number of people who have the ability to do these things that no one else can do. And this is not a thing that needs to assume some futuristic, malicious model kind of thing. We know some people in the world just like to accumulate power, and so I'm worried about that.
02:19:42
Speaker
And so just distributing this technology is very, very good for that reason. But I think it's very hard to predict what will happen with better models. I think that's why this question is harder than the one for cryptography, because it was very clear at that time what the limit was: it gives everyone perfect encryption. We don't know what the consequences might be, but you're not going to get more perfect encryption than perfect encryption.
02:20:06
Speaker
Whereas with these models we have now, I don't know. Maybe they continue scaling, maybe they don't. Maybe they scale, but it takes 50 years and that gives society time to adapt. Maybe they scale and it takes two years and society doesn't have time to adapt. I don't know where this is going. I think this is why I want these policy people to be thinking about this more than I am.
02:20:24
Speaker
Because I think as long as they're willing to think through the consequences of this, these are the kinds of people who I think can give you better answers than I can. Because, yeah, there are reasons to believe that the answer might be we should be concerned, and there are reasons to believe the answer should be we should obviously distribute it to as many people as possible. And I think if anyone were to say that one of these answers is obviously, objectively right with 100% confidence, they're way overconfident.
02:20:52
Speaker
I tend to be biased based on what has been true in the past. Some people tend to be biased based on what they think might be true in the future. I think these are reasonable positions to take as long as you're willing to accept that the other one might be also correct.
02:21:02
Speaker
And so we'll see how things go. I think the next two years will give us a much better perspective on how all this stuff is going. Hopefully it will become much clearer whether these things are working out. I think if they do, we'll learn very fast that they are. And if they don't, then the limitations probably will start to hit in the next couple of years, and that should give us a lot more clarity. The question is, is that too late? I don't know.
02:21:22
Speaker
Ask a policy person these questions. Yeah, no easy answers. But the one thing I do always feel pretty comfortable and confident asserting is it's not a domain for ideology.
02:21:34
Speaker
It is really important to look the facts square on and try to confront them as they really are. So I definitely appreciate all the hard work that you've put in over the years, lighting that path and showing what the realities in fact are. And I really appreciate all the time that you've spent with me today bringing it all to light. So anything else you want to share before we break? Could be, you know, a call for collaborators or anything.
02:22:01
Speaker
No, I don't have anything in particular. Yeah, I want more people to do good science on these kinds of things. I think it's very easy for people to try and make decisions when they're not informed about what the state of the world actually looks like. And my biggest concern is that we will start having people make decisions who are uninformed in important ways, and as a result, we'll have to regret the decisions that we made. I'd rather that, if all the decisions are made based on the best available facts at the time,
02:22:31
Speaker
that's really the best you can hope for. Maybe you made the wrong decision, but you looked at all the knowledge. The thing I'm worried about is that we will have the knowledge and people will just make decisions independent of what is true in the world, based on some ideology of what they want to be true in the world.
02:22:45
Speaker
And as long as that doesn't happen, maybe we get it wrong, but at least we get our best shot. Well, you keep breaking things and figuring out what's true out there. And maybe we can check in again before too long and do an update on what the state of the world is and try to make sure policymakers are aware.
02:23:02
Speaker
For now, that'll be great. This has been excellent. Nicholas Carlini, prolific security researcher at Google DeepMind. Thank you for being part of the Cognitive Revolution. Thank you for having me.