Oops!Zencastr was unable to start because Javascript is disabled
To fix this problem, check your browser's settings and enable Javascript

Become a Creator today!Start creating today - Share your story with the world!

00:00:00

00:00:01

SE Radio 703: Sahaj Garg on Low Latency AI

E703 · Software Engineering Radio - the podcast for professional software developers

0 Plays4 months ago

In this episode, Sahaj Garg, CTO of wispr.ai, joins SE Radio host Robert Blumen to talk about the challenges of building low-latency AI applications. They discuss latency's effect on consumer behavior as well as interactive applications. The conversation explores how to measure latency and how scale impacts it. Then Sahaj and Robert shift to themes around AI, including whether "AI" means LLMs or something broader, as they look at latency requirements and challenges around subtypes of AI applications. The final part of the episode explores techniques for managing latency in AI: speed vs accuracy trade-offs; speed vs cost; latency vs cost; choosing the right model; reducing quantization; distillation; and guessing + validating.

Recommended

SE Radio 720: Martin Dilger on Understanding Eventsourcing image

SE Radio 720: Martin Dilger on Understanding Eventsourcing

E720 · Software Engineering Radio - the podcast for professional software developers

00:55:40·4 days ago

SE Radio 719: Birol Yildiz on Building an Agentic AI SRE image

SE Radio 719: Birol Yildiz on Building an Agentic AI SRE

E718 · Software Engineering Radio - the podcast for professional software developers

00:54:01·11 days ago

SE Radio 718: Will Sentance on JS Modernization image

SE Radio 718: Will Sentance on JS Modernization

E718 · Software Engineering Radio - the podcast for professional software developers

00:58:42·18 days ago

SE Radio 717: Eric Tschetter on Decoupling Observability image

SE Radio 717: Eric Tschetter on Decoupling Observability

E717 · Software Engineering Radio - the podcast for professional software developers

01:00:12·25 days ago

SE Radio 716: Martin Kleppmann Local-First Software image

SE Radio 716: Martin Kleppmann Local-First Software

E716 · Software Engineering Radio - the podcast for professional software developers

00:55:13·1 month ago

SE Radio 715: Sahaj Garg on Designing for Ambiguity in Human Input image

SE Radio 715: Sahaj Garg on Designing for Ambiguity in Human Input

E715 · Software Engineering Radio - the podcast for professional software developers

00:48:01·1 month ago

SE Radio 714: Costa Alexoglou on Remote Pair Programming image

SE Radio 714: Costa Alexoglou on Remote Pair Programming

E724 · Software Engineering Radio - the podcast for professional software developers

00:51:26·1 month ago

SE Radio 713: Héctor Ramón Jiménez on Building a GUI library in Rust image

SE Radio 713: Héctor Ramón Jiménez on Building a GUI library in Rust

E713 · Software Engineering Radio - the podcast for professional software developers

00:59:14·1 month ago

SE Radio 712: Dan Lorenc on Sigstore image

SE Radio 712: Dan Lorenc on Sigstore

E712 · Software Engineering Radio - the podcast for professional software developers

00:39:03·1 month ago

SE Radio 711: Scott Hanselman on AI-Assisted Development Tools image

SE Radio 711: Scott Hanselman on AI-Assisted Development Tools

E711 · Software Engineering Radio - the podcast for professional software developers

01:02:14·2 months ago

SE Radio 710: Marc Brooker on Spec-Driven AI Dev image

SE Radio 710: Marc Brooker on Spec-Driven AI Dev

E710 · Software Engineering Radio - the podcast for professional software developers

01:03:32·2 months ago

SE Radio 709: Bryan Cantrill on the Data Center Control Plane image

SE Radio 709: Bryan Cantrill on the Data Center Control Plane

E709 · Software Engineering Radio - the podcast for professional software developers

01:05:07·2 months ago

SE Radio 708: Jens Gustedt on C in 2026 image

SE Radio 708: Jens Gustedt on C in 2026

E708 · Software Engineering Radio - the podcast for professional software developers

00:59:31·2 months ago

SE Radio 707: Subhajit Paul on ERP Automation and AI image

SE Radio 707: Subhajit Paul on ERP Automation and AI

E707 · Software Engineering Radio - the podcast for professional software developers

00:59:46·3 months ago

SE Radio 706: Yechezkel "Chez" Rabinovich on Observability Tool Migration Techniques image

SE Radio 706: Yechezkel "Chez" Rabinovich on Observability Tool Migration Techniques

E706 · Software Engineering Radio - the podcast for professional software developers

00:39:15·3 months ago

SE Radio 705: Murat Erder and Eoin Woods on Continuous Architecture image

SE Radio 705: Murat Erder and Eoin Woods on Continuous Architecture

E705 · Software Engineering Radio - the podcast for professional software developers

00:57:53·3 months ago

SE Radio 704: Sriram Panyam on System Design Interviews image

SE Radio 704: Sriram Panyam on System Design Interviews

E704 · Software Engineering Radio - the podcast for professional software developers

00:45:01·3 months ago

SE Radio 702: Derick Schaefer on Modern CLIs image

SE Radio 702: Derick Schaefer on Modern CLIs

E702 · Software Engineering Radio - the podcast for professional software developers

00:48:08·4 months ago

SE Radio 701: Max Guernsey, III and Luniel de Beer on Readiness in Software Engineering image

SE Radio 701: Max Guernsey, III and Luniel de Beer on Readiness in Software Engineering

E701 · Software Engineering Radio - the podcast for professional software developers

01:01:49·4 months ago

SE Radio 700: Mojtaba Sarooghi on Waiting Rooms for High-Traffic Events image

SE Radio 700: Mojtaba Sarooghi on Waiting Rooms for High-Traffic Events

E700 · Software Engineering Radio - the podcast for professional software developers

01:07:45·4 months ago