
Making Apache Kafka Diskless (with Filip Yonov & Josep Prat)

Developer Voices
1.2k plays · 2 hours ago

How do you retrofit a clustered data-processing system to use cheap commodity storage? That’s the big question in this episode as we look at one of the many attempts to build a version of Kafka that uses object storage services like S3 as its main disk, sacrificing a little latency for cheap, infinitely-scalable disks.

There are several companies trying to walk down that road, and it's clearly big business - one of them recently got bought out for a rumoured $250m. But one of them is actively trying to get those changes back into the community, and is pushing to make Apache Kafka speak object storage natively.

Joining me to explain why and how are Josep Prat and Filip Yonov of Aiven. We break down what it takes to make Kafka’s storage layer optional on a per-topic basis, how they’re making sure it’s not a breaking change, and how they plan to get such a foundational feature merged.

Announcement Post: https://aiven.io/blog/guide-diskless-apache-kafka-kip-1150

Aiven’s (Temporary) Fork, Project Inkless: https://github.com/aiven/inkless/blob/main/docs/inkless/README.md

Kafka Improvement Proposal (KIP) Articles:

Support Developer Voices on Patreon: https://patreon.com/DeveloperVoices

Support Developer Voices on YouTube: https://www.youtube.com/@developervoices/join

Filip on LinkedIn: https://www.linkedin.com/in/filipyonov

Josep on LinkedIn: https://www.linkedin.com/in/jlprat/

Kris on Bluesky: https://bsky.app/profile/krisajenkins.bsky.social

Kris on Mastodon: http://mastodon.social/@krisajenkins

Kris on LinkedIn: https://www.linkedin.com/in/krisjenkins/

Transcript

Warpstream's Innovative Approach and Industry Impact

00:00:00
Speaker
Exactly a year ago, we had Ryan Worl on the show talking about Warpstream, which is a company that had executed a very cloud-age trick. You take a database or a data storage tool, in their case Apache Kafka, and you copy the same idea, but you replace the storage layer with S3.
00:00:20
Speaker
Why would you do that? Because you'll lose something in latency, but in return, you get cheap, automatically replicated, practically infinite disk storage.
00:00:32
Speaker
From a technical point of view, that's an interesting trade-off. What took me by surprise is quite how appealing it turned out to be as a business idea. Less than a year later, they sold that company for, reports vary, but around a quarter of a billion dollars.
00:00:50
Speaker
Billion with a B. Which kind of makes you wonder, are there any other Apache projects you could copy but use object storage? That's a question for another day. The question for today is, given those kinds of numbers, it's not really surprising there's a wave of other companies doing the same Kafka-but-on-S3 model.
00:01:09
Speaker
It's clearly big business. What is surprising is that one of them is trying to get that feature merged back into the original open source project as an open source feature.

Replication of Kafka on S3 Model

00:01:21
Speaker
That raises some questions. First up, why would you do that? Why would you give up what seems to be a big competitive business edge? It can't just be altruism. Then you've got the host of technical questions. How do you do it?
00:01:35
Speaker
How do you take a clustered, replicated, database-like thing and then retrofit a completely different way of storing the data without breaking backwards compatibility?
00:01:47
Speaker
And even if you can crack that, then you've got the socio-technical problems of getting a major feature change merged into a long-established Apache formalized project.
00:01:58
Speaker
There are many hurdles. Joining me to be grilled on those hurdles, on the why, how and with whom questions, are two people from the company attempting to do it: Filip Yonov and Josep Prat of Aiven.
00:02:11
Speaker
Now before we begin, you need to know this episode has been sponsored by Aiven, so let me tell you what that means and what it does not mean. They haven't had any control over the questions. They didn't suggest questions. They didn't get a list of questions beforehand.
00:02:25
Speaker
I think if you're a regular here, you know these episodes don't have scripted questions. It's the death of conversation. It's never going to happen. I did offer them a review copy after we recorded, but before this was published.
00:02:38
Speaker
They declined, but I did offer. The main change is I let them choose the publication date.

Sponsorship and Editorial Independence

00:02:43
Speaker
I've got about five months worth of episodes recorded. Sponsorship lets you jump that queue.
00:02:50
Speaker
I'm comfortable with that. I hope you are. But the only way you can really judge is if we get

Exploring Kafka's Replication Model

00:02:54
Speaker
started. So let's do that. I'm your host, Kris Jenkins. This is Developer Voices. And today's voices are Filip Yonov and Josep Prat.
00:03:13
Speaker
Joining me today are Filip and Josep. How are you doing, gentlemen? Pretty good. Yeah, quite good. I'm really going to pick your brains on this one, so I hope you're feeling sharp. There's a lot to get into.
00:03:26
Speaker
So I was thinking about this, right? I was trying to dig up in my memory Kafka, replication, persistent storage, all this stuff. I was trying to go through in my mind how it works. And I think it's pretty similar to what you'd build for any distributed replicated system, right?
00:03:46
Speaker
In that, here I am. I would like to write some data. And I want to write it in a way that's fault tolerant and durable. So I have three machines, let's say, and one of them is the leader.
00:03:59
Speaker
I talk to the leader. I give it some data. It writes that to its own disk and to its followers. And when they acknowledge, it acknowledges back to me.
00:04:10
Speaker
Right. And then that's durability, because I've got three copies. If one of them fails, there's a new leader election. So that's high availability. And the only thing I've missed out, I think, is how do I find the leader? So there's some brief conversation beforehand where I say to the coordinator, who's the leader?
00:04:30
Speaker
That's who I'll talk to, to write my data. Have I got that roughly right? Right. Pretty much right, yes. Good, right, OK. I mean, the only thing you can add as well is if you want to have the durability, you might want to put these three different nodes in different availability zones.
00:04:48
Speaker
So if there is a fire on the data center, you might want to have only one of them going down instead of the

Economic Factors Driving Diskless Kafka

00:04:54
Speaker
three of them. Right. Yes. Yeah. I dream of running a company large enough to have that problem, but there are plenty of them. Theoretically speaking, that's what you would like to do, right? You would like to have three nodes that each keep a copy of the data you're sending, in completely isolated areas of the world, or at least isolated enough that if there is any catastrophe, it doesn't affect all three of them at the same time.
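The write path described above can be sketched in a few lines of Python. The class and method names here are illustrative, not Kafka's actual internals: a producer writes to the leader, the leader forwards to its followers, and it acknowledges only once every copy exists.

```python
# Sketch of a leader-based replicated write (illustrative, not Kafka's code).

class Broker:
    def __init__(self, name, zone):
        self.name = name
        self.zone = zone          # availability zone, for durability spread
        self.log = []             # stands in for the on-disk log

    def append(self, record):
        self.log.append(record)   # "written to disk"
        return True

class Leader(Broker):
    def __init__(self, name, zone, followers):
        super().__init__(name, zone)
        self.followers = followers

    def produce(self, record):
        # Write locally, then replicate; ack only when every copy confirms.
        self.append(record)
        acks = [f.append(record) for f in self.followers]
        return all(acks)          # acks=all semantics

# Three brokers spread over three different availability zones.
f1 = Broker("b2", "eu-west-1b")
f2 = Broker("b3", "eu-west-1c")
leader = Leader("b1", "eu-west-1a", [f1, f2])
assert leader.produce({"key": "k", "value": "v"})   # durable: three copies
```

If the leader dies, a follower already holding the full log can be elected leader, which is the high-availability half of the story.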
00:05:19
Speaker
Yeah, yeah. Okay, but that setup I've described already exists today in Kafka. So I think the first question has to be, and I'll address this to Filip, what's wrong with that? Why meddle with it?
00:05:34
Speaker
Well... I am less technical than Josep, so I'll actually take a step back: how do I explain to myself how replication works today? How the hell are those megabytes being spread around to achieve this high durability? So basically,
00:05:52
Speaker
what we just described is fine if you're sitting in a single availability zone in the cloud. It doesn't really cost anything to move data there. But the moment you actually want to distribute your brokers in three distinct availability zones to achieve maximum durability, you effectively start paying for the replication four times.

Benefits of Diskless Kafka

00:06:13
Speaker
So the model is basically the following.
00:06:16
Speaker
In the default three-zone distribution setup, data crosses availability zones through those four paths. So let's take it. Producer to leader. You already mentioned that you need to connect with the leader.
00:06:28
Speaker
Then the leader to the first follower availability zone, which is kind of like the broker. Then you have leader to second follower and then leader to consumer. So effectively...
00:06:40
Speaker
If you turn that hose up, it's almost like a multiplication factor on your bill, because in AWS, which is one of the more taxing clouds, we are talking about data in and data out. So these four paths are getting taxed one cent each on ingress and egress.
00:06:58
Speaker
So you can imagine that this kind of like starts blowing up very quickly. And there is a very significant difference in price between data within an availability zone and across an availability zone.
00:07:10
Speaker
Absolutely, yes. One of the things which we discovered, and by the way, the reason we discovered it is that Aiven is one of the oldest managed Kafkas on the market, and this didn't really exist back in the day when they were setting up Kafka to begin with.
00:07:25
Speaker
We discovered that the inter-AZ tax, or fee, in AWS, even though it's a list price of one cent, is actually counted twice, on data in and data out. And obviously, to achieve the durability guarantees, we always cross. So we always egress and then ingress.
00:07:45
Speaker
Right. So are we then talking about eight parts? Do I pay...? No, I'm paying once per hop, and I get the response for free at least. Okay. Yeah. Just checking. Yeah. Yeah.
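As a back-of-envelope check on the numbers being discussed: the $0.01/GB each way is the AWS-style list price mentioned above; real prices vary by region and over time, and the sustained throughput below is an assumed figure for illustration.

```python
# Back-of-envelope for the inter-AZ "replication tax" in a three-zone,
# replication-factor-3 Kafka setup (assumed list prices: $0.01/GB each
# for data out and data in across AZs).

EGRESS_PER_GB = 0.01   # sending side of a cross-AZ hop
INGRESS_PER_GB = 0.01  # receiving side of the same hop

# The four cross-AZ paths described in the conversation:
paths = [
    "producer -> leader",
    "leader -> follower 1",
    "leader -> follower 2",
    "leader -> consumer",
]

# Each hop is billed on both sides, so one produced GB costs:
cost_per_gb = len(paths) * (EGRESS_PER_GB + INGRESS_PER_GB)
print(f"cross-AZ cost per GB produced: ${cost_per_gb:.2f}")

# At an assumed 100 MB/s sustained, that compounds quickly:
gb_per_month = 0.1 * 60 * 60 * 24 * 30   # ~259,200 GB
print(f"monthly inter-AZ transfer bill: ${gb_per_month * cost_per_gb:,.0f}")
```

That multiplication factor on every byte produced is the "problem behind the problem" the guests come back to later.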
00:07:56
Speaker
Okay. So this is mainly about the sheer operational cost once you split over different availability zones. Yeah. I see how that would justify a colossal engineering effort, which I want to get into.
00:08:10
Speaker
That's where the technical juice is. Yeah, so basically, if we actually start speaking and look at the motivation...
00:08:18
Speaker
So basically, when we talk about the motivation, it all started with a white paper I wrote internally at Aiven roughly two years ago, where we were studying the cost and the footprint of our fleet, understanding how our costs of Kafka and the FinOps were actually panning out, simply because we wanted to understand the real gross margin of each of our customers, right? So this is how it started.
00:08:44
Speaker
The real motivation behind it, it was economic. At this time, we already started seeing the newcomers. For example, Warpstream was one of the first companies out there which kind of proposed a new design, a fundamentally new design.
00:08:57
Speaker
And this kind of coincided with the numbers we were running, so it was a very much an economic motivation. Let's make our Kafka cheaper. Can we grow our gross margins?

Concept and Open Source Strategy

00:09:07
Speaker
But the more we dug into it with the engineering team over the last year, the more benefits we started uncovering from delegating the replication flow, or this replication mechanism which we just discussed, to the point where I found myself the other day talking about diskless Kafka.
00:09:26
Speaker
Cost was only priority number three or four in most of those conversations. It kind of came after the fact. People were far more interested in the elasticity and some of the other ergonomics which this delegation produces.
00:09:39
Speaker
Yeah. Okay, yeah, because we've all had that problem where disks fill up, right? Yeah. And that is another nice... Because I'm going to say, just for the record, I mean, the word serverless I used to get stuck on, and I feel myself getting stuck on the word diskless again. But the point is not that there's no disk. The point is the disk is someone else's problem. Correct. When it fills up, I don't have to worry about it. Yeah, exactly.
00:10:06
Speaker
The problem with serverless was not that I'm not running on any server, right? It was that the server is not my problem anymore. It's somebody else's. It's the same as the cloud and computers. They are not my computers anymore, but they are still computers, or data centers. So it's the same thing with disks.
00:10:23
Speaker
Diskless doesn't mean there are no disks whatsoever, but rather that you don't need to worry about disks. Yes, and disk-headache-less doesn't trip off the tongue as easily, right?
00:10:34
Speaker
It's a funny story how we actually came up with the name. Josep, maybe you can tell this one. Okay, so we had a code name for this project internally.
00:10:46
Speaker
And for the code name, knowing that Apache Kafka has its name from Franz Kafka, who was a writer, we decided to take a spin on that one. And what is the core thing that writers used back then? It was ink.
00:11:03
Speaker
So what we said is this project will be called Inkless. Inkless, okay. Because it's the same thing as Kafka writing without ink: it would use a new mechanism to persist its main writings.
00:11:19
Speaker
So we said, okay, what if Kafka itself wouldn't use disks? So the disks were the ink in this analogy. And when we needed to upstream the proposal, we thought, okay, what can we use that is not inkless?
00:11:34
Speaker
And we wanted to play with the -less part, and we were always saying that ink is like the disks for Apache Kafka, so we ended up saying, why not diskless? And that's how we got to that place.
00:11:48
Speaker
Okay, I get it. So here we are with a version of Kafka where the disks aren't your problem.
00:12:01
Speaker
That you're trying to open source, right? Now, this is this is a commercial angle I don't understand because um but Warpstream, you mentioned, they recently sold for a very large amount of money. Reports vary, but the very large is consistent.
00:12:18
Speaker
so that one Kafka company could have diskless storage. We were at the artist formerly known as Kafka Summit last week, and that had, I don't know, something like four or five companies who were saying they could do diskless Kafka.
00:12:35
Speaker
So there's obvious money and commercial interest in developing object storage for Kafka. Why are you trying to open source it? I mean, it's great that you are, but why? What's the angle?
00:12:47
Speaker
I will speak first on the commercial motivation, and I know Josep probably will have something to add there, but we were thinking very long about how we would approach this diskless mechanics, right?
00:13:02
Speaker
So the core breakthrough here is delegating the replication mechanism to the object storage itself, right? That was kind of the core idea. The how was actually quite unique to us, simply because when we were discussing, someone raised a hand in the room and said, hey, we at Aiven, we are not running a fork of Kafka.
00:13:26
Speaker
We are running Kafka. We are Kafka. So our version of Kafka is whatever Kafka's version actually is in the open source. So why would we try to fork and split the protocol even further, when the very thing which makes us successful as a business would be suffering, right?
00:13:46
Speaker
So we went several rounds, and we came back with the conclusion that if we donate this core primitive back to the community, and obviously if it's accepted and everything goes well, right?
00:13:59
Speaker
this will help us go forward, simply because our model is relatively simple. We believe that if the price is good and the quality is high, a lot of businesses will choose us to manage their Kafka, right? So it's not just about the features, but ease of use and ergonomics, reliability, partnership and all this.
00:14:21
Speaker
So this is like the core motivation for us. ah We don't see any value in withholding a specific feature. There will be no enterprise version of this. Let me put it this way, right? there is no point for us to actually have this.
00:14:36
Speaker
And you said Josep had an angle on this too. So, historically speaking, I've been the director of the open source program office at Aiven. And the fact that Aiven was putting so much emphasis on open source already

Future of Kafka with Cloud Storage

00:14:52
Speaker
tells you something about it.
00:14:54
Speaker
The mission that we have at Aiven has varied through the years, but it has always, except for a brief period of time, contained open source in it. And the idea has always been to be the open source data platform out there.
00:15:09
Speaker
Now, it's the AI-ready open source platform for you. So what we always do is we don't try to keep people in our platform, in our projects, just because they kind of run what we run elsewhere or that we are...
00:15:24
Speaker
sequestering them and not letting them choose the best provider for the technologies they need. We don't believe that's fair. And that's how the business has been made since the very beginning.
00:15:36
Speaker
And obviously what we did was, what if we don't need to maintain that fork, a hypothetical fork? Maintaining a fork is extremely complicated and hard.
00:15:47
Speaker
and no matter what you do and no matter how many people you dedicate to it, you will always miss parts. You will always have merge conflicts. You will always have struggle and reconcile these two walls that are- You will always be behind the leading edge.
00:16:01
Speaker
Correct. Yeah. So you need to make the decision. So all these companies we've seen at Current, previously known as Kafka Summit, you could see that they were running a version of Kafka fixed at a point in time.
00:16:18
Speaker
if you are lucky, because some of them are complete rewrites just using the external API of Kafka. That means they always need to play the catch-up game with Apache Kafka. And what we thought was, what if we do it exactly the opposite way? Why fork at some point in time, stay there, and have to play the catch-up game, or start from an empty shell that is the Kafka API and start implementing things,
00:16:46
Speaker
Why don't we start from Kafka itself and we elevate Kafka to speak the new cloud primitives?

Addressing Cloud Replication Costs

00:16:54
Speaker
And I think that's what that's the first step in the path probably.
00:16:57
Speaker
Kafka knows disks, and disks are an essential part of Kafka. The concepts are baked right into the broker. What if we elevate Kafka to be ready to learn and use those cloud primitives that are now available to us?
00:17:16
Speaker
This makes me wonder, I was gonna bring this up later, but this maybe this is the right point to do it, in that we are entering this world where disk is infinite, cheap and abstracted, right?
00:17:30
Speaker
Do you think these days, if Kafka were a new project starting today, that it would start with that kind of storage and maybe add in SSDs as a later feature?
00:17:44
Speaker
It's a philosophical question, and I'll give you my philosophical answer. When we were trying to explain to the wider group at Aiven what we are doing with diskless topics, right? And what's our approach here?
00:17:58
Speaker
I started with, as I call it, the problem behind the problem. Kafka's replication is not broken. It's actually been a very, very robust mechanism, which made Kafka so popular.
00:18:12
Speaker
It's great, to the point where I still do not see very much use in active-active clusters in the cloud. It's just so good at retaining data and providing the durability guarantees that you seldom see the need to go the extra mile, right? However, the problem behind the problem is that this very replication model is just taxed.
00:18:33
Speaker
The cloud has figured out a way to make it more expensive, with the ingress and egress and the inter-AZ fees. Now, what we are trying to do here with the flexible replication path, or delegating the replication path to the object store, is just making sure that Kafka can survive this pricing model, this cloud economics model.
00:18:59
Speaker
It's not like trying to introduce a better replication, if that makes sense. Right. Yeah. This is seeming like the implementation is a big technical challenge, but the motivation is not technical.
00:19:14
Speaker
In fact, from what you're saying about open sourcing it, the aim is to try and keep the technical edge as small as possible. Well, not only that. What if tomorrow, for example, EBS gets magical durability guarantees and cross-regional and cross-zonal replication at a fraction of the cost?
00:19:36
Speaker
We should be able to take advantage of this. Kafka should be able to take advantage of this. This lift-and-shift model from the data center... and, make no mistake, Kafka survived two data center generations.
00:19:50
Speaker
it's having a very hard time surviving the cloud

Seamless Integration of Diskless Kafka

00:19:53
Speaker
economics, right? So because of the overall model, we figured out that if we power Kafka with flexible replication, this will open up the doors for what's even streamable: not only the ergonomics we will gain, not only the elasticity, not only the cost, but maybe we can actually unlock this latent demand from customers and from high-scale streaming which was held back because of the economics, or held back because Kafka just becomes expensive for certain things.
00:20:23
Speaker
Yes, yeah. It's related to something I think of as Tustane's law: if you halve the price of something, it becomes 10 times more appealing. Hopefully this is the end result. Yes, exactly. Yeah. Okay, so let's dig into the technical part of how you do this.
00:20:40
Speaker
Because I was thinking through, on the back of an envelope, how I would change Kafka to optionally use either the existing block storage or object storage, right?
00:20:53
Speaker
That's hard. A large 10-year-old open source project where you want to not break anything but completely rewrite the way it stores data? I'm trying to come up with a naive headline summary that we can talk about, but I'm lost. How do you begin to implement that without breaking things?
00:21:16
Speaker
So we had to clearly picture where we wanted to start. We wanted to start from the open source project, and we said we are now creating a fork just to run our experiments there.
00:21:29
Speaker
The fork was cut from trunk. At that point in time, 4.0 was in the making. So basically our baseline was Kafka 4.0.
00:21:40
Speaker
And we said we want to have minimal merge conflicts. Those are basically the goals we set for ourselves. And I know that sounds awesome, and every single project should be like that.
00:21:51
Speaker
But then it was, OK, now let's try to make it happen. So then we went on a discovery journey, trying to find out, OK, what do we need to change, and where do we need to change it? Because Kafka has a thousand million layers where you could inject something.
00:22:04
Speaker
And by the way, after we threw our KIP out there, there have already been two more KIPs, Kafka Improvement Proposals, that are applying the same mechanism... not the same mechanism, but basically they are bringing the same capability to Apache Kafka in one way or another, which is talking to object storage. That means there was a need for that.
00:22:26
Speaker
We just needed one crazy group of people who took the first step, and then some others followed. So what we first looked at was tiered storage.
00:22:37
Speaker
Because that's the natural place to look at it. Tiered storage is already dealing with tiers. It's already talking, via plugins, to object storage, for example.
00:22:49
Speaker
So it was the right place to start looking. Because you've got this system, if I can explain this briefly. So you've got this system where the recent data stays on SSDs and the old data gets archived off to object storage. And that came in maybe two, three years ago?
00:23:06
Speaker
Exactly. It came in 3.6 as an early access feature. And I think it was roughly one and a half years, two years, something like that. Time is a blur. Sometimes when you speak about versions, I know exactly the version. It's Kafka 3.6.0. So I can tell you that clearly. That's a better timestamp than a timestamp, right? Exactly. It's exactly that one. I know it by heart.
00:23:31
Speaker
So what we thought is, can we do something in there? The problem is, even if you aggressively tier, you will still have local disk, and you will still need the cross-AZ replication, because the data that is not yet moved to object storage needs to be persistent and durable, and it's guaranteed to be durable only via disks.
00:23:57
Speaker
So then you need the replication. So no matter how aggressively you tier, for each single byte that comes in, you still need to pay several times for the replication.
00:24:08
Speaker
So that was not the solution. And then, starting to mess with the active segment was quite a heavy thing to do. And when you start adding bigger latencies to something that is expected to have almost no latency,
00:24:24
Speaker
you start thinking, I'm asking for trouble. So probably that's not the way we need to look at it. What we did was go several levels up. We decided to do something which is: can we do it opt-in or opt-out per topic? Can we create a new type of topic
00:24:42
Speaker
that will follow the classical rules of Kafka as we know

Balancing Costs and Latency with Object Storage

00:24:47
Speaker
them. But in the same cluster, can I create a new type of topic where, whenever I write there, I'm not following the normal laws of Kafka, but am using the object storage
00:24:58
Speaker
as a native tool? That means that quite high up in the broker, in the decisions the broker makes to write, persist, modify and operate on every single operation, we made a divergence there by extending existing interfaces and reusing as much code as we could, moving a couple of things around, and writing not as much code as one might expect. So we didn't need to rewrite the whole broker. We just needed to write some thousands of lines of code to have all these mechanisms, including
00:25:35
Speaker
some organizational components and the plugins to write to the object storage. So what we did was branch out. If you are in these specific types of topics, then you need to do certain things differently: replication is useless for you. You don't need to do replication.
00:25:54
Speaker
Forget about that one. Treat it as a replication factor of one, right? Now we can follow down, and then instead of talking to the synchronous APIs that write to disk, you talk to the asynchronous APIs that talk to S3, or to any object storage that you use. That was our way of thinking: how can we make that parallel path so that no matter how long it takes to merge this project upstream, it will always be an easy path?
00:26:30
Speaker
Because the code that adds this capability sits quite isolated on the side, and it only extends the right interfaces. And there is not much duplication of code, just a tiny bit that can be solved by refactoring Kafka code in the API approach.
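A minimal sketch of the per-topic branch being described. The names here are purely illustrative, not Kafka's actual internals: the real change extends existing broker interfaces, rather than adding one literal if-statement, but the decision it encodes is the same.

```python
# Illustrative sketch: classic topics keep the leader-based, replicated
# write path; diskless topics hand durability to the object store.

class ClassicLog:
    """Stands in for today's leader-based, follower-replicated log."""
    def __init__(self):
        self.entries = []
    def append_and_replicate(self, batch):
        self.entries.append(batch)       # disk write + follower replication
        return "acked-by-replicas"

class ObjectStore:
    """Stands in for S3 or any object storage; it replicates internally."""
    def __init__(self):
        self.objects = []
    def put(self, batch):
        self.objects.append(batch)       # one PUT; durability is delegated
        return "acked-by-object-store"

def handle_produce(topic_config, batch, classic_log, object_store):
    if topic_config.get("diskless", False):
        # Diskless topic: no leader, no follower replication. Treat it as
        # replication factor 1 and let the object store provide durability.
        return object_store.put(batch)
    # Classic topic: the unchanged synchronous, replicated write path.
    return classic_log.append_and_replicate(batch)

log, store = ClassicLog(), ObjectStore()
assert handle_produce({"diskless": False}, b"txn", log, store) == "acked-by-replicas"
assert handle_produce({"diskless": True}, b"click", log, store) == "acked-by-object-store"
```

The point of keeping the branch this isolated is that both topic types coexist in one cluster, and the classic path is untouched.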
00:26:48
Speaker
So you are keeping the existing mechanism entirely. Correct. You're not going into existing code and changing it. That makes sense. I'm trying to think now how this will play out, because I write some data to a transactions topic, and that's on my usual disk-backed storage. So that just goes down one branch of the magical if statement.
00:27:14
Speaker
Nothing's changed. Now I'm writing something less important, click streams, a clickstream topic. And that I want on object storage, because it's just cheap archive data.
00:27:26
Speaker
So now that's going into something which isn't going to be a leader. It's putting that data to object storage.
00:27:39
Speaker
The first problem I see with that, and there are going to be lots of problems, but the first one that comes to mind is: I don't want to write to object storage every time someone hands me some data, because I'm paying per PUT, right?
00:27:51
Speaker
Correct.
00:27:54
Speaker
So I think you're going to say something about batching, and then I'm going to say something about reliability if it crashes mid-batch. Explain that to me. So that's exactly what we're going to discuss right now. So one thing you don't want to do is: I'm getting one single message, and I put that single message up there. It doesn't make sense.
00:28:15
Speaker
It would be economically, or astronomically, expensive. Right, yeah. Let's forget about that one, right? So that didn't work. So what we wanted to do then is group them, a.k.a. batch them. If I want to sound fancier than "we group them", I'll say we batch them.
00:28:31
Speaker
Okay. It's the same thing in the end. So what we do is group it either by time or by size, whatever happens first. These two thresholds are configurable. However, we did some studies and analysis on how much time or how much size makes sense, and where the value hits diminishing returns, where it makes no sense to increase or decrease it.
00:28:58
Speaker
So either expenses go up too fast, or at some point they go down so low that it makes no sense to change the value. And we found that 250 milliseconds and 8 megabytes are...
00:29:09
Speaker
roughly the right numbers, where the more you add, the less benefit you get. And if you make them smaller, for general use cases, each use case on its own obviously, you might get cost increases that are not linear. So it's, say, logarithmic.
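The time-or-size batching just described can be sketched like this. The 250 ms and 8 MB defaults are the figures quoted in the conversation; the class itself is an illustrative stand-in, and a real implementation would also flush on a background timer even when no new records arrive.

```python
# Sketch of time-or-size batching for object storage: buffer records and
# flush when either threshold is hit, whichever comes first.

import time

class Batcher:
    def __init__(self, flush, max_age_s=0.25, max_bytes=8 * 1024 * 1024):
        self.flush = flush            # called with one batch per object PUT
        self.max_age_s = max_age_s    # quoted default: 250 milliseconds
        self.max_bytes = max_bytes    # quoted default: 8 megabytes
        self.buf, self.size, self.opened = [], 0, None

    def add(self, record: bytes):
        if not self.buf:
            self.opened = time.monotonic()   # batch starts with first record
        self.buf.append(record)
        self.size += len(record)
        # Flush on whichever threshold trips first. (A real implementation
        # would also need a timer so an idle batch still flushes on age.)
        if self.size >= self.max_bytes or \
           time.monotonic() - self.opened >= self.max_age_s:
            self._drain()

    def _drain(self):
        self.flush(self.buf)          # one object PUT for the whole batch
        self.buf, self.size, self.opened = [], 0, None

puts = []
b = Batcher(puts.append, max_bytes=10)   # tiny size threshold for the demo
b.add(b"hello")
b.add(b"world")                          # 10 bytes total -> size flush
assert puts == [[b"hello", b"world"]]
```

Amortizing many records into one PUT is what keeps the per-request object storage pricing from dominating the bill.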
00:29:29
Speaker
Right, yeah. So that's one of the things we do. But what we realized as well is that a consequence of delegating everything to the object storage, as in the storage and the replication factor, is that we can go without leaders.
00:29:48
Speaker
Because Amazon, or whoever, is doing your leader for you, essentially. Exactly. So I'm writing to S3. S3 will make sure that thing is replicated. That's the one that does the leader-replica work. And they have that one, and they have plenty of nines.
00:30:05
Speaker
Yes. It's very battle-tested by now. So why do I, in my project, need to do extra work that will increase the cost of my operation, when the place where I want to write it already does it for me?
00:30:20
Speaker
It makes no sense. I'm paying double for the same concept. And we shouldn't be doing that. Yeah, I'm with you so far, but I see two problems with that. I'm writing to something which is talking to S3 that...
00:30:35
Speaker
yes, that could crash and lose my data which hasn't gone to S3. Also, it could crash and then I have no leader, right?
00:30:46
Speaker
Correct. And if it crashes, so let's think about the first one. What happens if it crashes before the data gets persisted and stored?
00:30:58
Speaker
By default, how we configure it to work, and by the way, we are working on improving that piece because that's one of the weakest points right now, but we are working on making it slightly better.
00:31:12
Speaker
Right now, we have acks=all, basically. So what you do is you wait until it's replicated before you say that it's written, right? That's how replication works when you say, I want all acks received.
00:31:30
Speaker
I'm sending you a message, and only when every replica has it, so with a replication factor of three, only when all three brokers say it's written, will I tell you back that it's written.
00:31:43
Speaker
So we behave the same way as with acks=all: I'm writing, and only when it's written in S3 can I tell you that it's written. Obviously, that creates a lot of lag.
00:31:54
Speaker
So as the client, it's my job to make sure I've got hold of that data and can resend it until I've got that ack, which is exactly the same as the previous version. Exactly. So that behaves the same way. If there is any problem in between, I cannot ensure that it's durable. So as a producer, as a client that produces the data, I need to retry.
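That client-side contract, hold on to the record and resend until acknowledged, can be sketched like this. `send` is a stand-in callable, not a real client API:

```python
def produce_with_retries(send, record, max_retries=5):
    """Sketch of the producer's responsibility described above: keep hold of
    the record and resend until the broker acknowledges the write.
    `send` is a stand-in: returns True on ack, may raise on broker failure."""
    for _ in range(max_retries):
        try:
            if send(record):
                return True     # broker (and, behind it, S3) confirmed durability
        except ConnectionError:
            pass                # broker died or timed out before acking: retry
    return False                # give up; the caller must treat the write as lost
```

This is exactly the behavior real Kafka producers already implement; diskless just stretches the time until the ack arrives.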
00:32:17
Speaker
It's the same behavior with a different timeline. What we're working on as well is whether we can add mechanisms for accepting acks=1, so that it acknowledges once it's persisted on your own node, which gives you really not many guarantees, because if the node dies, then the data is lost.
00:32:39
Speaker
It's not been replicated yet. So you know that it's probably there, unless there is some problem. We could replicate that mechanism the same way, as in: once you push it there, we hold that data to batch it and push it to S3.
00:32:55
Speaker
But we can tell you that, as long as it's in our memory, it's in there.

Scalability and Flexibility in Diskless Kafka

00:33:00
Speaker
It's as risky, or slightly riskier, than what we do right now with acks=1.
00:33:07
Speaker
Actually, I just want to highlight something which we found out after the fact here, pun intended. acks=1 is probably the most used configuration out there. So most workloads today in the world actually revolve around acks=1,
00:33:28
Speaker
with some doing acks=all, right, and very few acks=0. When we were going into this discovery, we thought there were far more acks=0 workloads out there, but it actually turned out that there are not. Even metrics and logs and some other things nowadays require at least the leader to say, yeah, I got the data, you know, proceed.
00:33:50
Speaker
So this is one of the key discoveries and the focus of the next research step for us here. Right, yeah. So just to recap that, make sure I've understood it: you've got this mechanism where you can say, I want to wait until I've got confirmation from all three nodes that my data has been written before you tell me it's been written.
00:34:10
Speaker
That's one configuration thing. The other is: as long as someone's written it, that's fine. Yeah, correct. And that's acks=1 versus acks=all. That's basically the difference between those. And then acks=0 basically means, yeah, sure, I don't really care that much. I'll be generous and say that's the Erlang approach. Throw the message across, and if it crashes, it crashes. It will crash nicely and gracefully, with a lot of fireworks.
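The three acknowledgement levels being recapped map roughly to this decision. A toy model, not the broker code:

```python
def is_acknowledged(acks, replicas_persisted, replication_factor=3):
    """Toy model of Kafka producer acknowledgement levels, as recapped above.
    acks='0': fire and forget; acks='1': one copy persisted;
    acks='all': every replica persisted. In diskless topics, 'all replicas
    persisted' effectively collapses into 'the batch landed in object storage'."""
    if acks == "0":
        return True
    if acks == "1":
        return replicas_persisted >= 1
    if acks == "all":
        return replicas_persisted >= replication_factor
    raise ValueError(f"unknown acks setting: {acks}")
```
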
00:34:37
Speaker
But you said, about two minutes ago, the difference here is lag, and that seems to be an important difference. Correct. So if you wait on acks=all, for example, on any normal, classic Kafka setup, you would need to wait until that message gets into one broker,
00:34:57
Speaker
it's written, goes back; another one, goes back. So you need to wait until those messages go back and forth, simplifying a little bit. But that's what you need to wait for. And that's network traffic, so it should be reasonably fast.
00:35:13
Speaker
But now if we are considering that we need to group something, and then we need to talk to S3, we're talking about two sizable lagging operations.
00:35:23
Speaker
Yeah, you said 250 milliseconds-ish on the batch, and writing to S3 can be variable, but let's call it another 250? Exactly. Roughly that, right? Yeah. So what happens is: first, if we remove the batching, you still would need to pay the 250 to S3 no matter what. But then if you do the batching, you can start doing the maths and work out the median case, the best case, the worst case.
00:35:48
Speaker
So in the worst case, you need to wait the full 250 milliseconds, and then you pay, let's say, another 250 writing to S3, and then you get back. The counter-intuitive aspect of that one is that the higher the throughput, the less you need to wait.
00:36:05
Speaker
So the bigger the throughput you have, the bigger the system you have, the less delay you're going to incur waiting to accumulate. Is that because of the 8 megabytes you were saying? So if you fill up to 8 megabytes, you don't bother waiting the full 250 milliseconds. Yeah, yeah.
00:36:23
Speaker
So, counterintuitively: when you think about it, you'd expect the smaller it is, the better it works, and the bigger it is, the bigger the delays. That actually is not the case with this particular instance. It's kind of reversed.
00:36:39
Speaker
With really small and tiny workloads, you will need to wait lots of time because you will never hit the eight megabytes, right? So you will always need to wait the full time because you are not filling up the bucket in that sense.
00:36:53
Speaker
The bigger the load you have, the faster you get the bucket full, so it makes sense and it's worth already pushing it to S3 or to Google Cloud Storage or any other object storage.
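The "higher throughput means less waiting" point comes down to a little arithmetic: the batch ships when either 8 MB fills or 250 ms elapses, so the batching delay is the smaller of the two. A sketch, using the figures from the conversation:

```python
def batching_delay_s(throughput_bytes_per_s,
                     max_delay_s=0.250,
                     max_bytes=8 * 1024 * 1024):
    """Worst-case time a record waits in the batch before being shipped:
    whichever fires first, the size threshold or the time threshold."""
    if throughput_bytes_per_s <= 0:
        return max_delay_s              # nothing fills the bucket: pay the full wait
    time_to_fill = max_bytes / throughput_bytes_per_s
    return min(max_delay_s, time_to_fill)
```

At 1 MB/s the bucket never fills in time, so you always wait the full 250 ms; at 100 MB/s the 8 MB fills in about 80 ms, so the time threshold never fires.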
00:37:06
Speaker
And actually, the unit economics of diskless are also weird, and this is dictated by the cloud itself: at very low throughput, say a couple of megabytes a second, you're always better off with the classical replicated Kafka.
00:37:21
Speaker
And at a certain point is when you actually start getting more of the benefits. So it's almost an inverse curve in this sense, simply because it's not great at low throughput. It gets better the bigger the throughput is, if you tune it properly. So this is one of the discoveries we also had: it's really made for bigger workloads to begin with.
00:37:47
Speaker
It works very naturally at those higher levels. That sort of makes an intuitive sense, because you wouldn't generally use object storage for tiny bits of data, right? It's for storing chunks, not scraps.
00:38:04
Speaker
Yeah. Also, usually what you would like to keep in there is all that massive amount of data that comes in, as you said before: all the click data, all the tracking devices, for example. IoT data, that's also one example that I often share. When we go into massive IoT, it's not just machines in my home, but rather the whole country, scattered around on the windmills, for example. Those ones sometimes have no signal.
00:38:36
Speaker
They don't send any messages. They don't send anything. They only send when they can reach a signal, and then they send a huge burst of data. So what it comes back to is: for those use cases, which are massive in the amount of data that needs to be sent and processed, the question we asked people was, do you really need that low latency end-to-end, or do you actually not care?
00:39:03
Speaker
And for most of the use cases, it was: no, we actually don't care, because either we look at the data a couple of hours later, or we already group it in five, 10, 15 seconds.
00:39:14
Speaker
All the metrics, all the data from those 10, 15 seconds, we group them. We don't want to see every single data point; we've grouped them already in our visualization tools. So yeah, we don't care that much if there is some sort of delay.
00:39:30
Speaker
We just want to have this huge, massive amount of data at a cheap price, basically. We don't want to pay a premium for that expensive replication mechanism when we can rely on cheaper versions of it. We just want to have that work there.
00:39:45
Speaker
And that's one use case where it shines, and where we can see that, yes, it has a price, even though we're working on optimizing and trying to make sure that the price you need to pay in latency is as low as it can be.
00:39:58
Speaker
But there are use cases where, even unoptimized, it fits like a glove, because the whole pipeline is already dealing with latencies two orders of magnitude bigger than the ones we're talking about.
00:40:13
Speaker
Yeah, yeah, I can see that. I would imagine in real-world use cases, it's pretty clear-cut which of the two sides of the coin you fall on, right?
00:40:23
Speaker
Most people... Most people will either be putting through the kind of quantities of data where it's unambiguously better to put it on object storage.

Dual Storage Model for Diverse Data Needs

00:40:32
Speaker
Whereas some people have use cases that are unambiguously, this is going to have to be hot disk. Exactly. And and that's why we decided to say, whenever we build this thing, we need to keep both topics in the same broker.
00:40:44
Speaker
Because we thought, why do you need to have two different clusters, one for the low-latency ones, one for the high-latency ones? Why does the person producing data suddenly need to understand my infrastructure topology, to know specific details that probably shouldn't even be surfaced to the one producing the data? That should be a configuration value for the one setting up the topics.
00:41:11
Speaker
And having those two types of topics in the same broker, or in the same cluster, makes it easy to say: this one is diskless, this one is also diskless, this one stays classic. And eventually you could even migrate from one to the other.
00:41:31
Speaker
Why not? I mean, from classic to diskless, you could potentially do those things. Obviously, migrating from diskless to classic begs the question of how we transport the data that's in there. You could download it, you could MirrorMaker it yourself, but that's slightly trickier in that sense.
00:41:51
Speaker
I'm going to guess that we're going to have some system where, at least for the first release of this, you say: just copy the data over to a new topic and then migrate that way. Correct. And for that, we already have MirrorMaker, which does this wonderfully. So you have a topic and you can say, please replicate that topic to the same cluster.
00:42:13
Speaker
And because it's a logical replication, you can go between different types of topic storage. Correct, yeah. Yeah, yeah, okay. Okay, you've made me think of a very thorny question. I'm just going to hold that, because when you mentioned windmills and wind and reliability, I was thinking: does it make a big difference if you have bursty traffic?
00:42:35
Speaker
Imagine my windmills only send data when there's wind, or I'm a shop in the United States and things are naturally quieter at night. Do you run into different usage patterns or problems when one segment is going to be 10 clicks and the next segment is going to be 10,000?
00:42:56
Speaker
So one of the beauties of that one as well is: how would you do that in classic Kafka? You would need to bring up maybe two or three more brokers, and then they would need to align themselves, rebalance, and make sure that they are in sync, right? And then they could start doing work.
00:43:17
Speaker
We don't have the need for rebalancing; we don't have the need for being in sync as a replica. You could basically say, I need more throughput, I can spin up another broker, I can split the partitions and handle those.
00:43:34
Speaker
Or what I can do is actually just leave it as it is. Because, as I said before, the higher the throughput, the lower the latencies, as long as you can process all the messages that are coming.
00:43:51
Speaker
The only thing you will see is probably a better end-to-end latency when you have the burst, compared to when you have a really quiet moment. Oh, yeah. Yeah, okay. I can see that. That's one of the consequences. Obviously,
00:44:06
Speaker
we will need to see which numbers we're talking about, right? What's the baseline number, what's the burst, and how many orders of magnitude bigger it is, because if it's exploding we might need to grow the cluster, obviously.
00:44:21
Speaker
But the thing is that now you have the capability to add new brokers into the cluster almost immediately. The only thing you need to wait for is the Kafka broker to stand up.
00:44:37
Speaker
There is no data synchronization. You just have it there, ready to consume and produce. This thing where, when you start a new broker, it has to get itself replicated from the existing live ones. Yeah, that goes away.
00:44:50
Speaker
Exactly. So one thing is when you do an upgrade; you can do an upgrade in place. Forget about that; let's ignore that one. That's not the case here. When I want to add new brokers, three, four, five, whatever number of brokers I want to add to an existing cluster, those ones have nothing. They have an empty disk.
00:45:07
Speaker
So they cannot do anything. They need to wait until they are in sync. So they become one of the other replicas and then all the data needs to be sent through the network to reach them. And then once they have all the data, they are in sync. And once they are in sync, they can start serving, for example, customers and things like that.
00:45:25
Speaker
But obviously, all that time is now gone, because there is no such concept as "I need to be in sync". That's when you're using these new types of topics, obviously; for the old types of topics, the classical ones, everything stays the same way as before. Yeah.
00:45:40
Speaker
But you've basically pushed that problem off to S3 again, right?

Evolving Kafka's Architecture with Cloud Solutions

00:45:44
Speaker
Correct. Because they have it, they solved it, right? So why do you need to solve the problem twice? If you now have a piece of software that solves the problem, and they tell you you need to solve it again in front of it,
00:45:57
Speaker
you should basically ask yourself why you need to pay twice for solving the same problem if it's already solved for you. Yeah, yeah, yeah. I think probably the answer to that is just that at the time it was created, that wasn't an option, right?
00:46:16
Speaker
Absolutely. And I'm not saying that as bashing Kafka. I'm saying it as: now that we can use object storage directly, and Kafka understands that it doesn't just have disks but also object storage,
00:46:29
Speaker
we can remove some of the things that Kafka is doing, because they are already done by the object storage layer. So in a sense, what we do with diskless is delegate replication and storage to the storage layer. Yeah, that makes sense.
00:46:48
Speaker
Okay, then let me ask you the thorny question which came to my mind as you were describing that. What about transactions? There is an existing mechanism in Kafka where I can say I'll read data from this topic and I'll write data out to those two topics and I do all three of those things or none of them.
00:47:10
Speaker
So first we thought, can we deal with transactions later? When we proposed it, we basically wanted to do the KIP and push it there. And we thought, okay, let's not delay for months and months until we have a really huge book written about it and then throw it over the wall. We said, okay, let's write the minimum set of KIPs that we think will make the project viable.
00:47:38
Speaker
And we released those. And first we thought, okay, transactions: we might be able to ship a version one without transactions and then come back to it.
00:47:49
Speaker
And we got the feedback: no, we should rather do it right now. So we are discovering, and we have certain things that work already. So you can produce to both types of topics at the same time.
00:48:00
Speaker
And transactions: we need to check whether a couple of things work, and then we need to tune a couple more. But basically now we cannot say that's a problem for our future selves.

Gradual Adoption and Testing of Diskless Kafka

00:48:14
Speaker
It's a problem of our current selves.
00:48:16
Speaker
And we will need to deal with that one. So right now, the KIP doesn't solve that. It basically says most of the mechanisms should work with transactions, but some of them might not. So idempotency works, but transactions might have a couple of...
00:48:36
Speaker
flaky areas, or not so polished areas. And actually this is a very good question, because while transactions are not supported, that doesn't really prevent people from adopting diskless topics, because diskless is Kafka.
00:48:56
Speaker
So if you find that diskless topics do not support specific features, in this case transactions, you just go about your day, upgrading like any other version, right?
00:49:08
Speaker
And when these become available, you can actually go back to using diskless topics by simply creating yet another topic. So this is not asking the customer, or in this case the open source user, to completely lift and shift their estate to something else, or even worse, fork their estate to keep some of the Kafka topics on whatever Kafka they're using and then go on to some other system.
00:49:34
Speaker
They will just continue using the same cluster, with the optionality to have diskless. And this is extremely powerful. And we did it with a lot of intent, simply because we wanted to experience the same upgrade for our own fleet going into this innovation. We wanted to make sure that everything is trivially easy to upgrade to begin with. Yeah, it must help having a large existing customer base using Kafka, for checking statistically how this is going to work in practice and that kind of thing. Yeah.
00:50:07
Speaker
It does. Well, it keeps us up at night, right? It's a lot of work, and this is probably the time to say kudos to the whole team, keeping the lights on, working on this.
00:50:20
Speaker
It's been an insane additional workload for us to get it out of the door, because, as I like to say, the work actually just begins: we've started making the inroads to roll it out within our own fleet.
00:50:35
Speaker
And the beauty of this upgrade part is that we probably will decide to quietly upgrade everyone, even though they don't know, and just keep the diskless topics behind a flag so we can activate it opportunistically.
00:50:49
Speaker
Until now, when Filip reveals the plans to everyone in the world, because they will listen to your podcast and then they will know. You make it sound so nefarious, that you're going to upgrade them behind their backs, but that's the whole point of cloud services, right? Someone will upgrade and I never have to.
00:51:07
Speaker
Upgradeless. It's upgradeless. Upgradeless. Yeah, exactly. So we have another name for a product. Great. If you think about it, this one shouldn't be that different from Kafka 4.0 at the API level. So if you upgrade from 3.9 to 4.0, that's the same cost and complexity and problems. You will have exactly the same ones upgrading to Inkless, because Inkless and 4.0 are exactly the same thing. So it's exactly the same.
00:51:40
Speaker
Right. Almost zero cost. The only thing you need to know, when you have the new diskless implementation, is that you need to set a couple of settings.
00:51:52
Speaker
And when you create a topic, you have a new option, an optional one, to decide the type of the topic: whether you want it to be a diskless topic or not. If you do nothing, it's a classic one, obviously.
00:52:04
Speaker
And that's the only change in the API; it's a backwards-compatible change of the API. So everything works if you don't touch anything. Only if you add this new parameter do you create the new type of topic.
00:52:15
Speaker
And then a couple of settings and configs, and that's trivial as well. So yes, technically speaking, that's an additive change that needs some client modification, but it's an additive client modification, never a backwards-incompatible breaking change. OK.
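As a toy illustration of how additive that change is: topic creation gains one optional config key, and omitting it gives exactly today's behavior. The key name `diskless.enable` used here is an assumption for illustration; the final name is up to the KIP process:

```python
def topic_type(config=None):
    """Sketch: one optional, additive topic config decides the topic type.
    Omit it and you get exactly today's behavior, so no existing client,
    script, or tool has to change. The key name is hypothetical."""
    config = config or {}
    return "diskless" if config.get("diskless.enable") == "true" else "classic"
```
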
00:52:34
Speaker
Just to clarify on that: if I go and create a new object-storage-backed topic, so different flags when I'm saying create topic,
00:52:48
Speaker
but then when I go and connect my regular client software, and let's say it's not the Java one, let's pick the JavaScript one, because that's not exactly the reference implementation, but it's broker-protocol compatible.
00:52:58
Speaker
OK. Me writing code, will I notice any difference other than latency? Only latency. And that's the beauty of it. Producing and consuming.

Inkless Project and System Simplification

00:53:10
Speaker
You need nothing else to write to that type of topic. There is nothing else. It's a topic. And that's the beauty. So we decided to be really, really conservative on the number of changes toward the end user, because we didn't want to increase the risk of...
00:53:29
Speaker
the feature being denied by the upstream project. The more you modify things, the more you change current behavior, the more resistance you will have, obviously, to getting any change in, because it's a breaking change, it touches too many moving parts.
00:53:44
Speaker
So we wanted to change as few moving parts in the outer layer as possible, because if you need to rewrite all the libraries in the world, you will never finish.
00:53:56
Speaker
Yeah, that's a mountain that's too high to climb. Correct. So the only thing that you need to change is, when you create a topic, you need to pass a new flag. That's the only thing.
00:54:07
Speaker
That's the only change that needs to be done, which, by the way, can be done on the command line with the shell script that we all know in Apache Kafka. So we modified that one so it could do this.
00:54:19
Speaker
And as proof of that: Aiven is a Python company. So they use the Python client, and it works. OK.
00:54:30
Speaker
So the biggest problem we had was between 3.9 and 4.0, because the library we were using was still speaking old message protocol versions, and those were dropped in 4.0.
00:54:47
Speaker
And that was the biggest change we needed to make. And it was from 3.9 to 4, not from 4 to Inkless. OK, yeah, yeah. By the way, I'm saying Diskless and Inkless every now and then because Inkless is the project we open-sourced that is an implementation of Diskless.
00:55:05
Speaker
And that's why I keep saying both. Not randomly. I was trying to be precise. I think I got that. What you don't know is: every time you say Inkless, somehow I think of octopi.
00:55:19
Speaker
Oh, okay. Okay, that's just popping up in my head every time you say Inkless. It's the same thing as when we were looking for a logo. The first thing that came to my mind was an octopus.
00:55:31
Speaker
And then you thought about being sued by GitHub. No, the problem is we already have an octopus as a logo for one of the projects at Aiven. So we couldn't use an octopus for that. Okay. Yeah.
00:55:42
Speaker
Even though the GitHub isn't... Octocat? It's an Octocat. I'm not sure if the courts will recognize that distinction when they try and sue you.
00:55:53
Speaker
Well, just to prove what Josep said: when we set out to upgrade one of our products, it's called Bring Your Own Cloud, doesn't matter, the compatibility was so strong that it took a single engineer less than four weeks to get it up and running.
00:56:14
Speaker
This was just to update automation. And most of the stuff was what the automation shouldn't do, right? Like, don't look at the disk. Don't look at the replication progress. Don't look at this.
00:56:26
Speaker
So it was an extremely boring exercise the moment we actually started onboarding our own Kafka to diskless. It was very, very interesting how quickly things actually go when you have rethought it from first principles so that you don't really touch the API surface.
00:56:42
Speaker
Yeah. Yeah. That's the good kind of boring. We like that kind of boring. Yeah. As Filip was saying, one of the problems of the new type of topics is that there are plenty of tools out there that open the hood of Kafka and start looking at the internal state: how are my replicas, are they in sync, are they not in sync?
00:57:08
Speaker
Concepts that we destroyed for the new type of topics. So obviously, that's the thing we would need to tune or modify. And that was clear for us: obviously, when you change how things work internally, every single project that goes and looks inside will need to suffer a little bit.
00:57:27
Speaker
And this is going to change the monitoring world for Kafka. Correct. The monitoring world that is really built in, that has smart metrics and even smart actions or recommendations, would need to relearn them, or they would need to be excluded for these types of topics.
00:57:44
Speaker
Yeah. I can see that starting to percolate into the um Kafka-related monitoring tools anyway, because there are so many companies pushing in this direction that surely they're reacting to that.
00:57:58
Speaker
Yeah. Okay. So no client-facing changes, but there is a large chunk of this where we haven't discussed how it works under the hood, in the back end. We've talked almost exclusively about reads.
00:58:11
Speaker
Does this change the write picture at all? Sorry, I've said that the wrong way around. We've talked almost exclusively about writes. Does this change the read picture at all? So the read picture has the opposite problem of what we were talking about before.
00:58:30
Speaker
When you produce, what we said is that producing one single message would be prohibitively expensive, so we needed to batch them, group them, and send them. Similarly, if I'm asking for one message, and I go to S3, come back, serve it, then you come again and ask for the next one, and I go back again... you can see that will be crazy expensive.
00:58:55
Speaker
Yeah, you're going to end up looking something like a paging algorithm in an operating system, aren't you? And by the way, in a sense that's what the tiered storage plugins and the tiered storage mechanism did already in the past.
00:59:10
Speaker
You have the offsets. You need to go to S3. So you have the offset management that needs to know exactly where in S3 the data is located. And then when you fetch, you don't fetch just one message; you fetch a group of them that is big enough. You put it in your cache memory, and you serve from your cache memory.
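That fetch pattern can be sketched in a few lines: serve a single-offset request from an in-memory cache, and only go to object storage once per range of offsets. Names and range logic are illustrative:

```python
class SegmentCache:
    """Toy read path: a consumer asks for one offset, but we fetch a whole
    range of records from object storage once and serve subsequent offsets
    from memory, amortizing the expensive round trip."""

    def __init__(self, fetch_range, range_size=1000):
        self.fetch_range = fetch_range   # fetch_range(start, size) -> list of records
        self.range_size = range_size
        self.cache = {}                  # offset -> record

    def get(self, offset):
        if offset not in self.cache:
            # Round down to the range boundary and pull the whole range once.
            base = (offset // self.range_size) * self.range_size
            for i, rec in enumerate(self.fetch_range(base, self.range_size)):
                self.cache[base + i] = rec
        return self.cache[offset]
```

Sequential consumers hit the cache almost every time; only one request per range actually touches object storage.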
00:59:30
Speaker
You need to worry about that one. So there's another problem that comes up as well. And now I will showcase that I have back pain and I'm slightly old.
00:59:44
Speaker
Windows disk defragmentation, if you ever remember running that. Do you remember? No, far too young to remember defragging disks. Yeah, exactly. But let's pretend I remember exactly what that is. Exactly. Let's pretend you don't need to go to the doctor every year to check your back, and you do remember this.
01:00:03
Speaker
What happened? You were optimizing for writes. But at one point in time, optimizing for writes made reads really slow. And then you needed to go and run a defragmentation. So you needed to reorder every single block and sector on the disk, written in the proper way, so that files were contiguous and not broken into different parts.

Optimizing Read Operations and Performance

01:00:25
Speaker
Make sure that everything that sits together logically sits together physically. Yeah. And in a similar way, one of the mechanisms we propose is: when we write, we optimize for write efficiency.
01:00:38
Speaker
So what that means is that you can say: okay, I have data from this partition, also from this other partition, and they're written on the same broker. I can group those into the same object to be sent to the object storage, and push that thing there.
01:00:51
Speaker
Right. So I'm grouping things that logically might not make sense together, but it's really good for getting the data out. You remember the throughput megabytes that we were talking about before?
01:01:01
Speaker
You don't need to reach them on a single partition or a single topic. You can basically say: if I'm reaching them on that broker, across any diskless topics, then I will push them out. Right?
01:01:15
Speaker
so Right.
01:01:18
Speaker
That's the write-time optimization, but for reading, you want exactly the opposite. So at read time, what you need to do is go and read all the messages that are entangled, detangle them, and put them into the right ordering. So when I'm reading and I'm fetching data that is old,
01:01:35
Speaker
I'm getting old data that is useful for me, because when I'm reading that offset, I will probably follow with the next one and the next one. And what I want is data locality when I'm fetching.
01:01:46
Speaker
But that works nicely when you're reading old data, or stale data, quote unquote. Like archive data. Exactly. But if you're reading data that just happened, you will have a lot of consumers that read the data as it happens.
01:02:04
Speaker
So probably that write-optimized storage organization might already work for you, because the consumers more or less keep up: you're producing, and they consume as soon as the messages come.
01:02:19
Speaker
And yes, they're kind of grouped logically by time in that sense. So it probably is what you want to see first. If you're reading in the same pattern that you're writing, and always keeping up more or less with what's been written, probably the write-optimized layout is not that bad for you.
01:02:37
Speaker
But when you go back in time, and one consumer says, now I need to read all my historical data, then yes, I will fetch a lot of data that I don't even want to know about.
01:02:50
Speaker
So we would need to fetch a lot of data that gets discarded. That's why we have this optimization, this compaction: basically a re-sorting of the data so it's useful and more effective for reads.
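A toy version of that re-sorting step: write-time objects interleave batches from many partitions, and compaction regroups the records per partition, in offset order, so that sequential historical reads hit contiguous data. All names here are illustrative:

```python
def compact_for_reads(mixed_objects):
    """Sketch of the read-optimizing compaction described above.
    Input: write-time objects, each a list of (partition, offset, payload)
    tuples interleaved from many partitions. Output: payloads regrouped
    per partition, sorted by offset, as a read-optimized layout would be."""
    by_partition = {}
    for obj in mixed_objects:
        for partition, offset, payload in obj:
            by_partition.setdefault(partition, []).append((offset, payload))
    return {p: [payload for _, payload in sorted(records)]
            for p, records in by_partition.items()}
```
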
01:03:03
Speaker
Yeah. Now, that has made me think, because if I go and defragment my storage, I think that's going to be fairly straightforward, because I read a bunch of files.
01:03:16
Speaker
Once you've put it in the storage, it's kind of write-once, isn't it? Mm-hmm. Write once. So I can read stuff back, confident that it won't be updated while I'm reading it.
01:03:27
Speaker
All right, put it all together in one thing, write that back, and then tell my metadata layer: forget those four blocks, there's this one new block, right? Correct. And that makes me want... so that sounds great and straightforward, but the metadata layer: where are we writing this metadata?
01:03:45
Speaker
And what are we doing about the durability of that? We've skipped over it. You thought you could catch me out. No, no, no. I was thinking, we haven't talked about the batch coordinator yet, and it should come up.
01:04:00
Speaker
So obviously, the piece of the puzzle that is missing right now is that one. And we need to tell you that the KIP proposes one thing, while what we implemented, for now, is another thing, out of convenience.
01:04:16
Speaker
Okay. So in the KIP, knowing that this needs to be upstreamed to Kafka, we need to find a Kafka-native solution. And what's more native to Kafka than topics themselves?
01:04:28
Speaker
So you would have the offset management, or basically the batch coordinates, the thing that translates an offset into a coordinate on the object storage, be backed by a topic.
01:04:39
Speaker
That's being heavily discussed. We have several options and several possible architectures for how we save it, how we process it and structure it, and how it scales up and down.
01:04:53
Speaker
Each of them has its own advantages and disadvantages. And we were also thinking, do we follow a cell architecture or not? Do we have one single partition, or several? So it's full of discussions on what's the most optimal way of doing it.
01:05:11
Speaker
No matter what we do, if it's going on Kafka, on a classic topic, the default topic, it's as durable as Kafka itself. So technically speaking, diskless will use classical topics to maintain durability of the metadata.
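The translation role the batch coordinator plays can be sketched as a tiny model: every batch that lands in object storage is recorded as an event in an append-only log (which, in the upstream proposal, would itself be a classic Kafka topic), and the offset-to-coordinate index is just a replay of that log. Every class, method, and field name below is a hypothetical illustration, not the KIP's actual API.

```python
# Toy model of a topic-backed batch coordinator: durability comes from the
# append-only metadata log; the in-memory index can always be rebuilt from it.

class BatchCoordinator:
    def __init__(self):
        self.log = []            # stand-in for the metadata topic
        self.index = {}          # (topic, partition, offset) -> (object_key, byte_pos)
        self.next_offset = {}    # (topic, partition) -> next offset to assign

    def commit_batch(self, topic, partition, object_key, byte_pos, record_count):
        """Assign offsets to a batch that already landed in object storage."""
        base = self.next_offset.get((topic, partition), 0)
        event = (topic, partition, base, object_key, byte_pos, record_count)
        self.log.append(event)   # the durable part: appended like a topic record
        self._apply(event)
        return base

    def _apply(self, event):
        topic, partition, base, object_key, byte_pos, count = event
        for i in range(count):
            self.index[(topic, partition, base + i)] = (object_key, byte_pos)
        self.next_offset[(topic, partition)] = base + count

    def locate(self, topic, partition, offset):
        """Translate a Kafka offset into an object-storage coordinate."""
        return self.index[(topic, partition, offset)]

coord = BatchCoordinator()
coord.commit_batch("clicks", 0, "s3://bucket/block-0001", byte_pos=0, record_count=3)
coord.commit_batch("clicks", 0, "s3://bucket/block-0002", byte_pos=0, record_count=2)
coord.locate("clicks", 0, 4)   # -> ("s3://bucket/block-0002", 0)
```

The point of the sketch is only the shape of the problem: the coordinator owns offset assignment, and its state is exactly the metadata whose size and durability the conversation discusses next.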
01:05:30
Speaker
Okay. And that's the metadata as in, in this particular case, the meta-metadata. And what we're saying is that what we want to do is remove the disks for the data layer.
01:05:44
Speaker
The metadata layer is tiny. Right, yeah. It's usually not worth it to go into the business of removing the need for storage for the metadata.
01:05:59
Speaker
The returns on that would be so narrow and tiny that it probably makes no sense. And we actually have some numbers already, because we are using it internally. Under a sustained load of circa 100 megabytes a second, and it fluctuates because we actually hooked it up to a live service.
01:06:21
Speaker
The most we saw was actually 36 megabytes. So this is almost negligible when it comes to local disk used. This is for the metadata. That's 36 megabytes of metadata storage?
01:06:36
Speaker
Correct. Yeah, that's it. Total, or...? Total, yeah. This was actually total usage, spread across. It's almost negligible in the grand picture of a stateful system like Kafka.

Commitment to Open Source and Community Engagement

01:06:49
Speaker
Okay. And as I was mentioning before, that's not the same thing, though. That's the version we want to upstream to Kafka. What we implemented, to basically get something out quickly that we knew would work but would never be merged upstream, uses a Postgres database.
01:07:08
Speaker
And we happen to sell a lot of Postgres as well. We know a thing or two about Postgres. It was convenience. We had Postgres around and we knew how to deal with it. So we used it.
01:07:24
Speaker
So we used that to basically get something out quickly. We knew that going to a Kafka-native management of the batch coordination would be quite controversial, and no matter which path we took, we would probably need to take another one, or a modified one.
01:07:42
Speaker
And it's a complicated problem. So we decided to take one step back and say, okay, for us, for ourselves, we can run Kafka with another thing. It's fine. For the open source version,
01:07:56
Speaker
It can't be like that. Yeah. So you have a Postgres instance over there that stores this data. And how durable is it? As durable as you can make it, and as you want. You can have several replicas. You can have a read replica that is synced all the time.
01:08:15
Speaker
You can have constant backups, WAL backups with full backups every now and then. Yeah, yeah. You choose. The purist in me wants it to be all Kafka. The realist in me sees why you made that decision.
01:08:28
Speaker
So we will eventually go to whatever implementation we agree on collectively as the Apache Kafka community. But in the meantime, that was quick and handy, and we could get this thing out the door.
01:08:44
Speaker
Yeah, yeah. Makes sense. You have to be practical while aiming for beautiful. Exactly.
01:08:53
Speaker
Sounds like married life somehow. Something, something, something. Yeah, yeah. I'm criticising myself there, but, you know. So this is the big thing that we haven't dived into then.
01:09:08
Speaker
You have a working implementation internally at Aiven. You are working hard to upstream it into the open source project.
01:09:19
Speaker
How's that going? What's the status? What's the challenge of getting that done? So we have a project in our Aiven organization. If you go to github.com slash aiven slash inkless, like the octopuses we were mentioning before, you can see the code there. It's purely a Kafka 4.0 baseline with plenty of commits on top.
01:09:43
Speaker
And you can go into the history and see every single step of the way. We didn't hide anything. We didn't obfuscate any commit. We programmed everything in the shadows as if we were programming in the open, because we knew we were going to open it eventually. So we just went into that mode.
01:10:02
Speaker
All the history is there. That's the project we have, implemented there for everyone to see. We actually don't want people to contribute to that project, because we don't want to create a split community. We don't want people to start flocking around this new project and saying, oh, I'm using this one instead of the other one, or anything like that. If people want this feature, the right thing to do is push for that feature to be in Apache Kafka.
01:10:30
Speaker
Otherwise, we are not in the business of creating another Kafka competitor project. That's not what we want. We want Kafka to have this feature. So the first thing we did was create a KIP, which is the Kafka Improvement Process. And we started with the motivational question, which is KIP-1150.
01:10:51
Speaker
And I paraphrase, but it's basically saying, do we even want Kafka to talk to those new cloud primitives like object storage?
01:11:04
Speaker
Because we've seen that there are seven different forks out there, some of them proprietary, some of them open source, but they are forks of Kafka that have this capability. Do we want the mainline, Apache Kafka, to have it?
01:11:19
Speaker
And that's 1150. That's the KIP, 1150. And then 1150 says, if you want that, we have ideas and we have an opinion, and please go read our opinions. And we separated them.
01:11:31
Speaker
Instead of having one mega KIP, we learned from Tiered Storage, aka KIP-405. It took six years from writing to 3.6. Three point six. Was it six years? Yes.
01:11:43
Speaker
Crikey. Okay. So we learned. I checked, I looked at the history of what happened. I'm not saying I have the keys to why things happened the way they happened, and many different complex things happened at the same time.
01:12:01
Speaker
But one of the things that strikes you directly when you read KIP-405 is that it was one huge, big explanation of what this feature does.
01:12:13
Speaker
And that shapes the discussions, because, in case somebody doesn't know how proposals work in Kafka, you need to write a proposal upfront. You write upfront exactly every single change you propose, with migration path, with testing strategies and whatnot.
01:12:30
Speaker
And then you open it up for discussion with the community. Once you reach a consensus in the discussion, you open it up for a vote, and then you need at least three plus-one votes. The vote needs to be open for 72 hours, and in the end, you need to have more plus ones than minus ones.
01:12:47
Speaker
And the votes that count, let's put it this way, the binding votes, come from maintainers only. The community can vote, but it only expresses sentiment, not power.
01:13:01
Speaker
Correct. It basically expresses the community's willingness to have something or not. So I would really wonder, and it's never happened, what would happen if the community said plus one en masse and the maintainers said minus one en masse. That would be a really weird situation, where the maintainers go in the opposite direction of where the community wants. But as I said, that's never happened.
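The voting rule Josep just walked through can be written down as a tiny function, which makes the mechanics easy to check. This is a paraphrase of the rule as described in the conversation, not the official Apache text; the function name and vote encoding are made up for illustration.

```python
# Toy encoding of the KIP voting rule: at least three binding +1s, more
# binding +1s than binding -1s, and the vote open for at least 72 hours.
# Non-binding community votes are present but express sentiment only.

def kip_vote_passes(votes, hours_open):
    """votes: list of (value, binding) pairs, where value is +1 or -1."""
    binding_plus = sum(1 for value, binding in votes if binding and value == +1)
    binding_minus = sum(1 for value, binding in votes if binding and value == -1)
    return hours_open >= 72 and binding_plus >= 3 and binding_plus > binding_minus

# Three binding +1s beat one binding -1; the non-binding +1 is sentiment.
votes = [(+1, True), (+1, True), (+1, True), (-1, True), (+1, False)]
kip_vote_passes(votes, hours_open=80)
```

Note that in this encoding a flood of non-binding +1s changes nothing, which is exactly the hypothetical tension between community and maintainers that comes up next.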
01:13:25
Speaker
But imagine, it's happened in geopolitical history. Absolutely. Let's not touch that topic. Why did I bring it up? So open source is sometimes more geopolitical discussion than technical discussion.
01:13:39
Speaker
So what we decided to do was, instead of having one big, huge discussion, which is a totality discussion, where either I agree with the totality of the proposal or I disagree with the totality of the proposal. Oh, right, yeah.
01:13:52
Speaker
What we did was, can we break it down into areas that are replaceable, if that makes sense? So we have a core KIP, which is the one that explains everything.
01:14:05
Speaker
But the batch coordinator is another KIP. The core implementation needs a batch coordinator; whether it has the A, B, C, D, or E
01:14:19
Speaker
version, we don't care in that sense. I mean, the feature works with some version of it; it doesn't matter which one. Yeah, yeah. You don't want the whole discussion about the overall feature to be blocked by a legitimate concern about the way you're writing metadata.
01:14:36
Speaker
Perfect. Yeah. And then you isolate discussions where they matter and where they can be precise, concise, and as long as they need to be. Yeah. Because they might need to be long and we might need to go through a thousand million places. And that's perfectly fine.

Innovating within Open Source Dynamics

01:14:50
Speaker
But our approach was, can we isolate discussions so they are meaningful? So every single thing that is discussed moves the feature forward, rather than stalling or blocking other parts when half of it could proceed. And that's one of our attempts at making this happen faster, or making it more palatable for people to review, understand, and agree to those proposals.
01:15:20
Speaker
Yeah, we basically don't want perfect to be the enemy of good. That's really the intent behind all of it. And I think it has worked rather well so far, because we are still very much on track with the big questions: do we want it? Is this the right
01:15:39
Speaker
approach to actually having Kafka speak to those primitives, and should Kafka adopt this type of replication flow? Yeah. Instead of going into the mechanics of the batch coordinator, for example.
01:15:52
Speaker
Do you find, I mean, I'm just thinking, as you try and make the argument, you said that a large part of the argument is about cost. Does that argument land well in a purely open source project?
01:16:08
Speaker
That's a very good question. And I was talking to some maintainers as well, in petit comité, so basically behind closed doors, saying, hey, we have something, do you want a sneak peek?
01:16:22
Speaker
And that was one of the questions we got as well: shall we go there? And then the answer is, well, seven other forks of it went there.
01:16:36
Speaker
And a friend of mine, also a maintainer of Kafka, who actually works at Aiven as well, submitted a call for papers for Community over Code.
01:16:48
Speaker
And the title is, We Shall Win Kafka Back. Because effectively, if you start counting as-a-service offerings, so SaaS products, and count the number of Kafka versus Kafka-like
01:17:07
Speaker
clusters that are out there, I think the Kafka-like wins by a large margin, by sheer volume of use. Yes. Yeah. So if we think about every one of the big ones: if we talk about Amazon, they have their own version of Kafka, which is not Kafka, it's their own variant that is closed source, and nobody knows what they run. That's basically MSK, right?
01:17:33
Speaker
We know that Confluent has Kora, for example, and they also now have Warpstream. Neither of them is pure Apache Kafka. One is probably a modification of it.
01:17:44
Speaker
I'm just going to assume. I never worked at Confluent. I don't know what it is. I can only assume. I know that Warpstream is a reimplementation of the API, in their words.
01:17:57
Speaker
And we have other big players where, in the end, what they run is not Kafka itself, but versions of Kafka with new capabilities in them.
01:18:08
Speaker
So the question was, do we want to keep that dichotomy and let Kafka just become the gatekeeper of the API or the protocol?
01:18:19
Speaker
Yeah, it is starting to blur whether Kafka is a project or a protocol. And that's our take: let's make it a project again. But in order to do that, you've got to look at the reasons why it's forking.
01:18:35
Speaker
Correct. So, going back to why people create forks. I create a fork of an open source project because I have a feature.
01:18:46
Speaker
I want to implement this feature, and the maintainers do not want this feature at this moment. So I happily take a fork, make that my new mainline, and go over there.
01:18:58
Speaker
But the same beauty of open source that lets you do that lets you merge back. Oh, maybe now I've changed my mind, and now that I see those were successful features, let me add them myself.
01:19:12
Speaker
And that's the beauty of open source: you can say, I made my own copy and my own version. And in a way, they validated that this was the right feature to build on Kafka.
01:19:24
Speaker
Because, as you said, one of the questions you asked at the very beginning was, why did you open source it? When it's clear that there is business appetite, and companies are being bought for hefty amounts
01:19:41
Speaker
by holding this thing proprietary. So clearly they did something right. That feature clearly is wanted, and there is a business case behind it. And that's why the motivational question came: do we want to close that gap? It's our attempt to close the gap and reclaim Kafka as not just a protocol holder, but a project holder where the innovation lives.
01:20:08
Speaker
Yes.

Future-Proofing Kafka with Diskless Capabilities

01:20:09
Speaker
That's really epic. Yeah. There are many places in which the project has been at the forefront. In this one, there's certainly a chance to catch up.
01:20:20
Speaker
And there's a fundamental economic reason why there's always going to be that demand. Yeah. Yeah, I see it. Well, it's not only the economics. One of the things we also need to look at is ergonomics, and a few other things. And I always like to see what's the question behind the question. And even though the motivational question is present in 1150, the real question for us was...
01:20:48
Speaker
if it took six years for Tiered Storage to materialize, can we afford to have such a feature take the next six years? Can we actually have it today? Can we actually have it soon?
01:21:02
Speaker
Can the whole project actually move faster? And I always find it funny that Kafka doesn't have a marketing team, doesn't have a product team.
01:21:14
Speaker
It has a bunch of ultra-talented individuals contributing to it, and a lot of companies sit behind it. And at the same time, without those capabilities, it's still the most dominant deployment form of Kafka, right?
01:21:32
Speaker
The open source version is the most used one, the biggest one, and will probably continue to be such. So the question we actually want to ask is, can we...
01:21:46
Speaker
secure Kafka's future for the next 10 years, because we'll continue building on it. I intend to stay in Kafka probably for the next 20 years. I want to make sure that we have the necessary primitives to continue building.
01:21:59
Speaker
So going back to the motivational question, should we make sure that we take care of our own house so that we can kind of like continue building whatever we want to build on top of it? Yeah.
01:22:10
Speaker
That perhaps leads me to the last question in this, which is, I know as part of this, you're proposing that the batch-writing coordinator become an interface rather than an if-branch in the code.
01:22:28
Speaker
Do you see other kinds of batch writing coming up in the future, or is this just trying to lay the groundwork for abstraction? I'm going to come at it from the product management perspective for a second, because this is very important. We are not building the batch coordinator for the sake of having batching in streams or something fancy.
01:22:52
Speaker
We are building the batch coordinator with the intent to leverage a certain cloud primitive. In this case, it's the object store; I already mentioned it. But if tomorrow we find a very nice cross-cloud primitive, for example like EBS, right?
01:23:06
Speaker
And we don't need to cross zones there, right? We can just piggyback on it. Hmm. Why not figure out the coordination mechanism to link this back to Kafka?
01:23:18
Speaker
So the batch coordinator is the result of a problem we want to solve, meaning that we want to bypass all those tollbooths of the cloud, as I call them internally,
01:23:29
Speaker
with object storage. But tomorrow, for example, we might have a better, more interesting, lower-latency, or more economically viable cloud primitive in EBS. And by the way, very recently S3 Express dramatically lowered its prices.
01:23:47
Speaker
So we are already seeing this architecture winning simply because we can flip a switch and now use a new storage class in diskless, which is S3 Express. And this is much lower latency.
01:23:58
Speaker
It will improve end-to-end latency by almost 60%, yet it will cost just an increment on top of the S3 Standard storage class. So this ability to mix and match storage classes is extremely important for us going forward.
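The "flip a switch" idea can be sketched as a per-topic configuration knob: once data placement goes through the batch coordinator, the storage class is just a parameter. The class names, relative latencies, and relative costs below are invented for illustration; only the S3 Standard versus S3 Express distinction comes from the conversation.

```python
# Toy sketch of per-topic storage-class selection in a diskless setup.
# (relative end-to-end latency, relative cost) per class -- illustrative only.
STORAGE_CLASSES = {
    "s3-standard": (1.0, 1.0),
    "s3-express": (0.4, 1.3),   # much lower latency, a cost increment on top
}

def pick_storage_class(topic_config):
    """Latency-sensitive topics pay the increment; everything else stays cheap."""
    if topic_config.get("latency_sensitive"):
        return "s3-express"
    return "s3-standard"

pick_storage_class({"latency_sensitive": True})   # "s3-express"
pick_storage_class({"name": "audit-log"})         # "s3-standard"
```

The design point is that nothing about the topic's clients changes; only where the coordinator places bytes does.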
01:24:14
Speaker
Yeah, I can see that. So hopefully this doesn't take until 2031 to merge, and you can start taking advantage of all of that.
01:24:25
Speaker
Do you want to give a prediction, or are you not going to go down that road? I don't want to give any prediction. It will stay around forever, and then somebody will come back to me saying, look what you said on that podcast. I want to save future me from myself right now.
01:24:42
Speaker
But I hope... if I can hope for something, I hope that the discussions stay basically as they have been so far. They have been awesome discussions, really on point about how we can solve this problem, and what's the best way to solve that problem or the other one.
01:25:00
Speaker
And the only thing I'm hoping is that these are the kinds of discussions we keep having, and that there are no background discussions or conflict-of-interest discussions, which have sometimes happened in the past in many other projects.
01:25:16
Speaker
So I hope that we keep that discussion level as it is right now and move forward from that.

Quality and Security through Open Sourcing

01:25:23
Speaker
Yeah, that makes sense. My prediction is very similar to yours, Josep, but I will take it a step further. The real narrative here is that open source is quality. In many cases, we see companies open sourcing something, but not being around to maintain it two years, three years, five years, 10 years into the future, right? It's business at the end of the day.
01:25:46
Speaker
So with open sourcing, and being extremely intentional that this is upstream-aligned and open sourced, you secure the future of a feature regardless of what happens to the original creator.
01:26:00
Speaker
It's extremely important to reinforce this narrative, and this quality feel of the Apache Kafka project. So what I can actually predict is that we will not let go of this feature until it's delivered; when is something for the community to decide.
01:26:15
Speaker
Fair enough. On that community note, perhaps as a final question then: I'm someone with a vested interest in open source Kafka's future, but I'm not a committer.
01:26:26
Speaker
Is there anything useful I could do? Of course. Regardless of your maintainership level, whether you are a maintainer or not, it doesn't matter.
01:26:38
Speaker
If you are interested in that space and you want to give your opinion, go to the discuss threads. They are open. Express your opinion, or vote on any of these features.
01:26:50
Speaker
Your opinion matters. And one of the things I want to raise as a maintainer of Apache Kafka is that sometimes we are in our own bubble, because we talk among ourselves about the things we think ourselves, but there is little feedback from the community. I guess it might feel scary to go to those threads and start saying, actually, I like this feature.
01:27:16
Speaker
And we should have it. And sometimes you see two maintainers arguing about why something might not be a good feature because nobody wants it, and the community might be behind them saying, but actually, we did want that one.
01:27:31
Speaker
So to anyone who cares about Kafka, I would say: please go to the discuss threads of this KIP or other KIPs, it doesn't matter, the ones that speak to you. Go there and share your opinion, share your ideas on what's good, what's bad, what works, what doesn't.
01:27:49
Speaker
And if you'd like those features to be in there, share that too. Even if it's just, I really love this thing, I would love to have this one in. Okay, that's what I need to do then. I will do that, and while I've got the links, I'll stick them in the show notes for this episode.
01:28:04
Speaker
Awesome. And on that note, with work ahead of all of us, but far more work ahead for you two than me. Yeah, probably. Josep, Filip, thank you very much for joining me.
01:28:16
Speaker
Thank you so much. Appreciate it. Thank you very much. Cheers.

Collaborative Feature Integration and Conclusion

01:28:19
Speaker
Thank you, gents. As ever, links in the show notes if you want to get involved, if you want to review their proposals or critique their proposals, or if you want to check out the temporary fork they've pushed.
01:28:30
Speaker
I hope this feature gets merged in one form or another, but it won't happen overnight. It shouldn't happen overnight. It deserves some thought and care and community input. So we shall see what happens.
01:28:43
Speaker
If you're emotionally invested in the idea, go and check it out. As I said at the start, this episode was sponsored by Aiven, so thank you to them. I have been second-guessing myself all week about any unconscious bias.
01:28:56
Speaker
I don't think there was any, but I'm sure you'll let me know in the comments either way. And either way, if you've enjoyed this episode, please take a moment to like it, rate it, or share it with a friend.
01:29:07
Speaker
And make sure you're subscribed, because we'll be back soon with another episode. Until then, I've been your host, Kris Jenkins. This has been Developer Voices with Josep Prat and Filip Yonov. Thanks for listening.