
Bringing Pure Python to Apache Kafka (with Tomáš Neubauer)

Developer Voices

The “big data infrastructure” world is dominated by Java, but the data-analysis world is dominated by Python. So if you need to analyse and process huge amounts of data, chances are you’re in for a less-than-ideal time. The impedance mismatch will probably make your life hard somehow. 

So there are a lot of projects and companies trying to solve that problem and bridge those two worlds seamlessly, and many of the popular solutions see SQL as the glue. But this week we're going to look at another solution - ignore Java, treat Kafka as a protocol, and build up all the infrastructure tools you need with a pure Python library. It's a lot of work, but in theory it would make Python the one language for data storage, analysis and processing, at scale. Tempting, but is it feasible? 

Joining me to discuss the pros, cons, and massive scope of that approach is Tomáš Neubauer. He started off doing real time data analysis for the McLaren F1 team, and is now deep in the Python mines effectively rewriting Kafka Streams in Python. But how? How much work is actually involved in porting those ideas to Python-land, and how do you even get started? And perhaps most fundamental of all - even if you succeed, will that be enough to make the job easy, or will you still have to scale the mountain of teaching people how to use the new tools you’ve built? Let's find out.

– 

Quix Streams on Github: https://github.com/quixio/quix-streams

Quix Streams getting started guide: https://quix.io/get-started-with-quix-streams

Quix: https://quix.io/ 

Tomáš on LinkedIn: https://www.linkedin.com/in/tom%C3%A1%C5%A1-neubauer-a10bb144

Tomáš on Twitter: https://twitter.com/TomasNeubauer0

Kris on Mastodon: http://mastodon.social/@krisajenkins

Kris on LinkedIn: https://www.linkedin.com/in/krisjenkins/

Kris on Twitter: https://twitter.com/krisajenkins  

--

#podcast #softwaredevelopment #datascience #apachekafka #streamprocessing

Transcript

Language Divide in Data Science and Infrastructure

00:00:00
Speaker
There's a contradiction in the world of large-scale data at the moment. The de facto language of data science is Python. They have all the core libraries. They have all the people pushing data crunching forwards. But the de facto language of data infrastructure is Java. They have all the big name Apache projects for handling large-scale data in real time.
00:00:23
Speaker
And neither side really wants to become the other. There's no sign that the Python world is about to try and rewrite Kafka. And there's no sign that the Java world's going to come up with something that replaces NumPy, even though NumJava would be a very satisfying word to say. But until someone bites the bullet and writes NumJava, I think the divide between data science and data infrastructure is here to stay. And that's a very fertile ground for bridge builders.
00:00:53
Speaker
My bridge building guest this week is Thomas Neubauer. He's the CTO of Quix, and I might summarize his bridge building technique as use Python for data science, use Kafka for data infrastructure, but if you stop thinking of Kafka as a Java tool and just think about it as a protocol, the solution is to teach Python a new protocol.
00:01:19
Speaker
But that's a lot of work. And how is it done? How do you implement a pure Python version of the Kafka world without rewriting Kafka? How much work is it? The devil is in the details. And even if you do all that work, is it enough to give people tools in a familiar language, or are you still left with the problem of teaching them new approaches too?
00:01:43
Speaker
Let's get stuck in and find out. I'm your host, Chris Jenkins. This is Developer Voices, and today's voice is Thomas Neubauer.

Challenges in Large-Scale Real-Time Data Processing

00:02:05
Speaker
Joining me today is Thomas Neubauer. Thomas, how are you? I'm great. Thank you. How are you doing? I'm very well, very well. Nice to talk to someone. You're not too far from me, right? You're in London. Well, at the moment I'm in Prague. I'm flying to London next week. So I'm traveling a lot between Prague and London. That's a nice life. I've been there, and I spent some very happy times in Prague. So maybe I should come back with you sometime.
00:02:35
Speaker
Yeah, sure. But we're not here to talk about international travel. That's a different podcast that I'd happily present. We're talking about the world of Python, right? Yeah. Python, obviously, a de facto key part of data processing in the technical world.
00:02:54
Speaker
And you've been trying to push the de facto state of things in a new direction, which we'll get to. But give me your estimation of the market as it stands. What's the normal way that people do large scale data processing in Python?
00:03:12
Speaker
Yeah, well, the world is quite difficult, to be honest, if you are in Python and you want to do large-scale data processing right now. It depends on what your requirements are, of course. If you go to batch, it's slightly better. But if you go to real time, and we're talking about building stream processing pipelines,
00:03:36
Speaker
your options are either using the big old server-side processing engines like Flink or Spark Streaming.
00:03:51
Speaker
You can build it by yourself. So you can use Kafka and its client libraries, or similar brokers and their client libraries, to do it by yourself. And the third option would be using some cloud provider's options, like AWS Lambda, for example, but they have lots of limitations as well. Now,
00:04:16
Speaker
The reason why I'm saying that it's not ideal is because Kafka and
00:04:22
Speaker
Flink and Spark, they are all coming from the JVM world. And as a result, for you as a Python developer who wants to use them, life gets suboptimal. First of all, you don't feel like a first-class citizen in the ecosystem. And secondly, when you actually try to use it, you will see that you have to do a lot of Java-related stuff.
00:04:50
Speaker
Yeah, the first time I used the Python Flink API, I was disappointed to get a Java stack trace back.
00:04:57
Speaker
Yes, so that's true. But I think I got disappointed even more when I had to put the JAR file of a Kafka connector, and then a JAR file that this JAR file depends on, in the right folder and then reference it in my code. I was even more disappointed.
00:05:22
Speaker
Yeah, I hadn't thought about that because I'm used to writing in Java as well, but that's not a friendly way to treat a pure Python developer.
00:05:32
Speaker
It's definitely not. And there are more problems with it. Like when you want to build your user-defined function in Python, which I guess is why you're using something like Python, because you want to use Python to build the business logic or potentially use something from the Python ecosystem. And it's not that straightforward, because you're actually not running your processing in Python. You're running it in Java.
00:06:02
Speaker
And so what Flink does, for example, is run two runtimes side by side, and there is a socket connection between them. And when you map your user-defined function in Python to some columns in Flink, what's going to happen is that there will be communication between them over that socket.
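For context, a minimal sketch of what a Python user-defined function for a JVM-based engine typically looks like, written in the style of the PyFlink Table API; the exact imports and decorator arguments here are my assumption and may vary between Flink versions:

```python
# Hypothetical PyFlink-style UDF: the Python function is registered with the
# JVM-based Flink runtime, which calls out to a separate Python worker process
# (over the socket connection described above) for every batch of rows.
from pyflink.table import DataTypes
from pyflink.table.udf import udf

@udf(result_type=DataTypes.FLOAT())
def to_kmh(speed_mps: float) -> float:
    # Pure Python business logic, but when the job runs on a cluster it does
    # not execute inside the process your IDE or debugger is attached to.
    return speed_mps * 3.6
```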
00:06:24
Speaker
And that means you can't really debug it easily. There are performance implications if the payload is too big. And if anything goes wrong, you get an amazing Java stack trace back. OK, but there's
00:06:43
Speaker
I mean, I think a lot of people in the data processing world recognize that problem, but very few want to bite the bullet and rewrite something of the complexity of Flink in pure Python.
00:06:56
Speaker
Yeah, well, I think that there are a couple of new emerging server-side engines that are trying to be more friendly to Python. And there was a project called Faust, made by Robinhood, that was actually heading in that direction, but it got abandoned, which is a shame.
00:07:26
Speaker
Then, obviously, you have the cloud providers' options. But then you have quite a strong vendor lock-in, and they actually have a lot of limitations. Like, I don't know, if you use AWS Lambda, you will struggle if you have state in your services or in your transformations. And there are problems with dependencies and resource limits. So again, lots of drawbacks there. Yeah. Yeah, I can see that.
00:07:53
Speaker
It's when you have to deal with state that stream processing gets really interesting. Up to that point, you could just use simple transformers. Yes. Well, basically, I always say that as long as you're limited to one-message-at-a-time processing, so, you know, when you process your data,
00:08:15
Speaker
you don't care about the message before and the message after, it's a doable option to just use the client libraries of your broker or of your distributed system and do it yourself. But the moment you touch state,
00:08:33
Speaker
I think it gets exponentially more complicated to build in a scalable and resilient fashion. It's a really hard computer science problem to solve. Because you have to start thinking about persistence and distribution of that state. Yes, and partitioning of your data, checkpointing of your data and state, delivery guarantees, reassignments. So there's a lot of stuff that could happen.
00:09:03
Speaker
while your state is being changed. And so I would really not recommend building your own stateful processing on top of plain client libraries just for the purpose of one project. Yeah, it's, it's...
00:09:21
Speaker
Okay, then, because I know that seems like very good advice, but I know that you had a project where you felt this pain and suddenly decided to build a stream processing library in Python for one project effectively.

Building Real-Time Pipelines for Formula One

00:09:38
Speaker
Yes. So in a way. So what's the project and why was it so painful for you that you decided to bite the bullet? Well, so.
00:09:47
Speaker
In my previous job, we were building real-time decision insight in Formula One. It was basically a pipeline with telemetry data from the cars. The idea was to build
00:10:04
Speaker
pipelines that would tell you what to do in the next seconds, not the next hours. And the amount of data was quite huge. It was around 30 million different values from different sensors per minute from each car. And the people that were developing these pipelines were mechanical engineers, data scientists, ML engineers. They all wanted to use Python or Java.
00:10:34
Speaker
And this is the first time I thought, hold on, this is very powerful technology. The streaming stack was really, really great. It worked really well. But for these teams to actually leverage it was very difficult. There were no tools, no way to simply analyze the data.
00:10:55
Speaker
So, you know, things like Kafka Tool back then weren't really sufficient. And then when we actually managed to explain to everybody how to connect to the source of data and how to actually, you know, get it into Python,
00:11:14
Speaker
the biggest problem was the muscle memory of working in batch, just analyzing data in a database in a Jupyter notebook, versus building real-time streaming applications where you process row by row. It's such a different paradigm, such a different approach to the problem, that they struggled a lot with just the concept of streaming.

Shifting to Streaming vs. Batch Processing in Python

00:11:44
Speaker
Like, you know, sometimes I think even a mean, even a mean, that's the most simple operation that you can do in Excel. You know, imagine you have speed and RPM and you want to do an average speed
00:12:00
Speaker
over a table. Not everybody can do that. But to do a mean over the last 10 seconds for a car, in real time, is actually a very complicated thing and a very different approach.
00:12:16
Speaker
This is something that always interests me, because I think a lot of this sector of the tech industry acts like it's a purely technical problem. And that's a big part of it. But the mental problem is a surprisingly large leap, right? And you almost would have thought, I mean,
00:12:34
Speaker
Python has asynchronous coroutine primitives, right? And they've been in there for a long time. You would have thought the idea of streaming data would be a little more normalized in Python. Yeah, I think that
00:12:53
Speaker
Building asynchronous code is obviously a bit more complicated than single-threaded code. But I think streaming goes much further than that. The level of unintuitiveness for somebody who has spent their whole life working with static data is much bigger than for someone moving from a single-threaded to a multi-threaded application.
00:13:18
Speaker
And sometimes I kind of, you know, compare this to people trying something like Reactive Extensions for the first time. You know, it's such a huge, overwhelming new thing. Or when people in the past moved from the classical
00:13:42
Speaker
you know, WinForms-style desktop applications to these MVVM and MVC patterns on the web. These big mindset switches require you to suddenly do a common task very differently. And this is very similar.
00:14:00
Speaker
Everybody is used to getting the data from a database, analyzing it in a Jupyter notebook, plotting it on a waveform, looking at it on a map, maybe doing some filtering, some analysis. And they are in the comfort of knowing the data are not moving. When you run stream processing code, every time you run it, it will get a different input.
00:14:24
Speaker
And that's just so difficult to handle. Because at the end, you're building something which gets input, does something with it, and produces output. And it's difficult to tune that middle box when the inputs are changing every time you press Run. That's difficult.
00:14:43
Speaker
Yeah, I totally see that. I mean, maybe I've been thinking in this way for too long to fully see the problem. But to me, it almost seems a lot simpler. You just have to build something that deals with one thing at a time. And some other engine will scale that up to dealing with an infinite number of things.
00:15:02
Speaker
Yeah, so that's obviously one way of looking at it. The problem is when you get stateful: you need that context of your data.
00:15:15
Speaker
And then in streaming, where it gets even more difficult is that, apart from the actual data, what really matters is the behavior of the data. So the order of messages, the speed of messages, and the schema are also important, so they don't trip up your code.
00:15:38
Speaker
When we were back at McLaren, for example, there were lots of bugs and crashes caused just by the nature of the burst of data coming after the start of the engine. People just did not unit test it and did not anticipate that there would be a burst of data in the first five seconds worth what would be five minutes
00:16:04
Speaker
normally, and they just got a lot of memory overflows. So the nature of the data, how it comes in, is also important, not just the content of it. Okay, yeah. Okay, so
00:16:20
Speaker
So let's start talking about your solution to this problem and how you designed it. Because it's a two-fold problem, right? You've not only got to design tooling, but find some way of making it... Is it a question of training people to think in a different way, or is it a question of finding a way that fits in with their current thinking?
00:16:39
Speaker
Yeah, well, I think it's a bit of both. Having said that, I don't think that we can get away with just training people. I think I personally...
00:16:54
Speaker
felt myself that it's rather difficult. So when I was looking at streaming and at what the obstacles in the learning curve are, I was basically trying to look at it like: I haven't done streaming at all, and now I'm at the moment where streaming is intuitive for me. What were the obstacles on this journey?
00:17:24
Speaker
And can we somehow remove them?

Benefits of Python IDEs for Development

00:17:28
Speaker
And I think that the first big obstacle is what we discussed at the beginning, which is that you are not actually running your code. You're running it in an engine, which isn't even Python. So I think the big power of something like Faust or something like the Quix Streams library is that
00:17:51
Speaker
you actually run your functions in your IDE. So you can leverage the Python ecosystem. And then when you inject any sort of dependency from it, you will see how it works in your IDE. And you can debug it. You can go line by line and see, OK,
00:18:12
Speaker
I'm using this mathematical function from this science package, and it's crashing because I'm giving it badly formatted numbers. I see it because I get the breakpoint before I call that library, and I investigate all the inputs, and in the watch window in my IDE, I'm changing the input until I get it done.
00:18:40
Speaker
That is a vastly different experience for a developer than: I'm just deploying some black box, which I'm submitting to my engine, and then I'm seeing it crash or not, and I'm relying on logs to... Yeah, as much as I like cloud services for putting things in production, having something local and native is just much nicer for development time.
00:19:05
Speaker
Yeah, yeah, yeah, sure. And you know, especially when you get into the more advanced stuff like the machine learning and computer vision.
00:19:16
Speaker
you also get dependency issues. So you might just pip install something, like in computer vision, and it will not work until you get a particular system dependency onto the machine where you run your code. And so if you have a server-side engine, you have to make sure that every node where this
00:19:38
Speaker
Flink cluster or Spark cluster runs will have this dependency, this system dependency. Like for computer vision, you have the lib-something that is just going to have to be there, which normally you just install in Linux. Now, if you go with the microservice Docker route, you have Docker to do that for you. And that's brilliant. It's just a line in a Dockerfile. So,
00:20:06
Speaker
I think that library approach, where you're basically running the library in containers, gives you not just a better IDE experience, but also much easier integration with the ecosystem of the language, which in this case is Python. Okay. So in that scenario, are you connecting your IDE into the Docker container so that you're developing inside Docker?
00:20:31
Speaker
That's a good question. Yes, it's an option. So you don't have to do it. You can obviously develop Python on your laptop, and it's very easy to do that. But what we're working on right now is exactly to cover this with dev containers. So the idea is that you have
00:20:52
Speaker
You have the dev profile that you'll be using to deploy your code, and then you attach it to your dev container, so you have exactly the same environment in your Visual Studio Code or PyCharm. And it's brilliant, especially when your Python library doesn't support M1, for example.
00:21:10
Speaker
So I'm using this very often, because some Python libraries are still not built for the M1 architecture, and you can use dev containers with a different base image to overcome that. Yeah, that makes sense. I'm just making a quick note to look up dev container support for Neovim. And I'll move on from that, because I don't always use VS Code.
00:21:37
Speaker
Well, I'm not sure. But it's quite a popular technology now, so I would guess there would be some support. It's rather brilliant because you can even configure things like your add-ons in a dev container config file.
00:22:02
Speaker
So you can literally prepackage the environment for developers with everything they need, whether they're using it locally or in the cloud. Like all the editor plugins that you want. Yes, exactly. Yeah, that's nice.

Streaming Data Frames Simplified

00:22:16
Speaker
Yeah. So what we're working on right now is that we're going to have a Quix CLI. When you are developing Python code, you would have a command that would open Visual Studio Code
00:22:32
Speaker
with everything you need to develop a Python-based stream processing service, including all the requirements, all the Python stuff, all the Python plugins in your Visual Studio Code, so you don't have to go through the painful journey of learning it all by yourself. Yeah. That sounds nice.
00:22:59
Speaker
But that seems like step one, right? If that were just the problem you were solving, you could have gone into the developer tooling business generally. So what's your next step from that into making streaming specifically easier?
00:23:20
Speaker
I actually did. So the first years of Quix were more focused on tooling, which we just discussed. And yes, the second part is quite interesting. You kind of ordered it as it really happened. The second step is the actual stream processing, which is the second challenge. So even if you have the tooling,
00:23:48
Speaker
even if you get everything you can imagine to develop your code comfortably, and you have oversight of the data, and you can deploy the code easily as well. It was a huge problem for us to teach people how to use Kubernetes so they are independent.
00:24:08
Speaker
Like, you know, you're either gonna build a monstrosity internally, so people can use Kubernetes somehow abstracted, or you're gonna teach them how to do it. There's no ideal solution. And then even if you give them that, you will find that they just can't, or not that they can't, but it's very hard to get their head around this new concept.
00:24:32
Speaker
And so this is where the streaming data frames idea came into play, an attempt to bring batch muscle memory into streaming. Right. Data frames are a term I only really know from a cursory acquaintance with Pandas, right? Yeah. You're treating it as more of a concept than a library-specific thing.
00:24:59
Speaker
Yes, and to be honest, we are not the very first who had the idea to use the concept outside of the original technology. So, PySpark has a slightly similar
00:25:17
Speaker
approach for batch and big Spark data manipulation: it gives you a Pandas interface to analyze the data, but actually under the hood it's not Pandas, it's Spark, with the rows being distributed across a Spark cluster. Now, that's what we did with streaming. So
00:25:39
Speaker
you can think of a topic in Kafka as a lot of messages. You're going to have thousands of messages in different streams, a stream being a message key. So if I give an example of different Uber cars driving around a city, each driver is going to send
00:26:03
Speaker
a message with the message key, driver ABC, and the payload is going to be some data: GPS location, longitude, speed, et cetera. Now, you can think of each stream as, if you rotate it by 90 degrees, an infinite table, an infinite virtual table where each property of the message forms one column.
00:26:28
Speaker
Yeah, so imagine you have JSON, let's simplify this to JSON: there will be a property called speed, and that property will repeat in every message of that one driver's stream. Now what we're doing is basically flipping it 90 degrees into a table where each message is one row.
00:26:49
Speaker
And then you're working with that table as if it were static in your Jupyter notebook, as if it were materialized, but actually it isn't. So let's say that you have a speed column and you just want to, for practical reasons, convert it from meters per second to kilometers per hour.
00:27:07
Speaker
If you had the static data in your Jupyter notebook, you would just do one line in Pandas. You know, df, name of new column, equals df speed multiplied by 3.6. And it would just add a new column to your static data. Now, we do the same with this virtual table.
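To make that concrete, here is roughly what the batch one-liner looks like in Pandas (the column names are just examples):

```python
import pandas as pd

# Static batch data: one row per telemetry message, speed in metres per second.
df = pd.DataFrame({"speed": [12.0, 15.5, 20.1]})

# The "one line in Pandas": derive a new column over the whole table at once.
df["speed_kmh"] = df["speed"] * 3.6
print(df)
```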
00:27:28
Speaker
But the difference is that every time we get a new message, which is a new row, we execute this function to add that new column, which is actually, in this case, a new cell, and then we send it to the output topic. So we are doing the same thing. It's just we're doing it for every message rather than the whole table. Right. So I'm building up a mental image of an Excel spreadsheet where
00:27:59
Speaker
I can scroll down and down and down and it keeps going and new stuff's coming in even as I scroll. And I would like to pretend I can just transform it, I can add a new column in my Excel sheet once. What you're doing behind that pretence is forward-filling it: every time a new row gets added, you fill in that column that I defined earlier.
00:28:26
Speaker
Yes, exactly. So when you're writing your Python code with streaming data frames, you're actually not manipulating any data. You're building a lazily evaluated data pipeline, which will know what to do when the first message arrives.
00:28:44
Speaker
And then those instructions, so you're basically building instructions with your Python code, will be executed for every message. So you don't have to think about that. Your brain doesn't have to absorb the fact that data are flowing and causing these rather unnatural behaviors compared to batch.
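For flavour, a rough sketch of what that lazily built pipeline can look like with Quix Streams' streaming dataframes. The class and method names (Application, topic, dataframe, to_topic, run) are taken from my reading of the library's getting-started docs and may differ between versions, and the broker address and topic names are made up:

```python
from quixstreams import Application

# Connect to a Kafka broker (address and topic names here are only examples).
app = Application(broker_address="localhost:9092", consumer_group="telemetry-demo")
input_topic = app.topic("car-telemetry", value_deserializer="json")
output_topic = app.topic("car-telemetry-enriched", value_serializer="json")

# No data is touched here: these lines only build the "instructions"
# that will be applied to each incoming message (each row) as it arrives.
sdf = app.dataframe(input_topic)
sdf["speed_kmh"] = sdf["speed"] * 3.6
sdf = sdf.to_topic(output_topic)

# Start consuming; the pipeline above now runs once per message, indefinitely.
app.run(sdf)
```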
00:29:14
Speaker
But you can focus on the actual data and the manipulation of it. So for example, if I create a classical batch operation, I load the data of an Uber driver, I add the new column and I put it back into the database. And basically what I'm doing is I load
00:29:36
Speaker
all the data once, I add the column once, and I save it once. What I'm doing here is that I have an input topic, and I'm getting these rows as they arrive at my transformation. I'm adding the column to each row, you know, independently, and then I'm sending it to an output topic where somebody can consume it. Equally, somebody can consume the result of the batch job in the table.
00:30:05
Speaker
So at the end, you can sink that data pipeline to a table as well. And the only difference would be that the rows will be appearing in real time rather than once per day or once per hour when the batch happens. But the result is the same. And you are abstracted, as a developer, from the drudgery of thinking in this paradigm.
00:30:32
Speaker
Yeah, I can imagine writing some code in Python that looked like I'm doing a for loop over a list, and then some compiler magic transforming that into a bunch of yield statements, which actually are processing it one by one as it comes in.
00:30:48
Speaker
Yeah, that's very similar. Yeah, it's that way of thinking, yeah. OK, I'm happy with that, and I can see how that would shield, insulate the developer who's happy with batch from dealing with the streaming world. Surely this gets hard when we reintroduce things like state and windows. Yes. Having said that,
00:31:15
Speaker
it's still, you know, it's not 100%, obviously, because, for example, with windows, there are more, you know, more window types in streaming than make sense in batch. Like, for example, hopping windows are a bit, you know, counterintuitive in batch, but things like rolling windows or tumbling windows,
00:31:42
Speaker
they are still the same in a Jupyter notebook or in streaming. So you can think of it like: give me the last 10 seconds' average speed. Now, if you go to a Jupyter notebook, you load your driver's data and you do the operation, you would do df.rolling,
00:32:06
Speaker
you know, you put the parameters and it will add a column where you would use the values from the cells above to calculate a value. What we do instead is that as the rows are coming in, we save them to the state and then every
00:32:28
Speaker
message coming afterwards would be a combination of the incoming message and the state to produce the output. So then we're kind of moving from an I/O box to an I/O box plus state, where the logic goes with the state as well.
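To make the contrast concrete, a hedged sketch of both versions of "average speed over the last 10 seconds": the batch one with Pandas' time-based rolling window, and a hand-rolled streaming one that keeps a small piece of state per car. The streaming half is illustrative plain Python, not any particular library's API:

```python
from collections import deque

import pandas as pd

# --- Batch: all the data is already there, so a rolling window is one line.
df = pd.DataFrame(
    {"speed": [10.0, 12.0, 11.0]},
    index=pd.to_datetime(["2024-01-01 12:00:00", "2024-01-01 12:00:04", "2024-01-01 12:00:09"]),
)
df["speed_10s_avg"] = df["speed"].rolling("10s").mean()

# --- Streaming: keep per-key state and update it as each message arrives.
state: dict[str, deque] = {}  # car id -> recent (timestamp, speed) pairs

def on_message(car_id: str, ts: float, speed: float) -> float:
    window = state.setdefault(car_id, deque())
    window.append((ts, speed))
    # Drop anything older than 10 seconds before computing the mean.
    while window and ts - window[0][0] > 10.0:
        window.popleft()
    return sum(v for _, v in window) / len(window)
```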
00:32:46
Speaker
OK, let me see if I can push you further on this, because I can see how you could library away, compiler away some of the differences between streaming and batch. Does your shield not break when you start doing things like streaming joins of potentially infinite sources of data?
00:33:13
Speaker
Yes, obviously, the further we go into stream-processing-specific features, the
00:33:25
Speaker
further the two would drift from each other. But still, for joins, you can join data in a static environment and you can join them in streaming. In streaming, you have to provide more information about the behavior that you expect. So for example, how long you are willing to wait for your data. Now, of course, that question doesn't arise in the static world, because you have all the data at your disposal. But here, you might want to be
00:33:55
Speaker
waiting five seconds for the second source to arrive. And that slightly complicates things. But my thinking here is that it's a journey. You know, you're going into stream processing, and you're gonna start with simple stuff, then move on to probably some stateful stuff. The joins are hopefully really the thing where
00:34:23
Speaker
you can start with some defaults and then start tweaking them. Still, I think it's much simpler than using just the client API. OK, so, tell me if I'm right or wrong here, but I think I'm going to characterize your approach like this: lots of people in this world are trying to say, we need to make it seem familiar for people, so we'll start with SQL;
00:34:52
Speaker
that seems like a good choice but eventually you hit a ceiling where SQL isn't powerful enough for the job and then you get bumped into Java. And you're saying we can still do the same trick of making it seem familiar but if we start with Python, eventually we'll have to introduce new concepts but we won't have to introduce a whole new language.

Python as a Familiar Tool for Streaming

00:35:14
Speaker
Exactly. And we're carrying all the perks of a stream processing library with it, which is native in the language. So hopefully, you know, people
00:35:31
Speaker
can debug it to get a bit more sense of it. And when they're building the logic, it will be a bit less complicated to debug the problems. So that's the idea behind it, yeah. And hopefully, the muscle memory from batch will carry over into streaming.
00:35:58
Speaker
Yeah. Okay. Yeah. I quite like that. Because it's like saying, you will have to learn new things, but you don't have to learn a lot on day one to even get some results. Yeah. Yeah. Yeah. And this was always our, you know, our way of thinking: if there is a path to get you to something where you see the value fast,
00:36:23
Speaker
people will appreciate that. Because I always felt that when you're learning a new technology, or a new thing in general, having these points of: okay, I know that I can do something, and I feel enabled to do new things, because I just learned this new stuff,
00:36:46
Speaker
it kind of drives you and motivates you to go further. When you have to spend a long time to get to that first anchor point, you start getting doubtful, like, am I actually going to get there? And maybe you bail out. Yeah. Is it the library? Is it me? Is it time to move on either way? Yeah. Okay. So let's dive into some code, because
00:37:12
Speaker
I can easily think of a language or two where I'd like a decent stream processing library that isn't Java. Let me pick one. I'm going to say Gleam. I want a stream processing library in Gleam. I'm feeling spicy. Tell me how and how much work I can expect to build a proper stream processing library in a new language. Give me some pointers.
00:37:36
Speaker
Basically, first thing you have to look at is what are the client libraries for the broker you want to choose. So let's say you want to use Kafka. Now, if that language has a decent library for the broker that you want to use,
00:37:59
Speaker
then one thing is ticked. It's a good start. And then you're going to have to learn a lot of concepts if you want to go to stateful stream processing. For stateless, it's more about packaging the whole Kafka interface into a more digestible, more
00:38:25
Speaker
language-specific interface, or use-case interface, if you like. Take the Kafka library and make it idiomatic for the language. Yes. But then when you go to stateful stream processing, it's a journey, because you need to think about how you're going to store your state.
00:38:50
Speaker
Are you going to use changelog topics? Are you going to use state with checkpoints? Are you going to duplicate your state for some resiliency against database corruption? And then you need to decide: am I going to use my broker, and possibly the deployment engine like Kubernetes, to scale my processing,
00:39:18
Speaker
which is the client library approach, or am I going to build a server-side engine that's going to manipulate my data and my code in a fashion to do that? Both directions have pluses and minuses, but this is the decision you have to make. And if you go with a streaming library like we did,
00:39:42
Speaker
And then, yes, you're going to have to use the broker scaling and resiliency features to make this scalable and resilient. So like retention,
00:39:58
Speaker
the replication factor of the topics, the assignment of your partitions. You're probably going to have to use temporary topics for certain things like group by. So the way you can do a group by with the streaming library is that you re-stream your data to a temporary topic, and by that, you repartition it. If you do it with Flink, Flink will take the data and partition it inside the engine. But we don't have the engine.
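A hedged sketch of that re-streaming trick using the confluent-kafka client directly; the topic names and the group-by field are made up for illustration. Each record is re-produced to a temporary topic keyed by the grouping column, so Kafka's partitioner lands all rows for one key on one partition, where a downstream consumer can aggregate them:

```python
import json

from confluent_kafka import Consumer, Producer

consumer = Consumer({
    "bootstrap.servers": "localhost:9092",
    "group.id": "groupby-repartition-demo",
    "auto.offset.reset": "earliest",
})
consumer.subscribe(["car-telemetry"])

producer = Producer({"bootstrap.servers": "localhost:9092"})

while True:
    msg = consumer.poll(1.0)
    if msg is None or msg.error():
        continue
    row = json.loads(msg.value())
    # Re-key each record by the group-by column. Kafka then routes all records
    # with the same key to the same partition of the temporary topic, which is
    # roughly what a server-side engine would do internally with a shuffle.
    producer.produce(
        "car-telemetry--by-driver",
        key=str(row["driver_id"]),
        value=msg.value(),
    )
    producer.poll(0)  # serve delivery callbacks without blocking
```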
00:40:28
Speaker
Yeah, so you have to build some kind of process job graph generator thing that's transparent to the end user. Yeah, and well, transparent, but also extensible. So I think that whenever
00:40:51
Speaker
however rich or not rich your built-in methods are, in terms of what you offer built in, in any real application, people are going to have to build their own user-defined functions. It's just...
00:41:05
Speaker
I don't see how any SQL-based system without UDFs is in any way useful. It needs at least SQL UDFs, if anything, because in the real world, you just need more flexibility.
00:41:27
Speaker
And so that has to be easy to do. Like, you know, maybe you're doing not a mean but a standard deviation, or maybe you're, you know, calling a machine learning model to give you some estimate, or, I don't know, recently I was calculating,
00:41:46
Speaker
which is quite an interesting use case: if you have the GPS coordinates of the car, what is the distance travelled between the points? So it's a function. It's just a mathematical function.
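As an illustration of that kind of user-defined function, a sketch of the distance calculation: a flat-plane Pythagoras version, and a haversine version for when you care about the curvature of the earth mentioned just below. The constants and function names are mine, not from any particular library:

```python
import math

EARTH_RADIUS_M = 6_371_000  # mean earth radius in metres

def flat_distance_m(lat1, lon1, lat2, lon2):
    # "Pythagoras" version: fine for points a few metres apart on a track.
    dy = (lat2 - lat1) * 111_320  # approx. metres per degree of latitude
    dx = (lon2 - lon1) * 111_320 * math.cos(math.radians(lat1))
    return math.hypot(dx, dy)

def haversine_distance_m(lat1, lon1, lat2, lon2):
    # Great-circle distance, accounting for the curvature of the earth.
    p1, p2 = math.radians(lat1), math.radians(lat2)
    dp = math.radians(lat2 - lat1)
    dl = math.radians(lon2 - lon1)
    a = math.sin(dp / 2) ** 2 + math.cos(p1) * math.cos(p2) * math.sin(dl / 2) ** 2
    return 2 * EARTH_RADIUS_M * math.asin(math.sqrt(a))
```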
00:41:59
Speaker
And it's one that's easy if you just want to do Pythagoras, and a bit more tasty if you want to consider the curvature of the earth. Yes, exactly. I was thinking about that recently, talking with a friend. Now, the problem with some of the SQL engines, and maybe you can tell me how you solved this, so the problem with some of the SQL engines is this.
00:42:25
Speaker
You've got that ceiling where eventually you get kicked out into, say, Java. And if you've got user-defined functions, that ceiling goes a lot higher. There's a lot more you can do staying in SQL. But you always have a problem when you let users run their own functions, but you're trying to provide a cloud service. You can't let the public run arbitrary code, but you need to let the public run arbitrary code.
00:42:54
Speaker
How do you solve that? Yes, well, first of all, I would say that user-defined functions in SQL move the ceiling higher. They also open the door to monstrosities.
00:43:12
Speaker
I think that when people start to build business logic in SQL, very quickly it gets very ugly. I have quite a lot of experience with that from the beginning of my career, actually. I may have committed a few sins myself in my past. I remember the T-SQL times very well,
00:43:34
Speaker
when people were creating thousands-of-lines-long stored procedures. And this kind of leads to the same thing. Well, yes. So running customer code, it's a challenge. Now, the way we have solved it is using Docker and Kubernetes and setting the rules around them accordingly.
00:44:00
Speaker
But yes, you're kind of opening Pandora's box with that. But I don't see the answer in limiting the framework. The way I see it is that people will just deploy dedicated instances of the thing for systems that require the extra,
00:44:26
Speaker
the extra protection of, you know, separated infrastructure. So don't do language-level security, do container-level security? Yes, because, like, I just don't see how a SQL-based transformation without UDFs would be in any way useful, because
00:44:52
Speaker
you're literally working with five or 10 or 15 functions that you have at your disposal. That's just not going to cut it to me. I think we do sometimes in the streaming world underestimate the business value of just taking this schema and turning it into that schema.
00:45:14
Speaker
But as soon as that becomes the day-to-day work, the more interesting work, it's like being given a programming language where you can only use the standard library. There's a lot of work you can do, but if you can't define your own stuff, you're very limited.
00:45:30
Speaker
Yeah, and well, yes, and also, you know, the standard language libraries are usually much richer than Flink SQL's. Flink SQL is quite rich, because the libraries and the engine are very old, but still, it's, yeah.
00:45:55
Speaker
Still limited. The nice thing about SQL is it's very accessible and even non-programmers might well know it. I think that's becoming kind of true of Python too. Yes. Having said that, we're also planning
00:46:13
Speaker
to build a SQL layer on top of our Python library. Well, it's not that difficult, to be honest, and it has value. It opens the door even more to a bigger pool of people. But yes, I think that Python...
00:46:33
Speaker
Python is the most used language in the world, and there are reasons for that. I think it's probably easier to learn than the others, although Python wasn't my first language to learn, so I kind of never had the opportunity to learn Python first. And I think the second reason is the ecosystem. Like, if you look at the world right now, all the interesting stuff,
00:47:03
Speaker
all the things around LLMs, all the interesting innovations, are somehow happening in the Python ecosystem. And that's what I think is fueling the user base of Python.
00:47:24
Speaker
Yeah, it's becoming the de facto language for people getting into programming and for data manipulation experts, right? Yeah, yeah, it is. And I would argue that if you're starting from ground zero, is SQL going to be that much easier
00:47:51
Speaker
than Python? Because to me, the SQL problem is that lack of autocomplete, in most cases. Some databases have autocomplete, and it's great, but it's always kind of fiddly. And
00:48:11
Speaker
if you start with Python, right, and you get good content assist, I don't think it's much more complicated. But then the power you get with it is so much higher. Yeah. Yeah. Again, that ceiling is, I mean, it's a general-purpose programming language, right?
00:48:28
Speaker
OK, so are you at all tempted on that logic? You know how to do it. You're looking at making it accessible to more people. Are you at all tempted to say, our next target is the JavaScript world or something like that?

Quix's Strategy for Python in Streaming

00:48:46
Speaker
We're constantly debating this internally. What are we going to do here?
00:48:55
Speaker
We have focused on one thing, because as a company at our stage, we have to focus our resources.
00:49:04
Speaker
To be fair, because we're not building just a library, we're building the whole cloud platform to help you process streaming data. It's such a challenging engineering task that even with the focus on Python, it's still too big. We will...
00:49:28
Speaker
We will look into more languages later. We will do that. But for now we're trying to solve it for Python. And so if I start my Gleam library, you're not going to feel too threatened at this stage. No, no. Not with Gleam. But yeah, there are other languages underserved by the streaming tech stack. Like, you know, my
00:49:53
Speaker
old .NET, which I started my career on, is very underserved. It's exactly the same problem as in Python. There's no difference in the problem. It's just that maybe Python has more users, but .NET is super underserved. The client library for Kafka is very
00:50:20
Speaker
suboptimal, I would say. That's a diplomatic way of putting it. It's surprising because C# is so spiritually connected to Java, you would have thought that it would already have pretty good support. Yeah. And yeah, and that there would be something like Kafka Streams
00:50:44
Speaker
very well supported by some big company, but there isn't. There is one attempt, a community attempt, which I haven't tried. But yeah, people are left in a suboptimal place to use Kafka and other technologies in that stack.
00:51:09
Speaker
Okay, so let's just go on to a slightly different topic, which relates to you dividing your attention as a small company, right?

UX Design in Stream Processing Tech

00:51:18
Speaker
You've got a cloud platform, and I'm going to say the user interface for that, for building streaming pipelines, is one of the nicest I've seen.
00:51:28
Speaker
So give me some insights on how we make this whole technology presentable to people in the world. Not just accessible to programmers, but like the art of user interface design for a new area is a tricky one.
00:51:43
Speaker
Yeah, I think one of the most important UX elements that we have done is our pipeline view, which kind of helps you get your thoughts straight about what you're actually building and what has actually been built by others. So, like,
00:52:10
Speaker
even people that are not programmers, but are maybe going to be using this pipeline, maybe they will use the result of the pipeline or somehow integrate with it, or they just need to understand what the team is building, with this pipeline view they can visualize it in their head. Just being given a bunch of services that maybe have some environment variables in them, that's very difficult.
00:52:39
Speaker
I think the second thing we are now trying to do internally is to serve this technology to different personas, because they have very different ways of how they want to consume the data and how they want to use it for their goals. So you have a software engineer, who probably wants to use
00:53:07
Speaker
the full power of microservices, the full power of a local IDE, and the full power of the dependency management that it provides, versus maybe a Python analyst, an analyst using Python who wants to just do a simple analysis on data and just write a couple of lines of Python
00:53:34
Speaker
to manipulate data, to do the windows and the filtering and aggregations. And maybe they are not interested in using the full-fledged programming experience there. So this is where we're thinking on that front. OK. I always think that you get a data science job and you think you're going to be doing data science, and you actually spend 80% of your time cleaning up data.
00:54:03
Speaker
Are you looking into this idea of like, what's the word, data mesh, like data catalogs? Like we can build, we can easily build something that would do all the cleaning of data and then present that as a user interface, like here are your options for data sources within the company. Yes. So I think that
00:54:28
Speaker
With this approach, first of all, you don't end up having such unreadable data. That's the number one thing, because you can branch the data as it comes into the system and then save it in a nice way. So you kind of,
00:54:53
Speaker
you're kind of moving the responsibility of doing something with the data to before it's saved rather than after it's saved, which I think is very powerful, because the natural thinking is: I'm getting this data, let's dump it to S3 or a similar style of database
00:55:15
Speaker
storage, and then you have this 80% of the time where they're trying to do something with it. Well, you can actually still dump it to S3, because there's a point in that. But then you can build branches and say, okay, here is a bit of time series data, well, let's save it to Influx.
00:55:36
Speaker
And here we have some data for our vector database, and maybe here some metadata for a Mongo database, or a similar style of document store, could be useful.
00:55:51
Speaker
And then if you enable them to do that, rather than some integration engineer who is actually taking their time on it, then they can actually leverage it. So that's the number one thing. And in terms of the resources, I think that it's just about getting companies to a
00:56:13
Speaker
better place, being in a better game in terms of data, like how you treat your data and where you are in that game. This is just one step, not everything. You know, there are more things to improve, but it's just one step.

Debating Data Schemas in Python Streaming

00:56:29
Speaker
Okay, that makes me think of one more thing we haven't really touched on. And it's not a natural fit for the Python world. But data schemas, types,
00:56:41
Speaker
is that part of the puzzle that you thought about? No, it's a very good question, actually. We're going to integrate with schema registry. We haven't yet. And I'll tell you kind of the reason.
00:57:01
Speaker
Other libraries and other engines are strictly typed, and they strictly require you to have a certain schema in the topic because of the way they surface the data. They surface the data as objects, and for that, you need the types. Now, we surface data as data frames, tables with columns, and those don't require such a strong
00:57:32
Speaker
schema, although at a certain point you might want one. And there are definitely use cases where schema registry integration makes total sense, for validation and for kind of the whole company-wide orchestration of your schemas. But it's not mandatory. So you can still, equally as you would do with Pandas in a Jupyter notebook,
00:57:58
Speaker
you can check whether these columns are present in my data and maybe fill the columns that are missing with something. People in Jupyter notebooks are not used to working with schema registries. I can believe that. I believe that most Python programmers are happy with dynamic typing, sure. Exactly.
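A tiny sketch of that schema-light check, the streaming equivalent of inspecting a dataframe's columns before using them; the field names and defaults are purely illustrative:

```python
EXPECTED_COLUMNS = {"speed": 0.0, "lat": None, "lon": None}

def ensure_columns(message: dict) -> dict:
    # Validate and patch one incoming message (one "row") in place of a schema.
    for column, default in EXPECTED_COLUMNS.items():
        if column not in message:
            message[column] = default
    return message

print(ensure_columns({"speed": 12.3}))  # -> {'speed': 12.3, 'lat': None, 'lon': None}
```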
00:58:22
Speaker
The reason why we're planning to integrate it is that this is not 100% true. And also it helps with some things like data visualization, et cetera, when you kind of provide some metadata for the data that you're sending. But on the other hand,
00:58:44
Speaker
it also brings a lot of complications in other use cases, so that's why we're definitely never going to make it mandatory. So imagine you're getting data from an IoT device that has different sensors sending at different frequencies. Now, what you're going to get is, for example, G-force data 20 times per second and GPS data once per second.
00:59:12
Speaker
Now, are you going to send them into two topics? Then it will make your life very hard. So then what you have to do is create a schema that contains both, and then you're just sending unnecessary padding and encoding sugar around. Or you're going to send messages as you want, and then you have processing in the pipeline that just joins the data
00:59:42
Speaker
when you need it, and doesn't join the data when you don't need it. So you don't need to join the data to sink it to your database, but maybe you need to join the data if you need to reference both columns in your processing logic. Okay, I can see that as a solution.
00:59:59
Speaker
We are gradually going full circle here, but I'm now thinking about error handling.

Error Handling in Real-Time Streaming

01:00:05
Speaker
Now, I'm not a Python data frames expert, so correct me if I'm wrong, but I believe that when you're dealing with data in batch, you've got a million rows of data, you calculate something, all the data's there, so it works or it doesn't.
01:00:21
Speaker
In the streaming world, you can have code tested on a million rows of data, and then on the three-millionth row it turns out that number isn't a number, sometimes it's a string. How do you deal with error handling when you can't process your errors in a batch? There are two types of use cases, which require a different
01:00:49
Speaker
way of handling these situations. Either you are in a situation, and that's usually with stateless operations, to be honest, where you'd rather keep getting data continuously and you'd rather not stop your processing,
01:01:11
Speaker
and then you probably send the messages that failed to a dead-letter queue of some sort, basically a topic which will hold the messages that weren't processed properly.
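A hedged sketch of that dead-letter-queue pattern in plain Python; process and produce here stand in for whatever processing function and Kafka producer you are actually using, so treat the names and topic strings as placeholders:

```python
import json
import traceback

def handle(message: dict, process, produce) -> None:
    """Process one message; on failure, divert it to a dead-letter topic."""
    try:
        result = process(message)
        produce("output-topic", value=json.dumps(result))
    except Exception:
        # Keep the original payload and the reason it failed, then move on,
        # so one bad message does not stop the rest of the stream.
        produce("output-topic--dead-letter", value=json.dumps({
            "original": message,
            "error": traceback.format_exc(),
        }))
```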
01:01:25
Speaker
And the second use case is where it's actually a stateful operation, where missing one message could make everything completely incorrect. And that is, by the way, the default behavior of our library. The way it works is that when you reach the point where something like that raises an exception,
01:01:54
Speaker
the code will stop there, restart the service from the last checkpoint and try it again, because maybe, you know, it was some environmental factor. And if it's not, if it's really just your code, the processing will stop there until you fix it. Until you deploy
01:02:15
Speaker
a fix in your code, it will basically be stopped at that checkpoint. And it's fine, because you have the storage of the topic, and you have the checkpoint saved, and you have your state saved at that checkpoint. So the moment you update your code with the fix, maybe you had not expected the data in that form, or maybe you had not expected that this column might be missing,
01:02:44
Speaker
then you can continue your stream processing. And that's a different, this is the second scenario, really. OK. Yeah, that makes sense. And the dead-letter queue scenario, is that something I would just handle as ordinary Python code, try and catch, save it off? Yeah. Yeah, yeah. It is. Like, imagine you have very stateless
01:03:11
Speaker
processing, where the fact that one message in 100 has a corrupted schema or corrupted data in it doesn't necessarily mean that the other 99 messages have to stop being processed as well. But if you have a problem with,
01:03:29
Speaker
you know, a balance calculation on your bank account, now it's probably better to stop. Yes. And that's definitely a domain-specific thing, like how important is it to process every single record? Yeah, that totally makes sense. Okay, so I think it's almost time for me to go and build my Gleam-based competitor.
01:03:51
Speaker
I'd better go and have a look at Quix Streams first, on my local dev container. How do I get started? The first thing, you just do a pip install of Quix Streams, which is part of the beauty.

Getting Started with Quix Streams

01:04:07
Speaker
Then our website even has a code sample with a public source, so you can literally play with that.
01:04:18
Speaker
And then when you are ready, you can sign up for our cloud and actually start building the pipeline. But yeah, the easiest way is to literally do pip install quixstreams and get that code sample from our website. It's doing some windowing, and you can change it to whatever you want. Kick the tires on it and see if it's a good fit. Yeah, exactly. Yeah, cool. OK.
01:04:45
Speaker
Okay, great. I will go and do that. And then I'll go and speak to my Gleam expert team. And you'll see me competing with you two years from now. Thomas, great to talk to you. It was great to talk to you as well. And thank you very much for having me. And yeah, good luck. See you the next time you are in London. Yeah. See you. See you next time. Cheers.
01:05:09
Speaker
Thank you, Thomas. And I'll tell you that I genuinely have started scratching out a Gleam library for this, but it's going to be a very, very long time before it's anything more than my little toy playground. Unless for some reason you want to invest, in which case we're weeks away from launch, provided we can secure the right level of funding.
01:05:29
Speaker
While you're holding your breath and waiting for that library, don't hold your breath. While you're waiting for that library, if you want to kick the tires on Quix, your best bet is probably pip install quix-streams, and then check the links in the show notes for documentation.
01:05:46
Speaker
One quick announcement before I go. If you're a regular listener, thank you for joining me regularly. There won't be an episode next week. We'll be back in a fortnight. That gives you a little extra time to like, rate or share this episode. Please do. Please take a moment. And if you're not a regular listener and you're not already subscribed,
01:06:06
Speaker
Do click on subscribe and join me in a fortnight for another Developer Voice. Until then, I've been your host, Chris Jenkins. This has been Developer Voices with Thomas Neubauer. Thanks for listening.