
How Apache Pinot Achieves 200,000 Queries per Second (with Tim Berglund)

Developer Voices
2k Plays, 9 months ago

The likes of LinkedIn and Uber use Pinot to power some astonishingly high-scale queries against realtime data. The numbers alone would make an impressive case-study. But behind the headline lies a fascinating set of architectural decisions and constraints to get there. So how does Pinot work? How does it process queries? How are the various roles split across a cluster? And equally important - what does it *not* try to achieve.

Joining me to go through the nuts and bolts of how Pinot handles SQL queries is Tim Berglund, veteran technology explainer of the realtime-data world. He takes us through Pinot step-by-step, covering the roles of brokers, servers, controllers and minions as we build up the picture of a query engine that's interesting in theory and massively performant in practice.

Apache Pinot: https://pinot.apache.org/

Apache Pinot Docs: https://docs.pinot.apache.org/

StarTree: https://startree.ai/

Event Driven Design episode with Bobby Calderwood: https://youtu.be/V7vhSHqMxus

Tim on Twitter: https://twitter.com/tlberglund

Kris on Mastodon: http://mastodon.social/@krisajenkins

Kris on LinkedIn: https://www.linkedin.com/in/krisjenkins/

Kris on Twitter: https://twitter.com/krisajenkins

#podcast #softwaredevelopment #apachepinot #database #dataengineering #sql

Transcript

Scaling Databases with Concurrency Issues

00:00:00
Speaker
Orders of magnitude are a real testing ground for architectures. Every time your demands on a system go up by a power of 10, I think the game changes and gets more interesting. So this week we're going to think this through in the realms of databases. One query a second. That's easy mode. Even an analogue system like a podcast host can manage one question per second.
00:00:25
Speaker
Ten concurrent queries a second. We've just lost Microsoft Access. Last time I checked, I think it had a single global lock system. So no concurrency. Farewell, Access, not going to miss you too much, I must admit. But pretty much any other database should be comfortable in the tens to hundreds range for concurrent queries. By the time we get up to a thousand,
00:00:50
Speaker
the game begins to change. Some relational databases will be happy with that. Some are going to start to push you towards connection pools, caching, read-only replicas. And that's completely fair enough. They have a lot of work to do, which isn't just to do with querying. But I think above the thousands range, we are probably leaving a lot of pre-internet architectures behind.
00:01:15
Speaker
Okay, next level up, 10,000 queries a second. That one might even tax your operating system. Depends, of course, but you're probably in the realm of tweaking kernel parameters, definitely load balancer parameters. At 10,000, we're maybe even thinking about clustering multiple machines, and that is a game changer.
00:01:37
Speaker
But even with that, I reckon, experienced but naive me, I reckon I could build a system that handles 10,000 concurrent queries a second, provided I get to choose the kinds of queries we support. And I know that's a big caveat.
00:01:54
Speaker
Next level up, I have got to call it quits. I'm lost at 100,000 queries a second. I don't know how you'd architect that. I really don't. So it's time to call in an expert, because I know it's possible. I know they manage that kind of scale at places like LinkedIn and Uber with a little something called Apache Pinot.

Introduction to Apache Pinot

00:02:15
Speaker
So who do I know who's a Pinot expert? Tim Berglund. And he's about to do a wonderful job of taking us through the architecture of Pinot and the constraints that go into building a query platform that gets a good balance of flexible arbitrary queries and a really high performance sweet spot.
00:02:35
Speaker
I'm intrigued by that, I want to know how that's built, and Tim is not afraid to go down into the guts and explain it. And as a bonus this week, Tim has a wonderful voice for radio. So this week, the name Developer Voices lands particularly sweetly. Let's hear from him. I'm your host, Kris Jenkins, this is Developer Voices, and today's voice is Tim Berglund.
00:03:10
Speaker
Joining me today is Tim Berglund. Tim, how are you? Kris, doing great. As my grandpa used to say, any better and I'd be twins. It's good to see you. I've not heard that one before. I've been dreaming of you. No, no, your grandfather must have been a creative man, it must run in the family. Definitely was. So the reason I've got you in, and I'm going to back up: I've almost literally just got off a call with a friend of mine who's kind of a junior developer looking to get into the larger world.
00:03:39
Speaker
And he asked me why there are different kinds of database, and particularly why there are separate transactional and analytics databases. I gave him my best answer, but it wasn't recorded. I'm not on record. You see where this is going? Go on record. Why? Why do we have these different kinds of databases? I haven't really thought about that one. It is an excellent foundational question, and it's good to remind ourselves of some of the whys.
00:04:09
Speaker
I've just been doing a lot of history of tech reading and interviewing and assembling and kind of looking through that from a historical perspective, which we needn't entirely get into. But computers, digital computers started out as batch analytics machines. So at first, you had a bunch of stuff on punch cards. You fed it in and you processed the data. They were doing analysis.
00:04:37
Speaker
They get a little better, a little faster, a little cheaper; you hook up a terminal and you can actually interact. You can write an application where now some knowledge worker can do things and, as it were, input transactions, do CRUD over entities. So now there's this idea of a database that's changing, and yet you still need to find the story in the data
00:05:06
Speaker
Fast forward a few decades, we'll just kind of elide that for purposes of brevity. I think it comes down to this. When you're in transactional mode, generally speaking, as a first order approximation, you're usually worried about a thing. You're making a thing, you're trying to select a thing by an ID, maybe to read it, maybe to read it and change it and put it back. But there's usually the one thing.
00:05:34
Speaker
that you're optimized for. And in analytics, you're usually looking at many things. You want to find, you know, I get into this, ideally not read through the whole database, but select in some intelligent way the things you're interested in, get some measurement, and run a reducing function over that measurement.

Transactional vs Analytical Databases

00:05:55
Speaker
Basically, you know, that kind of filter-aggregate, or filter-filter-filter-aggregate, thing is sort of the bread and butter of analytics, and so you're concerned with these things. So transactional world, you have this thing; analytics world, you have these things. And the physical world being what it is,
00:06:14
Speaker
with the limitations of just the way mass works and electricity works. You can only do things so fast. It's probably impossible to build one system that's optimized for both things. If you look, just think about caching.
00:06:34
Speaker
Okay. Just think about caching for a minute. Caching is going to be your friend because you're doing IO and there's various tiers of ways of storing things and you're going to want to cache stuff. In the transactional case, you're reading and writing a certain number of IDs, these things, and there's probably a small set of your overall number of
00:06:57
Speaker
objects, documents in your database that you care about, that is amenable to caching, right? That works well. You've got this power law distribution probably of your stuff, and you're doing this. Well, now layer on analytic queries on top of that, and that distribution is at least going to have a fatter tail. You're now scanning all these things, maybe all the things sometimes, and that's not
00:07:23
Speaker
amenable to caching. You don't get to do both of those at the same time. So you're not going to be able to build one system that does both well. That's how I think about it. I think there's a lot of ways you could tackle this. But this is the joy of running this podcast. I get to see different people's ways of thinking. I can hear someone in my head arguing that they have got a system already that's optimized for both
00:07:49
Speaker
But your definition of optimized is a lot more focused, right? I mean, I can take an Oracle database, I think, and happily get 200 parallel connections. Yes. And I believe you wouldn't be impressed by the number 200. Well, it depends what you're doing. There's the old departmental database of the 90s, that paradigm.
00:08:18
Speaker
moved into the world of the web at some scale, you know, that might be great. If you're looking at a broadly scaled smartphone app or a website with, you know, user population in the hundreds of thousands or millions and a lot of concurrent users and serving queries, you know, live powering features in that application, then well, that's a horse of a different color.
00:08:45
Speaker
still really achievable in the transactional world with kind of the received toolset. Like we know how to do that. In the analytic world, that's a very different thing, right? Nobody until recent, very recent years has thought, oh yeah, let's do that analytic filter and aggregate group and aggregate kind of thing in the user interaction loop. That's a little bit of a leading edge kind of thing.
00:09:09
Speaker
Yeah, yeah, at that kind of scale. So coming at this from a solid background of relational databases and how they query, how do you solve that problem? How do you go from the system that will happily do 200 parallel queries happily enough to hundreds of thousands? Yeah. Well, if it's OK, I'll take the
00:09:39
Speaker
the analytic end of that. I mean, there's a lot of answers to how to do this on both sides of that divide. And there's lots of folks who kind of have solutions for the transactional side of things.

Architectures for Real-Time Analytics

00:09:56
Speaker
You've got the Cassandra kind of approach. You've got various ways of scaling, a Postgres kind of thing. You've got CockroachDB.
00:10:04
Speaker
you know, all those on the transactional side. On the analytics side, you kind of have to engineer a system with those requirements in mind. You have to say, you know what, what we want to do, and by the way, we need to back up a step and say, you said coming at this from a relational database background, you know, the tradition of analytics
00:10:30
Speaker
sort of pre-big data in the span of careers of people of a certain age, you know, kind of going back to the 90s. All that stuff happened on relational databases, traditionally, right? It was fine. That was the technology. I mean, you built schemas that didn't look anything like the third normal form you learned in college, but relational databases can do these things.
00:10:51
Speaker
But if you're going to go to this kind of scale, it's a different toolset. If you're going to say, the architecture has to change because you've got these non-functional requirements that present themselves of you want orders of magnitude, literally maybe 100, 1,000 times
00:11:11
Speaker
more than those 200 concurrent queries, to get kind of radically scaled concurrency; a freshness requirement, a sort of streaming era, real-time data era freshness, where an event happens and it takes a couple, three seconds to make it through a streaming pipeline. And then it needs to show up in results, right? It doesn't get to wait around. There's no batch job.
00:11:38
Speaker
And latency, again, if this is going to power not, again, the origin of all this is kind of printed reports, right?
00:11:47
Speaker
originally printed on green bar, then printed on laser printers. That all turned into dashboards in a browser tab. And the typical dashboard is just a faster report. And if the data is 15 minutes old, it's probably fine. That's not a big deal. If it takes a minute to refresh something, that's great. Nobody's going to get excited. But now if it's a feature in a user interface, I tap on a thing and a query happens. I'm not going to wait.
00:12:14
Speaker
Right. That needs... that's got 100 milliseconds. Users don't wait that long. I certainly don't. It's not a positive character trait, but I'm notoriously impatient for things like that. Even people in marketing who want those reports, or analysts in a company, they will wait 15 minutes because they're being paid to. They'd rather not.
00:12:34
Speaker
Exactly. But users making choices of things, people like me. Yeah, that's make or break. That's make or break. So yeah, you get this concurrency latency freshness triad of nonfunctional requirements. And so you have to start from the beginning.
00:12:55
Speaker
I think the fundamental kind of rate limiting thing that goes on inside an analytic database, an OLAP database, is scanning,

Pinot's Architecture and Query Optimization

00:13:07
Speaker
right? You're getting to where there is some metric that you're trying to aggregate.
00:13:14
Speaker
Yeah, you know, you filtered and maybe you're grouping by some other thing, but still you've got to scan values of some metric. That's the hot spot. And so you want to... There are just two things you can do. One is scan less and the other is scan faster. And I hope that... that brilliant insight... Oh, Tim, thank you for explaining.
00:13:42
Speaker
Yeah, I'll go with that so far. Today's episode of Developer Voices is brought to you by... Right, so you scan less, you scan faster. I'll give you a third one. You can split the scan over multiple machines. Exactly, and that is a corollary to scanning faster. It shouldn't be a given, it should be stated, but any system
00:14:10
Speaker
like that these days, if it's not DuckDB, is going to be a distributed system. There is that interesting case of like, hey, what can I do if I don't want to be distributed? What are cool things I can do in an analytics database? And a system like DuckDB is
00:14:25
Speaker
is all that. But yeah, you're going to split the scan over multiple machines so that you can, in effect, scan faster. Yeah. Okay. So Apache Pinot, I mean, there are a few options out there and they all take slightly different strategies to get to that glorious scan faster, scan less,
00:14:47
Speaker
scan over multiple machines thing. And I'm interested in the design choices that Pinot makes to get there. Yeah. Shall we just kind of walk through it? Let's go right down into the... give me some data structures. Yeah, right. As I said to my wife: give me some data structures. Oh, yeah. We talk about that all the time. That one might not work for me in the same way. No.
00:15:29
Speaker
Fundamentally, in Apache Pinot, you've got tables. There's nothing weird in terms of data model, in terms of query language. You've got tables. They have columns. Columns have data types. You query the tables with SQL. Nobody gets hurt.
00:15:44
Speaker
Of note, skipping around to the side a little bit: you don't create tables with SQL. You create tables with JSON. And that's an interesting design choice. The kind of classical way is like, OK, I'm going to create a table. I'm going to give you the schema. And then I'm going to tell you a little bit about some indexes that I'd like. And we'll just sort of build this thing up over time and mutate the thing.
00:16:07
Speaker
Yeah. And typical vendor extensions to create tables, like, well, here's the schema, and here's this custom bundle of key value pairs I'm going to put in there. I think the approach that Pinot takes is that this is going to be a complex thing. Let's just kind of back up, and here's a JSON spec, and you'll create a table, define a table that way. OK. So you get a table.
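For reference, the JSON Tim is describing splits into two documents: a schema (column names and types) and a table config. A minimal, illustrative sketch for a hypothetical food-orders table might look like the following; the field names are examples rather than a complete or authoritative config, and the Pinot docs linked above cover the full surface.

The schema:

    {
      "schemaName": "orders",
      "dimensionFieldSpecs": [
        {"name": "city", "dataType": "STRING"},
        {"name": "cuisine", "dataType": "STRING"}
      ],
      "metricFieldSpecs": [
        {"name": "order_total", "dataType": "DOUBLE"}
      ],
      "dateTimeFieldSpecs": [
        {"name": "order_ts", "dataType": "LONG", "format": "1:MILLISECONDS:EPOCH", "granularity": "1:MILLISECONDS"}
      ]
    }

And a table config that references it:

    {
      "tableName": "orders",
      "tableType": "OFFLINE",
      "segmentsConfig": {"timeColumnName": "order_ts", "replication": "2"},
      "tableIndexConfig": {},
      "tenants": {},
      "metadata": {}
    }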
00:16:31
Speaker
And that's not essential. Obviously there could be a future change where more DDL is implemented and you can kind of do it. But like, syntactic sugar over the top of that. Exactly, exactly. Yeah, yeah. But so, you get tables. Tables are broken up into pieces called segments. And this is the fundamental
00:16:54
Speaker
structure that leads to the distribution model in Pinot, right? So as a table is created, there's two different ways to create a table: one is out of batch data, the other is out of streaming data. However you're doing it, you're creating segments. And so a segment is an approximately time-ordered, time-delimited chunk of rows, and
00:17:20
Speaker
Okay, why time as your like segmenting strategy? And it's not strictly, I mean, you wouldn't call Pino a time series database, but I say approximately because, say the batch ingestion process, right? Maybe there's
00:17:41
Speaker
a collection of Parquet files in an S3 bucket. And you're ingesting those. You're iterating over those. And you'll iterate over them and say, well, I've got enough. I'm going to call that... that's a segment. I've built a segment. Let me load that into the cluster. And so on, during that process. And in the streaming case, when you're ingesting, say, from Kafka, that's a bit more explicit. You have messages that you're consuming, and then you cut that off.
00:18:09
Speaker
OK, I probably shouldn't say time, because that gets us thinking of a bunch of questions, and maybe we can dig into those. But we still have to think about the difference between the logical table, which you're saying feels very familiar, and the underlying disk model, which is where things get tasty. It gets tasty. Yes, you have these segments. Segments, and it would be worthwhile to
00:18:36
Speaker
Let's just call a segment a chunk of a table for now. It'd be good to dig into what one looks like in a little bit. But here's this chunk of rows, and we need to put them somewhere. So now let's get into the physical architecture of Pinot, the components of the cluster. There is a component called the server, or the servers.
00:19:00
Speaker
I don't know if I'm the only one in the Pinot world who maybe regrets that naming a little bit, because it seems like, aren't they all servers? But hey, look, there they are. They're really the true heroes of the cluster. They store segment data, and when it comes time to compute queries, they do most of the compute work. So the table is composed of segments. Segments are stored on servers.
00:19:26
Speaker
And let's see, where shall we go next? Would you like to talk about how segments are built or do you want to try to do a query?
00:19:36
Speaker
Let's go through a query, and we can dig in on the way. OK. So ingest, and more to say about that, the ingest process creates segments. Segments are distributed among servers through some process, which is interesting in itself. And now, here they are. You've got your table. There are these segments. Maybe there's one server in a Docker container on your laptop. Maybe there's 300 in your cluster in production.
00:20:06
Speaker
To make a query happen, we have another element we introduced. These are called brokers. There are usually fewer brokers than servers, but it is, again, a scalable element. And this is where a client process will actually connect to an API endpoint on a broker, submit a little
00:20:28
Speaker
JSON query document that basically just says, hey, here's the SQL I'd like you to execute. And that broker now will take that query and say, all right, we've got some table is implicated there. I see some predicates. And here's maybe the
00:20:53
Speaker
aggregation operation, not going to be of so much concern to the broker. But here's a table and here's some predicates. Let me try to figure out which servers ought to or need to do the work of executing this query.
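Concretely, that "little JSON query document" is just a thin wrapper around the SQL string, POSTed to a SQL endpoint on the broker. A sketch, reusing the hypothetical orders table from above (the endpoint path and port vary by deployment; treat this as illustrative rather than a definitive API reference):

    POST http://<broker-host>:8099/query/sql

    {
      "sql": "SELECT cuisine, SUM(order_total) FROM orders WHERE city = 'San Antonio' AND order_ts >= 1718000000000 GROUP BY cuisine"
    }

The broker parses the statement, notes the table and the predicates, and from there decides which servers and segments need to be involved.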
00:21:08
Speaker
because it will, based on

Managing Pinot's Cluster and Real-Time Data

00:21:10
Speaker
metadata that the broker knows, be able to make some good guesses about where the necessary segments live. Depending on the predicates, depending on the table metadata, it's going to try to route that query in a smart way. It's not just going to blast it to all servers that have any segment of that table. Sometimes that's what you have to do, but you don't want to do that.
00:21:36
Speaker
The word guess is interesting there. Why is it guessing? Yeah, you said it makes some guesses about which servers to talk to. Okay. That's, I think, me anthropomorphizing a little too much. It's actually not probabilistic. It's deterministic. No, you do know.
00:21:53
Speaker
Okay, good. You know, either we can prune certain segments from even needing to be consulted, or in the worst case, you can't, and you're literally going to scatter to everything and do a lot of gathering. So it'll do that. It'll figure out which servers should get the query, and there are various
00:22:16
Speaker
levels of configuration and optimization and things that you can introduce to help that work well. But in the basic case, you've got, say, maybe there's a time column. And you're going to know, well, this segment definitely only, I'm aware of that time column. I was aware of it at ingest time. It really only has things from time t1 to t2. And your query has a predicate outside that range. I'm not going to bother with this segment. Right. Yeah. Yeah.
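In other words, every segment carries a bit of metadata recorded at ingest, such as the minimum and maximum value of the time column, and the broker holds that metadata for every segment of the table. A simplified, illustrative shape (not Pinot's actual metadata format):

    {
      "segmentName": "orders_2024-06-10_0",
      "timeColumn": "order_ts",
      "startTime": 1717977600000,
      "endTime": 1718063999999,
      "servers": ["server-3", "server-7"]
    }

A query whose time predicate falls entirely outside that start/end window never gets routed to this segment at all, which is the "scan less" half of the strategy.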
00:22:47
Speaker
What if, um, I may be derailing your architecture diagram, but what if like, I can see how, let's say you've got all the food, all the data coming in from Uber Eats, right? And I can see how that'd be spread over 300 machines, whatever. Now I come along and I am hankering after barbecue food as an analyst. So I want to know like how much money was spent in Texas since the start of the year. Okay.
00:23:17
Speaker
So that feels like the start of the year isn't a very useful filter, and I'm going to really have to hit a lot of the cluster to find all the places with Texas in.
00:23:28
Speaker
And that's okay. I mean, that's fine. And you as an analyst who likes barbecue food, I definitely love that journey for you and want to help you with that query. But the main place Pinot is pointing itself is more towards the person who says, I want some barbecue. I'm in San Antonio. It's six o'clock.
00:23:58
Speaker
What's going on right now? What's the delivery time now based on the recent history and things happening in the last few minutes, even if it's 6.10, delivery time might be different than 5.57. OK. So you are explicitly optimizing for queries that have a sensible date range.
00:24:24
Speaker
Not so much that, no. I mean, that large date range is fine. That's just going to be a lot of segments. That's maybe going to be a query that takes longer. There's more scanning happening. But the user-facing analytics use case is more what Pinot is thinking of than the internal business-analyst-facing thing.
00:24:44
Speaker
Now that's not exclusive. There are interesting real-time internal-facing business analyst cases, but I think fewer of them. Once you take that same set of data and turn it into a feature in your application and deliver it to a user, then that latency, concurrency, freshness triad of non-functional requirements imposes itself on you,
00:25:12
Speaker
and you need all this stuff that Pinot does. If you're just saying, you know, which zip code sold the most barbecue at dinnertime in January or since January, it's okay if that takes a minute. I mean, who do you think you are, right? Go ask Snowflake, it's going to be fine. That's interesting because, as I say, design choices lead to different sweet spots,
00:25:41
Speaker
and how it plays out in the world. It's interesting that you're optimizing specifically for a kind of interactive user. And that's how Pinot was born. And there's a whole bunch more architecture to get back to. And I mean, hey, you're asking the questions here.
00:25:59
Speaker
We're going to go there. But do you want to do the origin story quickly?

Real-World Impact of Apache Pinot

00:26:03
Speaker
It's important, I think. Yeah. OK. Well, we're benchmarking your origin story against Madame Web, in cinemas for about three more minutes. If you can do better than that, you're doing well. Actually, if you can only do better than that, that's not good enough. That's not a lot. Yeah. Ouch. Well, OK, it was born at LinkedIn just after Kafka. So as the Kafkafication of LinkedIn was happening,
00:26:29
Speaker
And that was really the framing of that story as we get it really has to do with data pipelines and not so much reactive microservices and that whole angle of Kafka.
00:26:43
Speaker
Links in the show notes. You've got some great episodes that talk about both of those things. Thank you. The recent Bobby Calderwood interview, I think, for reactive microservices. I mean, that's my description of it. I don't know if that's what you guys, you or he would call it, but I thought it was great. So you've got Kafka.
00:27:03
Speaker
creating these real-time pipelines and sort of having its impact at LinkedIn. And now there's all this real-time streaming data, but LinkedIn was still a resume and Rolodex site. It wasn't the interactive social media site with a feed that we know today that I visit often. And the first thing that they, you know, somebody conceived of this idea and I need to find the PM, interview them.
00:27:34
Speaker
the who viewed my profile part of the site. And that was the beginning of real-time data. And they wanted it to be real-time. I mean, it strikes me that you could make that a batch thing and it would work, but that's not what they wanted to do. And so they built this and it was either Elasticsearch or something like Elasticsearch. They had a search tool and they built it and did the
00:27:57
Speaker
what was essentially an analytics use case on top of that. And they used like a thousand nodes of whatever this search kind of thing was. Back then, that is probably how you would have done it. Yes. Yes. This is like 2013. Well, that's bad. And it's kind of funny to hear Kishore Gopalakrishna, my boss, co-founder of StarTree, co-creator of Apache Pinot.
00:28:21
Speaker
He didn't want to write a new database. And this is really the story of, I think, like a relatively healthy person. What's the one thing you don't want to do? Well, you definitely don't want to write a new database. That's a terrible idea, right? Something has to be wrong. And he tried not to, but the team ended up building sort of primordial Pinot.
00:28:42
Speaker
But in the origin story, they took that 1,000 nodes down to 70, with increased traffic and decreased latency, and just all these wins. They're like, okay, this might be something. And so it was born of a user-facing analytics requirement. And then at LinkedIn, they started using it for lots more things like that. Analytics on posts, the feed is built on Pinot queries, so you're doing lots and lots of Pinot. It spread from there, from LinkedIn,
00:29:10
Speaker
And this was the early days of gig economy meal delivery. So DoorDash was relatively new. And Uber was responding to DoorDash by introducing Uber Eats. And they adopted Pinot to drive some Uber Eats functionality. OK. Let's get back to the architecture. You've brought me nicely back to thinking about food.
00:29:31
Speaker
So if I've got that "what's going on in San Antonio right now that I can order" query, take me through how the architecture of that is processed. Got it. OK. So it comes to a broker. We're going to look at the where clause and see, can I... Well, OK. Sorry. We're a broker, right? Yeah. We're in possession of fairly comprehensive cluster metadata. What tables there are; of those tables,
00:30:00
Speaker
what segments there are, what servers they're hosted on, how they're replicated.
00:30:08
Speaker
And perhaps certain other things one might know about the data in a segment like time ranges. There is a notion of partitioning, which we'll come back to when we talk about the Kafka integration. So a broker knows all those things. And I'd like to just take a brief detour to introduce another component. The broker knows those things because there's this element called the controller that really is the
00:30:37
Speaker
metadata clearinghouse. It just makes sure metadata changes happen through the controller, and it's responsible for pushing them to brokers and making sure that brokers are always up to date with the current state of the cluster. OK, so it's not the server telling the broker, like, I've changed. The server tells the... sorry, what did you just call it? Controller. And actually, what happens is when you
00:31:05
Speaker
load a new segment that is insert data, you've ingested some new data, you tell the controller. And the controller then makes sure that data gets to the right servers and then pushes that information, those metadata changes to the brokers. So it's the place where metadata changes happen in the cluster.
00:31:28
Speaker
Needless to say, there's a little zookeeper hanging off to the side actually remembering all these things. Inevitably. Thinking of itself as the true and perpetually undervalued hero in every distributed system. Yeah, absolutely. Anyway, basically, we might get to minions if there's time, but that's the pieces on the map. We can do a query now. Query comes in.
00:31:53
Speaker
Broker, being in possession of all the metadata because the controller is helpful and does its job, says, all right, based on my predicates, I see that there are these seven segments. Let's go with eight segments. Or, we think we need seven segments. Are you trying for a power of two here? No, no, no, actually. We'll go five. There's five. There's five segments that are implicated in this query.
00:32:22
Speaker
So now the broker will say.
00:32:26
Speaker
to how many servers can I scatter this query? Because it would like to parallelize that work, right? Now, I'm going to avoid going much down this rabbit hole. There's a page in the docs that does a pretty good job explaining this. But you, on the one hand, want to scatter so that you parallelize that work. That was one of the ways that we could scan faster, right? On the other hand,
00:32:55
Speaker
You know, maybe not necessarily all the way, all the time, because that will increase tail latency, that will reduce reliability. You know, components that can fail, you might have a GC somewhere. So there is this little bit of tension between I want lots of people doing this work, and I don't necessarily want everything doing this work.
00:33:15
Speaker
Yeah, parallelization isn't free, so you build in that cost. Exactly, exactly. Query routing is a deeply complex subject with lots of little knobs in it. But in the basic case, if you don't want to touch the knobs, then this thing happens. The segments are distributed to servers. The broker will figure out which ones. It will then scatter the query to those servers, and one server might
00:33:42
Speaker
have two segments and need to run this query across two segments, might have five of them, you know, it's however that works out. But let's just say it's nice, like we were building a slide for a presentation and there are five segments and there's one each on five servers and it's a beautiful world and gets scattered. Those servers now
00:34:03
Speaker
do the filtering and scanning. They do the IO on the segment and do the compute, whatever that might be. I've scanned this metric, it's delivery time or it's temperature or it's whatever, and now I'll average it or do whatever it is I do, create that result set and those servers then send that back to the broker. The broker collects all of the results it has scattered
00:34:33
Speaker
and does that final reducing operation, whatever it is, and then you've got a result. Okay, so there are shades of MapReduce inside this. MapReduce always...
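For the simple aggregations this single-stage path is built for, the "final reducing operation" is cheap because each server ships back only a small partial result. For an average, say, each server returns a running sum and count for the segments it scanned, and the broker merges them; a schematic illustration (not Pinot's actual wire format):

    {
      "partialResults": {
        "server-1": {"sum": 1240.0, "count": 400},
        "server-2": {"sum": 310.0, "count": 100}
      },
      "finalResult": {"avg": 3.1}
    }

Here the broker computes (1240.0 + 310.0) / (400 + 100) = 3.1, so the heavy scanning stays on the servers and only a few numbers cross the network.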
00:34:45
Speaker
always pops up. It's down in the corner winking at you, and you thought, it's what, 2024? You thought, you know, 11 years ago, you were too good for it, and it was a pain, and it's just there saying, yeah, no, I'm still here and you can't get away from me. It gets better.
00:35:05
Speaker
It gets a little more MapReduceier. The thing about MapReduce is it really was a quite general paradigm that just simply is the case.
00:35:19
Speaker
Yeah, if it wasn't invented, someone else would have... It's kind of discovered. It was discovered, it wasn't invented. Yeah, it was discovered and it keeps getting rediscovered. So yeah, that's the single-stage query engine I just described. I can, by configuration...
00:35:38
Speaker
or by an option on each query, say, I would like the multi-stage engine, please. Because that single-stage engine works real well for that filter and aggregate thing I just described, right? But what if it's a join? What if I have two large fact tables that I have to join and then do some computation on the result? Well, that would cause gigabytes or more of results to stream back to the broker in
00:36:06
Speaker
a really ugly case and brokers aren't built for that. So the way that works is in the multi-stage case, the broker will compute this multi-stage plan. It'll figure out, well, yeah, okay, to begin with, the kind of root scanning that we have to do to get started. Sure, these are the servers that are gonna do that work, but they will then,
00:36:33
Speaker
stream and shuffle. Again, MapReduce down there, like I never left.

Efficient Query Processing in Pinot

00:36:41
Speaker
I'm thinking of the two astronauts looking back at the Earth.
00:36:46
Speaker
It's all MapReduce, isn't it? Always was. So anyway, that first tier of servers will stream potentially lots of data, because now this is built for this, to another tier of servers, not brokers, who might say... and to make this a little more explicit, say there's a join, an inner join, and then an aggregation on the results. Well, you'll do the left join, maybe a filter if you can.
00:37:16
Speaker
The select for the left table may be a filter if possible. The select for the right table may be a filter if possible. Servers are doing this work. They'll stream the left and the right results to another tier of servers who will actually do the join. Right.
00:37:33
Speaker
stream to another tier that actually does the reduce, whatever the aggregation is, and then back to the broker. Back to the broker. So the broker at first is doing this query plan and then almost drawing a map
00:37:50
Speaker
for how the data is going to travel eventually back to me. Yes, yes. And the little shuffle step, which has to happen there, is because if you're doing, say, an inner join, at some point, you've got a left-hand table, and you've got to go look at a thing that's your right-hand table. And what is that thing? Well, it's going to be a key value map in your memory, probably, right? So you've got to get those keys in the right place.
00:38:20
Speaker
As it's shuffling that, is it re-sharding it so you're trying to deal with a subset of the keys on each later stage? That's a good question. I imagine the answer is yes. That gets to a part of the engine, a level of detail in the engine that I don't know. But there can be many stages there. And so that is likely that that happens. But I don't want to commit to that answer so much.
00:38:43
Speaker
Okay, that's fair. This isn't quite a job interview or an exam. You don't call them exams. You call them tests. Very different. Maybe it feels like an exam. We call them tests over here. Well, I know what this podcast is about. This is not a podcast for fluff, so I don't object at all.
00:39:01
Speaker
Good. Okay. So I can see how that works, but since we've mentioned MapReduce, let's mention it one more time, my first association of MapReduce was Hadoop, as it was for many of us poor souls. And I associate that with parallelization. I do not associate it with speed. No, no, you don't. Convince me that this process will be quick.
00:39:28
Speaker
Yeah, yeah. Well, I mean, the proof is in the pudding, right? There are folks who publish results of how this stuff works. It always depends on how big the data is, what the cluster is like. I mean, you know, there are so many variables that we almost need
00:39:46
Speaker
Like NASCAR, you know, there's like one car that you can build, and then the driver is the one that really competes. I can't believe you're using a NASCAR analogy with me. None of this is lost on me, but it's a podcast with a British host. Let's talk about NASCAR.
00:40:07
Speaker
I do actually know what NASCAR is, so we're on good ground. There you go. And no disrespect for the whole ecosystem there. I'm just not a part of it. But the idea is that the car is standard and the driver is what differs. And if you had a standard cluster and a standard data set, then
00:40:24
Speaker
then you could start to make some comparisons. But this is all... there's just, everything is apples and oranges in this world. But yeah, no, again, the design of this thing, of Pinot, and the controlling kind of set of non-functional requirements as constraints, concurrency, freshness, latency,
00:40:49
Speaker
that first kind of query, the filter and aggregate thing, bread and butter, single-stage engine, there are documented results, from users, not StarTree, of like 10, 12, 15 milliseconds P95 latency on meaningful production data sets.
00:41:14
Speaker
Yeah. But what I mean is, so I'm not doubting the speed. What I'm kind of asking is, I mean, what's going on? Because 12 milliseconds, right? From the architecture you've described, you could easily have spent that already just on networking and serialization. So, yes. And that's not a join, right? So one of these fact-to-fact joins on large data sets. And this could be a query that takes a second.
00:41:40
Speaker
He said with horror. Yes, yes. No, that's not a difficult Pinot kind of latency. But again, once you start coming away from the hyper-optimized thing into generalized joins, well, yeah, it's going to take a little bit. But not Hadoop time. That's the whole reason, and the world into which Pinot was born
00:42:06
Speaker
was a world of Hadoop and Hadoop pivoting into Spark. And same thing with Spark. I mean, what are you gonna do that takes less than 10 seconds? That's a fast thing. That's anathema. And we get into reasons for that, but there are interesting decisions about,
00:42:28
Speaker
the coupling of compute and storage, the pre-allocation of storage, the pre-allocation of compute, like, are these things going to be on demand or reserved, and how tightly coupled are they going to be? And there's kind of a, you know, you can make quadrants with those two. And Pinot lives in a place in that quadrant, typically, where storage and compute are tightly coupled and pre-allocated. They're ready to go. You don't go find
00:42:55
Speaker
resources. The resources are there and the compute happens in the server right where the data is on an SSD on the other side of a PCIe connection. And that coupling is key to the scan faster. Right. Is it going to typically be the case that the brokers and the servers are on separate machines but in the same rack?
00:43:23
Speaker
Yes, they will likely be in the same rack, you don't know, but same availability zone for a cloud deployment and definitely separate machines.
00:43:37
Speaker
Right. In that case, I think to get the next piece of this puzzle, we need to start talking about indexes, because we haven't mentioned that at all. No. And that is a key place where Pinot has decided to elaborate and introduce complexity and differentiation and things. OK. Key point.
00:44:00
Speaker
It might be a good time to just remind ourselves, those segments that we make when we ingest data, those are columnar in nature. So you've got inside a segment little chunks of contiguous storage that store the column values of a table. In no case do you have a row all stored together.
00:44:23
Speaker
That goes back to the, are you transactional or analytic? If you're transactional or OLTP, you want the thing and probably the whole thing. So put the whole thing in one place because you want to deserialize it or serialize it all at once. If you're analytic, you're probably scanning over some chunk of metrics, so metric values. So put those in one place. Right. Yeah.
00:44:51
Speaker
So when a row comes in, you're actually creating lots of different segments, one for each column, right? Well, it's called a segment, but internally in that segment, you've got chunks of columns. And indexes. A segment is all of the columns of the rows that it comprises.
00:45:15
Speaker
plus the indexes that you've built on those. So what have you got? Of note, there is an inverted index. I've got this value. Can you tell me the documents, rows in which this value occurs? That's good for low cardinality dimension type columns, city, state, country.
00:45:42
Speaker
There are a couple of different text indexes. If you've got a column that's a blob of text, of unstructured text, Pinot has essentially a text index that is Lucene. So anything Lucene can do, it can do, because it is
00:46:01
Speaker
embedded Lucene. And it also has recently, in the last year and a half, grown a native text index that's like a higher-performance subset of the common kinds of prefix, suffix, phrase, Boolean, regular expression... yeah, stuff.
00:46:21
Speaker
There is, this is all building up to the cool index. So just give me a second. We're getting somewhere. It's actually called the cool index. Yes. There's a JSON index. So imagine, I mean, a lot of your data you may be ingesting could be in JSON format that you're going to flatten and things. But imagine if you've got like a
00:46:43
Speaker
sub-document object, nested object is the word I'm looking for in that JSON that is sparse. There are 200 keys that might show up, but you usually only have five of them, say. Right, yeah. The schema's very flexible, the data's quite narrow.
00:47:05
Speaker
Yeah, yeah, yeah. You can have 200 columns. That would be a life choice. Pinot gives you a JSON index where you could say, let's just take that embedded object, bring it in as JSON in that column, and JSON index it. So now you can index into fields and array values and all the usual suspects of what you might want to do. Are you saying specifically which paths you want to index?
00:47:32
Speaker
Um, um, I don't think so. No, no, you're not. You're just saying this thing, go index it. And so look through all the paths in there and build something sensible.
00:47:45
Speaker
Yeah. OK, cool. So, let's see, we've got text, we've got JSON. Is there a geospatial index? Because Uber was an early adopter. Obviously, that's going to be a thing. They kind of care about where things are. Yeah, got to know where that barbecue is. That's a whole podcast on its own. And I will be clear, I frankly don't know much about the geospatial index. I haven't really put it through its paces, but it's there and highly productionalized.
00:48:12
Speaker
Let's get to, what am I missing? Oh, range, ranges, metric ranges.
00:48:23
Speaker
So something, just a numeric column, that won't work well with an inverted index because the cardinality is very high. You might have a number of unique values that's equal to or in the same order of magnitude of the number of rows in the table. And so the range index will just chunk that up into ranges and build an inverted index out of those ranges. Same thing with time stamps. Time being a classic one, right? Yeah. Yeah, yeah.
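In config terms, all of these are declared per column in the table config rather than with CREATE INDEX statements. A trimmed sketch of the relevant section, reusing the hypothetical columns from earlier, with order_metadata standing in for the sparse JSON column discussed above (key names follow the shape in the Pinot docs, but verify against your version):

    "tableIndexConfig": {
      "invertedIndexColumns": ["city", "cuisine"],
      "jsonIndexColumns": ["order_metadata"],
      "rangeIndexColumns": ["order_total", "order_ts"]
    }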
00:48:50
Speaker
And the cool one. The cool one. They're all cool. The StarTree index. Not to be confused with the company called StarTree. Which was named first. The StarTree index was named first. Right, okay. Yes. If you named the index after the company, that would be dorky. Naming the company after the index is cool. Is cool, exactly. Well, it is an implementation detail that leaked into the interface. Happens to everybody.
00:49:16
Speaker
But it's cool, and even if five years from now the StarTree index is old news and there are newer and cooler indexes, you still have a cool name. This is basically like a pivot table
00:49:33
Speaker
saved to disk. You pick columns, and the use case here is a filter-and-aggregate query. So there are some columns that may show up in the where clause, in some order and some combination of them, say three of them, four of them. We don't know which ones are going to be there. And then you want an aggregation on some other metric,
00:49:58
Speaker
some average, whatever. You can actually have several aggregations computed in the index, and it will actually build a tree of the different values of the different
00:50:15
Speaker
predicates that show up, the different columns that are going to participate in the index. And so now you've got this log(n) search through the index where you get the pre-computed aggregate. So you just read the index. You don't even have to go scan the column anymore.
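Declaring one of these also happens in the table config: you list the dimension columns that may show up in the WHERE or GROUP BY, and the aggregations to pre-compute. A hedged sketch, again with the hypothetical columns from earlier (the key names track the Pinot docs, but treat the values as illustrative):

    "starTreeIndexConfigs": [
      {
        "dimensionsSplitOrder": ["city", "cuisine"],
        "functionColumnPairs": ["COUNT__*", "SUM__order_total"],
        "maxLeafRecords": 10000
      }
    ]

A query like SELECT SUM(order_total) ... WHERE city = 'San Antonio' AND cuisine = 'bbq' can then be answered from the pre-aggregated tree nodes instead of scanning the raw column.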
00:50:32
Speaker
So this is where you get those crazy 12 millisecond latencies. Right. If you can pick the columns that everyone's most frequently asking about and the kind of sums that everyone's frequently asking about. You design the index because you know there are these queries.
00:50:51
Speaker
that are going to happen. And this is the typical response when a person first, you know, configures the index and runs a query: everybody always thinks something's broken, or like, this has to be in a cache somewhere. This can't be real. You know, it's too fast. It's too fast. I just can't believe it. Stop, stop. You're giving me too much. No, but it's a super cool index, because you don't want to be limited to that, right? The filter on these few things and compute this aggregate.
00:51:20
Speaker
That's a lot of what you do in OLAP, so you can really optimize the daylights out of that. That makes me think of Google Analytics, where, okay, in that case you can't ask any question, but there are certain combinations of fields you're pretty flexible about, and you can get those quickly.
00:51:39
Speaker
Yeah. Presumably, they're using something very similar to that kind of index. And then failing to do the rest. Because I've got questions for Google Analytics that they can't answer. How are users? Well, that'd be a good Google Analytics PM. Get them on the show. Yeah.
00:51:59
Speaker
You can get Google people on the show, but you can't get their lawyers to sign off on releasing it. That's the thing. Anyway, that's an aside. I want to pick up on one quick thing you said, which was, maybe I've misunderstood the size of these segments, but if you're storing the data and the index, that seems a bit strange to me because the whole point of an index is it tells you quickly where else to go, but where else sounds like it's right by where you already are.
00:52:29
Speaker
Yes, and you want to think of a few hundred megabytes, usually. Segments get chunked off into something less than a gigabyte. And that's variable, but that's a common sort of size. But the indexes are there.
00:52:49
Speaker
The broker is doing its best to make good decisions about which segments are actually going to get asked questions. And there's a little bit of metadata around that. There's even an index. It's called an index. It's not exactly an index, but it's called the Bloom Filter Index. So if you've got
00:53:16
Speaker
Equality predicates, known equality predicates that are likely to happen on a particular column. You can have the broker, you know, be able to consult that bloom filter and know... sorry, not the broker, the server, be able to consult that bloom filter and know, should I even bother with this? So the
00:53:37
Speaker
The reason the indexes go in the segment is because you've got these little bits of metadata where the broker is trying to do a good job, not just routing to smart places, but pruning in a smart way. And so what you don't want to do is
00:53:58
Speaker
scan through all the values in a segment. You really would like to try not to do that. So you can more efficiently read indexes or consult a portion of the index in memory and make better decisions about how to scan. That's the scan less priority. Right, yeah. So there's a thing about indexes that tell you where to go efficiently. And that part, that role is happening on the broker.
00:54:28
Speaker
And then there's a thing about indexes of gathering the data up quickly once you're there. Not quite. And that's happening on the server. There's table metadata that the broker has that is something that the process of building an index might tell you, but it's not properly a part of the index. It's just table metadata. We have this timestamp column. We're going to tell you the first and last time, the highest and lowest time in this segment.

Data Ingestion and Real-Time Processing in Pinot

00:54:55
Speaker
or you're partitioning on this other column, we're going to tell you the values of the partition key that are in this segment. So that metadata is owned by the broker. And it's index-ish. Like I said, could be the output of a notional index computation process, but it's not a part of the index. The role of the index on the server is to optimize the actual scanning of values in a segment.
00:55:23
Speaker
Yeah. Okay. I can see, I see the distinction, but it's certainly allowing you to narrow down a good chunk. Yes. It was just what you want to do. Scan faster, scan less. You know, you can only optimize your IO code so much. And so scan less in smart ways is where you make your money.
00:55:44
Speaker
Okay, so I want to gradually get into getting the data in here. Yes, I feel like we haven't got a lot of data. And we must. In order to get there, I think, because we're going to end up talking about Kafka, I know it. I don't know how we could not. It's sort of contractually obligated when you and I talk. Between the two of us, yeah. Yeah, absolutely.
00:56:04
Speaker
You haven't really touched... you've talked, you've mentioned, but you haven't really touched on, you have two kinds of tables. You have batch and streaming, I think you said at the start. But these all seem like batch tables to me. Yeah. I think when explaining this stuff, I always start with batch tables. And the terms of art in Pinot are offline and real-time.
00:56:32
Speaker
Now, offline tables are quite available for queries all the time. Again, we know where that word comes from, but it's a little bit misleading. So there's offline and real-time tables. And I think it's easier to start thinking about the way offline tables work. Because real-time tables are just offline tables plus this extra thing, from a logical standpoint.
00:56:56
Speaker
There are also hybrid tables, which are actual combinations of ingested batch data sources and ingested streaming data sources. But let's put that to the side for a minute, and let's just talk about ingestion. We'll save that for the Christmas special. That's for the Christmas special. All right, so offline table ingestion, batch table ingestion.
00:57:20
Speaker
formally is done outside of the server and outside of the broker. It's a process, quote unquote, external to the cluster. And you could literally, there's an API, you can write a Spark job, write a Hadoop job if you're into that kind of thing, that takes some external data source and makes it into a segment.
00:57:40
Speaker
and then presents that segment to the controller for uploading into the cluster and movement to the servers and all that stuff, right? So you can actually, using Pino APIs, create segments, put them into the server. And that's a little bit, I don't wanna say it's weird. I mean, it's a perfectly great thing to do if you've got a lot of lifting to do.
00:58:07
Speaker
If we could just go back to the case of, I've got all this CSV in S3 or something like that or parquet files or Avro, whatever it is. There are built-in command line tools where you can point to those things.
00:58:23
Speaker
You've created the table, and the table has a schema, and you write a little ingestion job spec file. The only place YAML really appears in Pinot, unfortunately, those ingestion job specs are YAML. And there's another component called the minion servers. And this is where, and we just kind of know, there's going to be these background things you need to do. You're going to need to go through and
00:58:53
Speaker
delete things for GDPR compliance. There'll be reprocessing of segment data. And so the minion mechanism is that. It's just a little distributed job,
00:59:05
Speaker
distributed computation mechanism on these servers over here, off to the side. And batch ingestion is typically done, if you're not writing your own in Spark or whatever, it's done with minions. You can not even really know that, right? You're just kind of following the script and running the ingestion tool and pointing to the things and writing the spec, and
00:59:26
Speaker
okay, it's running and I can go to the UI and I can see it running and oh, it's done now, right? But what's happening there is these minion tasks are being created and distributed among the available minion servers to read the inputs, create the segment files, present the segment files to the controller, transfer them to the servers, all that stuff. Yeah. Okay. That's batch.
00:59:49
Speaker
Good enough. Should we move on? I'm happy. Make it more complicated for me. Yes, but more awesome. So Pinot co-evolved with Kafka. Kafka was young and I don't know what...
01:00:02
Speaker
I'd have to look at the history, but it might've been like, you know, before Kafka even had replication, there was Proto-Pinot being built alongside. And so... The grapes were being squashed. They were being squashed before Kafka was the force it is now. And as a result, its integration with Kafka is very... what's that?
01:00:35
Speaker
And so the typical way, if you have a database and you have Kafka, you know, you have maybe something like Kafka Connect to read from a topic and write stuff into the database. None of that here. Pinot is its own Kafka consumer. And so when you have a real-time table, part of the... and we could consume from Pulsar and Kinesis and other places too, we'll just keep talking about Kafka here, just to keep it simple... part of the table configuration, which, remember, is this JSON file, literally contains
01:00:56
Speaker
They're familial, they're brothers, sisters growing up in the same home.
01:01:04
Speaker
Bootstrap server URLs, credentials, topic name. To be a real-time table is to be connected to a streaming source. Oh, I see. Okay. What you do is you consume messages from that streaming source.
01:01:19
Speaker
And you have options like do I start at the beginning? Do I start at the most real? All that stuff is there. But that's what it means to be that table. And so as soon as you create a real-time table, it's going to try to connect to that Kafka cluster and start consuming. And it happens. OK.
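That connection is declared in the real-time table config itself. A trimmed, illustrative sketch of the Kafka-related streamConfigs (property names are close to what the Pinot docs show, and exactly where streamConfigs nests has shifted across versions, so verify against the docs for yours):

    {
      "tableName": "orders",
      "tableType": "REALTIME",
      "segmentsConfig": {"timeColumnName": "order_ts", "replication": "2"},
      "tableIndexConfig": {
        "streamConfigs": {
          "streamType": "kafka",
          "stream.kafka.topic.name": "orders",
          "stream.kafka.broker.list": "kafka-1:9092",
          "stream.kafka.consumer.type": "lowlevel",
          "stream.kafka.consumer.prop.auto.offset.reset": "smallest",
          "realtime.segment.flush.threshold.rows": "500000"
        }
      }
    }

The auto.offset.reset property is the "do I start at the beginning or at the most recent" choice, and the flush threshold properties are the configuration knobs, mentioned a little later, that decide when a consuming segment counts as full.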
01:01:37
Speaker
Does that mean that there's a minion looking at a Kafka topic, constructing a segment, and what's it going to do? Is it going to wait until it's got several hundred megabytes and send it off to the controller? That would be one way to do it, which would be terrible. And so the answer is no, minions are not involved in real-time table ingestion.
01:02:02
Speaker
only servers. So the controller is going to make some decisions about how many partitions there are, how many servers need to be involved. It's not time for this yet. We're going to come back to this. Let's just say the server will add layers onto this, but servers are Kafka consumers directly.
01:02:28
Speaker
And so the server is told by the controller, this is your topic, you consume from it. It may even be told
01:02:39
Speaker
Don't let the cluster tell you what partitions you get. You are consuming from these partitions. And that's a configuration difference. In fact, we'll just go there. There's the difference between the high level and the low level consumer. And the high level is, hey, server just takes stuff. Consumer group protocol, you'll get rebalanced. You'll get partitions. Nobody gets hurt. Low level is.
01:03:06
Speaker
we are gonna keep track as a cluster of the partition key and its values.
01:03:13
Speaker
And these partitions will be assigned to you; these partitions will be assigned to you; you only get to consume from these two partitions. And then that partition key, if it's a column in the table and it shows up in queries later on, now we've got a new way of pruning segments. Because we remember what we know: based on the segment that gets created, we know what partition it came from.
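A hand-wavy sketch of that pruning idea, using invented names rather than Pinot internals: if every segment records which partition it came from, and the producer's partitioning function is known, then an equality predicate on the partition column lets the broker skip every segment from the other partitions.

```python
def partition_of(key: str, num_partitions: int) -> int:
    # Stand-in partitioner; the real producer-side function (e.g. Kafka's) differs.
    return sum(ord(c) for c in key) % num_partitions

def prune_segments(segments, predicate_value, num_partitions):
    """Keep only segments whose recorded partition can contain the value."""
    target = partition_of(predicate_value, num_partitions)
    return [s for s in segments if s["partition"] == target]

segments = [
    {"name": "transactions_part0_seq42", "partition": 0},
    {"name": "transactions_part1_seq42", "partition": 1},
]
# A query like: SELECT ... WHERE merchant_id = 'acme'
# only needs to touch segments from one partition:
print(prune_segments(segments, "acme", num_partitions=2))
```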
01:03:48
Speaker
There's a lot of detail in there. I would like to just keep it hand-wavy at that: trust me, it's cool if you have that as a predicate. Now, the server is a Kafka consumer. Whichever way it works, it's getting messages. And as soon as it consumes a message, it puts it into an in-memory data structure called the consuming segment.
01:04:10
Speaker
Um, right. Right. Yes.
01:04:13
Speaker
And that consuming segment participates in queries just as if it were a segment on disk. And so that's the key. As soon as a message is successfully consumed from a topic, it can show up in a query result. So there's no additional latency there. It's in the consuming segment. We're done.
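To make that freshness claim concrete, here is a minimal sketch (illustrative pseudocode, not Pinot's query engine) of a server answering a query from its sealed segments plus the in-memory consuming segment, so a row is visible as soon as it has been consumed.

```python
class ConsumingSegment:
    """In-memory rows; real Pinot keeps indexed, columnar structures."""
    def __init__(self):
        self.rows = []

    def append(self, row):          # called for each message consumed from the topic
        self.rows.append(row)

def query_server(sealed_segments, consuming_segment, predicate):
    matches = []
    for segment in sealed_segments:                                  # immutable, on disk
        matches += [r for r in segment if predicate(r)]
    matches += [r for r in consuming_segment.rows if predicate(r)]   # freshly consumed rows
    return matches

live = ConsumingSegment()
live.append({"merchant": "acme", "amount": 12.5})
sealed = [[{"merchant": "acme", "amount": 3.0}]]
print(query_server(sealed, live, lambda r: r["merchant"] == "acme"))
```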
01:04:35
Speaker
Right, okay, that answers my question. Yeah, that's why it's not that terrible thing. And then it fills up, for whatever, you know, your configuration knobs that define what "fills up" means. Yeah. Now we create a segment, and there's a whole process where the server, uh...
01:04:53
Speaker
There are various ways of talking to the controller, and the controller is going to make decisions about what server that segment now lives on. Which, it might not live on the same server as the one that was gathering it up. It's possible for it not to. And we get into stuff here where I start getting fuzzy on the details of these things. Yeah, we'll save that for Pinot 202. Yeah, it's just way down in the weeds. It's just interesting that that can happen.
01:05:21
Speaker
And yeah, so you got the consuming segment, it participates in queries, and then it's just creating segments through a process that looks an awful lot like what a minion would do.
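And a similarly hand-wavy sketch of the "fills up" part: rows accumulate in the consuming segment, and once a configured threshold is hit, the server seals them into an immutable segment and hands it to the controller, which decides where it lives. The real segment commit protocol has more steps than this.

```python
FLUSH_THRESHOLD_ROWS = 3      # tiny for the example; millions of rows in practice

def ingest(messages, commit_to_controller):
    consuming = []
    for msg in messages:
        consuming.append(msg)                      # immediately visible to queries
        if len(consuming) >= FLUSH_THRESHOLD_ROWS:
            commit_to_controller(list(consuming))  # seal; controller decides placement
            consuming = []                         # start a fresh consuming segment
    return consuming                               # the still-open consuming segment

leftover = ingest(range(7), commit_to_controller=lambda seg: print("sealed:", seg))
print("still consuming:", leftover)
```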
01:05:31
Speaker
OK, so it's quite a different mechanism under the hood, but the end result is transparently the same: segments on disk, and this segment-like thing in memory that is a segment for all I can tell. And yeah, and again, if I've got a bunch of batch data sitting around that has the same schema as my Kafka topic, then I can create a hybrid table, where I've ingested that batch data and I keep ingesting new stuff from the Kafka topic, and
01:06:02
Speaker
Under the hood, there are actually two tables, but that gets abstracted away, and I just see one. Okay, yeah. There's so much more I can go into, but for time, I think I should probably pull back out slightly to use the space. There you go. Yeah, because I know there are some fun ways that people are using this, and I want to get some idea of how it's being used in the wild. Yes, a fun thing.
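Before the conversation moves on to how this gets used, a rough sketch of the hybrid-table idea just described: the broker keeps a time boundary, routes the older part of a query to the offline table and the newer part to the real-time table, and merges the results so the caller sees one table. The boundary logic here is simplified relative to real Pinot, and the names are illustrative.

```python
def query_hybrid(offline_rows, realtime_rows, time_boundary, predicate):
    older = [r for r in offline_rows if r["ts"] <= time_boundary and predicate(r)]
    newer = [r for r in realtime_rows if r["ts"] > time_boundary and predicate(r)]
    return older + newer                      # caller sees a single result set

offline = [{"ts": 100, "v": 1}, {"ts": 200, "v": 2}]       # batch-ingested history
realtime = [{"ts": 150, "v": 9}, {"ts": 300, "v": 3}]      # overlap resolved by the boundary
print(query_hybrid(offline, realtime, time_boundary=200, predicate=lambda r: True))
```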
01:06:31
Speaker
Christmas. Early in the season, last Christmas... I, you know, it's just... I want to say, you gave me your heart. I knew you... Yeah. Very next day. "I gave you my heart; the very next day, you gave it away." Um, you were supposed to save that for the Christmas special. There you go. But, you know, every day can be Christmas, really. Um,
01:06:54
Speaker
I believe it was Stripe that published a dashboard around American Thanksgiving, that weekend, fed by real-time data of transactions on their payment platform: live updating numbers. Now, that was to some degree, I think, advertising on Stripe's part. Like, hey, look, we do lots of stuff. And they've got a cool story; you think of mom-and-pop and small merchants.
01:07:22
Speaker
And it's great stuff to talk about, but they are Pinot users. And so all transactions through the Stripe platform live in Pinot.

Pinot's Application in Real-Time Analytics

01:07:34
Speaker
And so there was more to that story. There are a bunch of interesting pieces to that dashboard on the application side and all that. But with all that data in Pinot, well, it becomes fairly straightforward
01:07:49
Speaker
to build such a dashboard, because those queries are going to be cheap things that you just get to run and show to web users. And the application programmer there is genuinely expecting to run, each time it wants to display that... is it probably going to be running
01:08:08
Speaker
the same query? I mean, are you expecting the web server to run as many queries as there are users asking for the data?
01:08:23
Speaker
If you're asking about the shape of what it feels like as an application developer to use Pinot, my answer is yes. And I don't know in the case of that, since I'm talking about a specific thing that happened just a few months ago; there could be more details there. But me as a developer, I wouldn't be immediately thinking, oh, connection pools, extra caching layers. Exactly. Exactly. You should guard yourself against thinking "extra caching layer".
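For a feel of what "just run the query per request" means in practice, here is a sketch of a web handler issuing SQL straight to a Pinot broker over HTTP on every page view, with no pre-computation or caching tier in between. The POST /query/sql endpoint and the default broker port 8099 are as I recall them from the Pinot docs; the transactions table and its columns are invented for the example.

```python
import time
import requests

def dashboard_totals(broker="http://localhost:8099"):
    """Run one cheap aggregate query per page view, straight against the broker."""
    cutoff_ms = int(time.time() * 1000) - 60 * 60 * 1000    # last hour, in epoch millis
    sql = f"""
        SELECT merchant, SUM(amount) AS total
        FROM transactions
        WHERE ts > {cutoff_ms}
        GROUP BY merchant
        ORDER BY total DESC
        LIMIT 10
    """
    resp = requests.post(f"{broker}/query/sql", json={"sql": sql}, timeout=2)
    resp.raise_for_status()
    return resp.json()    # render directly; no caching layer, no pre-computed rollups
```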
01:08:48
Speaker
Now, caching is not bad. Caching rules everything around us. There you go. It's an absolutely necessary part of computation at every meaningful scale in our lives. I'm not trying to say it's somehow bad.
01:09:03
Speaker
When it comes to data, particularly analytical data, the answer used to be let me pre-compute this and put it somewhere I can read it. Let me run this job and fill up my key value store because that's the only thing that I can query with the latency, concurrency, and freshness requirements that I have to be user-facing.
01:09:24
Speaker
And now it's out of date and I have to pre-compute again. And if it's multi-dimensional, my life is terrible. Yeah, "cache" is too stretched a word for all this. Yeah, when you go to that "I'd better cache this" thinking, that thinking is an artifact of the way things used to be.
01:09:39
Speaker
And that's not how you want to use Pinot. You want to use Pinot like you'd use an application database. And I suppose there's been caching there as well, but you'd rather not, especially if you're actually reading and writing entities. I don't want the cached version of my account. I want to see what my actual account is right now.
01:10:01
Speaker
That's nice, because writing that caching code is never fun, and it always ends up like you've accidentally got a second kind of database. Yes, yes. And again, I don't want to be heard as giving some sort of anti-cache invective. It's a thing, but it's a very specialized kind of thing.
01:10:26
Speaker
You as an application developer, speaking to fellow application developers: you don't want to build that. That's going to be hard to get right. And so, you know, Pinot is trying to say that's not a thing you need to do anymore. We've got analytics. Just ask me the question. And yeah, there you go. No one builds caching because they want to. They build it because they have to.
01:10:49
Speaker
Right. Right. Unless they are specifically doing an "I want to build data infrastructure" kind of motion, which is great. I'm glad there are people who do that. And that's sort of what this episode is all about. But yeah, your job as an application developer is to not build data infrastructure. Yeah. Cool. So if I want to go and give Pinot a try,
01:11:12
Speaker
Where do I go, and what do I expect to happen to me next? There you go. You could go to dev.startree.ai. That's my favorite place for people to go. And there's a little linked quick start at the top. And the current form of this, as we record this episode, is a one-liner. It's a Docker image. It's the Pinot Docker image. And it's got some little built-in magical quick start things that
01:11:40
Speaker
simulate all the components being present and give you a little web interface. It's all kind of there. It's wonderful because it's one line and you can play with it. But you can't then modify the Docker Compose file and look at the data; it's all, you know, you press a button and the light comes on. So we're in the process of building, and by the time you hear this, or a few weeks later, it might be in the form of a proper repo that you clone, and a Docker Compose file, and data

Conclusion and Resources

01:12:09
Speaker
coming very soon as we talk about this. You'll be able to kind of see what it looks like. There's also pinot.apache.org and docs.apinot.apache... docs.pinot.apache... docs.apino... It's not Mario here, Tim. It's a modern tongue twister. The docs site; I'll put a link in the show notes. pinot.apache... Links in the show notes, I think, is the best way to put that, where you can read more. And StarTree's got a great YouTube channel.
01:12:39
Speaker
I've seen they've got a really handsome guy fronting that, doing great work, great voice. I love that guy. Yeah. No, there are other people on there in general too, but some great tutorials and kind of intros. And a lot of this isn't...
01:12:56
Speaker
I mean, how Pinot works is always of interest. If you are a technical person, you want to know that. But some of it is, you know, why is this not Snowflake? Are you stupid, and you're just trying to make a different Snowflake? You know, it's not. And there are some... we've talked about these concepts, but we've got some other little videos that just help drive those home. Like, why would you even want this thing? It's a different way of doing analytics, and one that I think is going to be a lot more important going forward. Cool. Well, we will find out.
01:13:25
Speaker
Tim Berglund, thank you very much for joining me. Thanks, Kris. Thank you, Tim. Tim ended there with a bunch of links, so I'll just remind you, you can find them all in the show notes, including a link to the episode I recorded with Bobby Calderwood that Tim mentioned. It's an episode all about event systems, so if you're into event systems, or you don't yet know what they are, check it out; it was a very interesting discussion.
01:13:51
Speaker
Before you click away to there or wherever you're headed, if you've enjoyed this week's discussion, please take a moment to click like, share, rate, or whatever feedback buttons your user interface currently offers you. And make sure you've clicked subscribe, because we'll be back next week with another discussion from the world of software. Until then, I've been your host, Kris Jenkins. This has been Developer Voices with Tim Berglund. Thanks for listening.