
Semantic Search: A Deep Dive Into Vector Databases (with Zain Hasan)

Developer Voices

As interesting and useful as LLMs (Large Language Models) are proving, they have a severe limitation: they only know about the information they were trained on. If you train it on a snapshot of the internet from 2023, it’ll think it’s 2023 forever. So what do you do if you want to teach it some new information, but don’t want to burn a million AWS credits to get there?

In exploring that answer, we dive deep into the world of semantic search, augmented LLMs, and exactly how vector databases bridge that gap from the old dog to the new tricks. Along the way we’ll go from an easy trick to teach ChatGPT some new information by hand, all the way down to how vector databases store documents by their meaning, and how they efficiently search through those meanings to give custom, relevant answers to your questions.

--

Zain on Twitter: https://twitter.com/zainhasan6
Zain on LinkedIn: https://www.linkedin.com/in/zainhas
Kris on Twitter: https://twitter.com/krisajenkins
Kris on LinkedIn: https://www.linkedin.com/in/krisjenkins/
HNSW Paper: https://arxiv.org/abs/1603.09320
ImageBind - One Embedding Space To Bind Them All (pdf): https://openaccess.thecvf.com/content/CVPR2023/papers/Girdhar_ImageBind_One_Embedding_Space_To_Bind_Them_All_CVPR_2023_paper.pdf
Weaviate: https://weaviate.io/
Source: https://github.com/weaviate/weaviate
Examples: https://github.com/weaviate/weaviate-examples
Community Links: https://forum.weaviate.io/ and https://weaviate.io/slack

--

#vectordb #vectordatabase #semanticsearch #openai #chatgpt #weaviate #knn

Transcript

Introduction to Vector and Semantic Search

00:00:00
Speaker
Vector search and semantic search are today's topics on Developer Voices.

Integrating Language Models with Datasets

00:00:05
Speaker
How do you take a large language model, which is all the rage at the moment, and teach it about your data set? What does it even mean to take a large language model and get it to search through the meaning of your code base or your documentation or your product catalog or whatever data you're dealing with?

Updating Language Models with New Knowledge

00:00:24
Speaker
How do you teach a computer to understand it?
00:00:27
Speaker
It seems really hard because large language models seem to come pre-baked from the factory as these trained, fixed things. How do you teach it new stuff?
00:00:38
Speaker
This is one episode where we get to go delightfully deep on how this all actually works. What's a large language model really doing? What does it need an auxiliary database for? And if it does, what's the flow of data back and forth between them? What is that auxiliary database actually doing? What's it doing in the pipeline, but also how is it doing it? What are the data structures? How does it organize data in memory on disk?
00:01:07
Speaker
What does it mean to search through meanings? How much work is it? How do you make it fast? How do you make it cheap enough to be used by your users? I'm interested in all of this stuff, and I really wanted to find someone who could take me right into the guts of what's going on here. And

Expert Insights with Zain Hasan

00:01:24
Speaker
I think I lucked out with today's guest, Zain Hasan. He has both the depth of knowledge to go right down into the internals,
00:01:32
Speaker
and the clarity of explanation to really bring it to life and make it make sense to me. And it really does, and I want to share that with you. We go all the way from clues about how you can improve your prompt engineering, through index design, computational complexity and optimization, to where Zain thinks the future of semantic search is headed.
00:01:55
Speaker
I can promise you, just from the first five minutes of this interview, you'll have a much better understanding of how this is all put together. So let's get going.

Vector Databases and Large Language Models

00:02:03
Speaker
I'm your host, Kris Jenkins. This is Developer Voices, and today's voice is Zain Hasan.
00:02:22
Speaker
Vector databases is the topic today, and we've got Zain Hasan as our expert to talk about it. How you doing, Zain? Hey everybody, I'm doing well. I'm doing well. Good, good. You've just got back from filming a course for LinkedIn. So you're now a Hollywood star, right? Yeah, exactly. Makeup and everything. In fact, I did my makeup before this, so. You're looking fabulous. So we should probably jump out of the unfamiliar world of Hollywood and straight into the tech as fast as we possibly can.
00:02:52
Speaker
We want to talk, I wanted you to tell me about vector databases in as much depth as I can extract from you. But we should probably start on the common understanding of the state of ML, right? I think everybody has ChatGPT as a reference point, right? Even if they haven't looked into this much.
00:03:14
Speaker
And I have this impression of ChatGPT as a pre-baked neural net, and you can ask questions of it and it can answer because neural nets are magic, but it's pre-baked, so I can't add any data to it. I have a vague sense that that's where needing a database comes in. Take me to a proper understanding.
00:03:38
Speaker
Yeah, so if we zoom out a bit and we think about what ChatGPT is: ChatGPT is a chatbot that's built on top of a base model. And the base model itself is just a sentence completion tool. You can almost think of it as a fill-in-the-blanks tool. So it's studied, essentially, pretty much all of the data on the internet to learn about
00:04:03
Speaker
the co-occurrence of words, and which words are more likely to be used with other words. So if you say something like, I don't know, "the monkey ate ___,"
00:04:17
Speaker
"banana" would be a higher-probability completion compared to "cabbage" or "car"; those would be very low-probability completions. And so on top of that, you can build chatbots. So you can actually force a base model to act like a chatbot if you just prime it correctly. So you can say,
00:04:36
Speaker
"Speaker 1 said XYZ. Chatbot answered. Speaker 1 asked question XYZ. Chatbot answered." And then, for the question that you actually want answered, you say, "Speaker 1 asked the question: what's the color of the sky?" and then "Chatbot:" and you leave it empty, and then it'll autocomplete based on the highest-probability tokens it has been trained on. ChatGPT is just a finer, refined version of this, where
00:05:03
Speaker
we control the quality of its generation by fine-tuning it on higher-quality data points that have been written by contractors. So you give contractors a question, and then they generate high-quality answers, and you train the base model to output those higher-quality answers.
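To make that priming trick concrete, here is a minimal sketch of what such a prompt might look like. The dialogue format itself is the "priming"; the wording is illustrative, not a prompt from the episode.

```python
# A hedged sketch: priming a raw completion model to behave like a chatbot.
# The model simply continues the text; the dialogue format is the priming.
prompt = (
    "Speaker 1: What is the capital of France?\n"
    "Chatbot: The capital of France is Paris.\n"
    "Speaker 1: What color is the sky?\n"
    "Chatbot:"  # left empty -- the model autocompletes the answer
)
```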
00:05:24
Speaker
All this to say, even if you take ChatGPT and all of its fine-tuned versions, the issue is that it doesn't know what it doesn't know.

Databases in Retrieval Augmented Generation

00:05:33
Speaker
It only knows what it's been trained on. So it knows the probability of words or it knows the concepts that it's been trained on. Where a database comes into this is
00:05:45
Speaker
If you want to provide external knowledge to the system, whether that's because the data wasn't available at training time, so it's future data that's occurred after the training process, or more realistically, it could be data that's private. So if I'm a company, I have proprietary data.
00:06:07
Speaker
And I want to use the reasoning power or the generative power of a large language model on this data. I'm not going to send this data over to a third party company that's going to train the large language model. That's my private data, but I still want to be able to reason over this data.
00:06:25
Speaker
And that's where the database comes in. You can run the database locally. That database can have your private data. And then you attach that database to the large language model as almost like a secondary
00:06:39
Speaker
bank of information or it represents state for the large language model, really. And it can then retrieve information in real time before it generates. So it gives it the ability to read context, ground its answer in that context, and then generate as a result of that. So that's the high level picture. A lot of people call this retrieval augmented generation because you're retrieving context and then you're augmenting the generation with that retrieved context.
00:07:09
Speaker
And you can use any database for this. A vector database naturally fits in, because as we'll talk about later, you can query a vector database using natural language. You can talk to a vector database. So it's easier to get a large language model to query the database and then retrieve context from it. So

Operation of Vector Databases

00:07:30
Speaker
that's the larger picture of how the database fits in. It's essentially just a knowledge store.
00:07:35
Speaker
OK, take me one level deeper on that. When I type in my query, what's the flow of data through the large language model and through the vector database? And how is it recombined to do something? Yeah. So retrieval augmented generation, you can implement it in multiple ways, and there's different complexities of it. But the simplest way that you can implement retrieval augmented generation is: think of going to ChatGPT and asking a question.
00:08:03
Speaker
you can take that question and you can turn it into a query for the vector database. So let's say your question is,
00:08:12
Speaker
what type of condiments go well with a hamburger, right? So you can take that question, and rather than send it off to the large language model immediately, you turn it into a query for a vector database. That query can itself be the question. You send it to the vector database, and now you ask the vector database, retrieve for me
00:08:34
Speaker
the five most relevant documents that I have in my knowledge store that are to do with this question. So in this question, I've got condiments, I've got a burger, I've got ham. So it's going to retrieve for me things that are related to those concepts.
00:08:50
Speaker
And then those five things that come back can then be stuffed into my prompt. And we can say, "Here's the question, and here's useful information that you might find relevant." And that's what you send off to the large language model to generate with. OK. So I could literally do this manually: open one window with ChatGPT, go and query for five web documents related to my search term, and then mash all of those into it and prompt, now answer my question.
00:09:16
Speaker
Yeah, exactly. In fact, before Retrieval Augmented Generation was popular, I was at a meetup here, and this was back in March of 2023, so ChatGPT was taking off and people were hearing about it.
00:09:31
Speaker
Back then, what a lot of people were doing was they would search over their PDF and they would say, okay, this chunk of text is what's relevant to this question. Type out the question, copy-paste the huge chunk of text, dump it into your chatbot, and then say, answer the question, here's relevant information you might need to know. That's basically what's happening, but the problem there is scalability. What if you have a billion documents or hundreds of millions of documents? And then
00:10:00
Speaker
You can't really do that, but that's where the vector database comes in. I see. I feel like you've just given away a magician's secret.
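As a rough sketch of that manual flow, the prompt-stuffing step might look like this in Python. Here `retrieve` is a hypothetical placeholder for whatever search returns the top-k documents; it is not a function from any library discussed in the episode.

```python
# A minimal sketch of the retrieval-augmented flow described above.
# `retrieve` is a hypothetical helper standing in for the vector-database
# query; it should return the k most relevant document strings.
def build_rag_prompt(question: str, retrieve, k: int = 5) -> str:
    context = retrieve(question, k)  # top-k documents from the knowledge store
    context_block = "\n".join(f"- {doc}" for doc in context)
    return (
        f"Here is some information you might find relevant:\n{context_block}\n\n"
        f"Using that context, answer the question: {question}"
    )
```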
00:10:14
Speaker
OK, so then I get how that works. We'll keep the LLM as our magic box of neural networks. Let's take a look at the vector database side. When I type in "which condiments go well with a hamburger", what exactly is happening to that sentence? Yeah, so this is a little bit of a look behind the scenes of why it's called a vector database. So anytime you query a vector database,
00:10:43
Speaker
a vector database understands vectors. So it understands groups of numbers. Now, whether those are groups of three numbers or a thousand numbers, that's determined by an underlying machine learning model. But the whole idea here is that if I type in "what condiments go well with a hamburger", that is a human-understandable version of a sentence, right? So I can ask people that and I can get coherent responses back. But if I ask that to a computer,
00:11:12
Speaker
It has no idea what these words mean. The only way that it understands meaning is if I capture this question or this sentence in numbers.
00:11:26
Speaker
or groups of numbers. So we want to give every word or every token here an ID. And then we want to analyze which words here from my training set commonly occur with other words and
00:11:43
Speaker
I need to be able to transfer the concepts in this human understandable version of a question to a computer or machine understandable version of a question. So essentially I need to go from this sentence to now a vector representation of that sentence.
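A minimal sketch of that step, assuming an off-the-shelf embedding library such as sentence-transformers. The model name here is an illustrative assumption, not one mentioned in the episode.

```python
# Sketch: turning sentences into vectors with an off-the-shelf embedding model.
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")  # 384-dimensional output
vecs = model.encode([
    "What condiments go well with a hamburger?",
    "Ketchup and mustard are classic burger toppings.",
])
print(vecs.shape)  # (2, 384) -- one float vector per sentence
```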
00:11:58
Speaker
So, is my intuition right here that you're going to end up with a vector that, very broadly, it's a vector of floating point numbers, and the first one is the probability that we're talking about cats, and the second is the probability that we're talking about fast food, and the third is the probability that we're talking about Germany.
00:12:19
Speaker
So the first part I think is correct, but the second part is not. When you capture the sentence as vectors, it's not clear what the individual dimensions are. It could be that the first one is the probability we're talking about cats, but we don't really know what the latent space represents. We don't know what each dimension actually means.
00:12:45
Speaker
So this is the whole point of optimizing a neural network. You initialize these weights randomly and then you say, I want you to correctly predict the next word and optimize all of these millions or billions of weights appropriately so that I can
00:13:03
Speaker
predict the next word better. And you get better and better as you optimize them. So we're not really sure whether the first dimension is, are we talking about cats? Second dimension, are we talking about Germany? It's whatever optimized the answer. It's whatever got us the lowest loss. Right. So the vectors are still sort of a black box, but I would expect two paragraphs that talk about burgers to have some similar subsets of vectors. Is it something like that?
00:13:33
Speaker
Exactly. I almost think of vectors as a barcode. Let's say you have a 100-dimensional vector. There's just 100 floating points in a NumPy array, let's say. And the higher the value,
00:13:48
Speaker
let's say it's more black, and the lower the value, let's say it's white. So now you can almost think of a 100-dimensional barcode, where you have a colored strip for every number. The higher the value, the darker the color; the lower the value, the whiter the color. So now you essentially have a barcode.
00:14:10
Speaker
So the barcode for a sentence that's talking about a burger is going to light up in different places, whereas a barcode for a sentence that's talking about a cat is going to light up in different places. And you can actually calculate these barcodes and say, OK, this is a sentence about a cat, this is a sentence about a dog, and this is a sentence about a burger. The cat and the dog barcodes would be a lot more similar than the burger and the cat or dog barcodes. Right.
00:14:39
Speaker
Yeah, and if you've got a barcode with all three of those lit up, you're in a really dodgy restaurant. Exactly. So that's the whole idea behind Vector Search. You're comparing these barcodes and you're saying, well, how close is this barcode to this other barcode? And the idea behind the barcode is that it's just some semantic, it's a capture of the semantics behind the human understandable version of the data.
00:15:03
Speaker
Okay, so I still need some kind of model that's going to turn my data, whether it's text or a PDF or an image, into a vector.
00:15:15
Speaker
Exactly. That's the key point here. The reason why they're called AI native or machine learning databases or AI first databases, there's a lot of buzzwords. But the reason why vector databases are affiliated with machine learning and they're kind of interwoven with machine learning is because they search over
00:15:36
Speaker
these vectors that are spit out and generated by ML models. And most of the time, they're neural networks. So that's why semantic search is also known as neural search sometimes.
00:15:46
Speaker
Right, okay. I understand up to that point. So what we're really talking about underneath all this, once you've finished neural-networking things, is a database that's good at storing vectors of floats and then searching for similarity between vectors of floats. Yeah. So how does that work?
00:16:13
Speaker
So on a high level, intuitively understanding it, it's what we talked about. If I give you
00:16:20
Speaker
a barcode for a question, your job is to say which barcode... let's say every single object you have, every file you have on your computer, can be captured into a barcode. So that includes text documents, but interestingly also images, audio files, video files. And we can
00:16:45
Speaker
talk about multimodality later, but let's just say we only have text documents for now. And

Challenges and Solutions in Vector Searches

00:16:49
Speaker
we have the ability to turn every text document into a vector or a barcode. Vector search or semantic search or neural search is effectively saying, if I have this question and I have the barcode for this question, what are the five most similar barcodes or vectors to this question vector or question barcode? Intuitively, that's what's happening.
00:17:13
Speaker
One level deeper, what's happening is, because you have quantified everything as a vector, you can actually take the distance between these vectors using multiple different metrics. The easiest one, let's say, is Euclidean distance, where you can actually measure the shortest distance between this vector and this vector.
00:17:34
Speaker
If you had three dimensional vectors or two dimensional vectors, you could actually plot them out on a grid and you could measure the shortest, the direct line from one vector to the other vector. Okay. I remember enough Pythagoras to calculate the distance between X, Y points. And I guess you just scale it up, right?
00:17:50
Speaker
Exactly. So for multidimensional vectors, you can just take every dimension, subtract the other corresponding dimension, and then you can put it into the Euclidean distance formula, and you get a measure of how different two vectors are. And that's just one distance metric. There's lots of other distance metrics that we can choose from, but that's the main idea. So now we can take this concept of a barcode, and we can quantify how different one barcode or one vector is from another vector.
00:18:18
Speaker
And essentially, what you're doing is quantifying how different the question vector is from every other vector that you have in your database. And then you're saying, now that I have a distance between this question and every other vector, I'm going to organize them from smallest distance to highest distance. And I'm going to cut off at the top K, let's say the top five. And these five objects are the ones that are the closest or the most semantically similar to what my user is interested in. And then you return those.
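That brute-force search is simple enough to sketch in a few lines of NumPy, which also makes the O(D·N) cost visible: one Euclidean distance per stored vector, then a sort.

```python
import numpy as np

# Brute-force k-nearest-neighbour search as described: one Euclidean
# distance per stored vector, then sort and keep the top k.
def knn(query: np.ndarray, vectors: np.ndarray, k: int = 5) -> np.ndarray:
    dists = np.linalg.norm(vectors - query, axis=1)  # distance to every vector
    return np.argsort(dists)[:k]                     # indices of the k closest

vectors = np.random.rand(10_000, 1_000)  # N = 10,000 docs, D = 1,000 dims
query = np.random.rand(1_000)
print(knn(query, vectors))
```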
00:18:46
Speaker
Okay. So I can almost literally give you the ballpark. Exactly. Like, I put your 10,000 documents into an imaginary space and I can say which other documents are near this one in space. Exactly, exactly. And actually your query is kind of the 10,001st document, isn't it?
00:19:05
Speaker
Yeah, so the query goes through the exact same pipeline because the query needs to be translated using the exact same machine learning model into vector space. And that vector space has to be the same vector space. Otherwise, it's like all of your documents are stored in German and then you're asking a question in English. And if the system doesn't understand both languages, you won't be able to extract relevant vectors.
00:19:30
Speaker
the vector language or the embedding space that we're talking about has to be generated by the same model. Okay, so I'm going to push you a bit deeper on that. So I can imagine if I actually want to use this in anger, maybe at work. I guess we're talking between 10,000 and 100,000 documents, that kind of order of magnitude, as soon as we get anything really interesting.
00:19:55
Speaker
I know enough from game programming that calculating collision detection on 100,000 objects is horribly slow. So how are you going to make this efficient calculating the distance between all these documents?
00:20:08
Speaker
Yeah, so that's the question that gets to the bread and butter of every vector database. If you think about vector search as I have some question vector and I have 10,000 object vectors or data points that I'm interested in and I want to retrieve the five most similar data points, you can't actually perform this k-nearest neighbors, this brute force k-nearest neighbors algorithm.
00:20:37
Speaker
Because, well, at a scale of 10,000 objects, you probably could. At 100,000 objects, now you're slowing down. At a million objects, you're slowing down even further. And the reason why is because let's say you have a question vector. And in order to find out which five objects it is closest to, you have to calculate the distance between this question vector and all 10,000 objects. And then you have to sort them from lowest distance to highest distance.
00:21:05
Speaker
So that is a complexity, a runtime complexity of N, where N is the number of objects that you've stored in here. So that's a big O of N, let's say. So as that N goes from 10,000 to 100,000,
00:21:20
Speaker
you get a 10x slowdown; as that goes from 100,000 to a million, you get another 10x slowdown. The other problem here is that, on top of that, the Euclidean distance calculation is also a function of how many dimensions the vectors have.
00:21:38
Speaker
You also have that component in the runtime. So if you have a vector of 10 dimensions, then you have a runtime of 10 times the number of objects. If now your vector is more interesting, if it's capturing a lot more
00:21:53
Speaker
a lot more features, if it's capturing more concepts, now it might be a thousand-dimensional. So now your runtime scales up by the dimensionality of each vector times the total number of vectors. So it's an M times N, right?
00:22:10
Speaker
Yes, exactly. So you get this kind of explosion where the more interesting data types you want to search over, you'll need bigger vectors. And the more of that type of data you have, you'll have more vectors in total. So now you're really slowing down. So what you need to do instead of brute force... Just to make that concrete before you go into how to optimize that, what sort of numbers are we typically talking about? What's an ordinary number of documents and an ordinary number of vector components?
00:22:38
Speaker
Yeah, so if you look at the dimensionality of vectors, commonly we get anything from 1,000 dimensional to 2,000 dimensional. Some of them are about 700 dimensional. But in that 1,000 dimensional ballpark, I would say is average. On the higher end, there's also models that generate 4,000 dimensional vectors. And a little note on this, I guess, as we have models that are
00:23:05
Speaker
multimodal, and that can capture all sorts of concepts, I imagine we'll have larger and larger dimensionalities. As a model needs to understand different concepts, not just text documents but also videos or images, the dimensionality will only grow. So we're starting off with 1,000 or 2,000, but 10,000 is not out of the question. Okay, in the near future.
00:23:28
Speaker
So that order of magnitude multiplied by what kind of document? The number of objects. So the number of objects usually, if we're talking about social media applications, if you're thinking about Facebook, Twitter, Instagram, you can have objects in the trillions easily.
00:23:47
Speaker
If you scale it down a little bit, let's say you're talking about recommender systems. Netflix, for example, has hundreds of millions of users. It has a catalog of, let's say, 20,000 unique TV shows, movies all around the world. So if you take that, you can easily get up to the billions or tens of billions of documents.
00:24:10
Speaker
Realistically, are a lot of people using this at that scale? So right now, a lot of people are trying proofs of concept. There are very few companies that are, at scale, moving into using vector databases.
00:24:22
Speaker
But we've tested with a billion documents officially. And then there's users that have tested it with even more. So I would say even three to four billion. And that's the upper end that we're talking about. And of course, that's changing day by day. But if I had to give you a ballpark, I would probably say 100 million to a billion. If you're on the lower end, probably 10, 20 million documents.
00:24:48
Speaker
OK, that's got me firmly convinced. Thousands multiplied by billions, that's firmly into the place where we need to optimize. So give me an idea: my fictional service has been very successful, and I've got 100 million users now. And I would like to say, which users buy stuff similar to Zain? Because I want to recommend stuff to Zain. How are you going to make that work fast?
00:25:18
Speaker
Yeah. So the trick here is not to do brute force K-nearest neighbor search. It's impossible to scale up when you have a runtime complexity of D dimensionality multiplied by N, where D is a thousand and N is a hundred million. There's no way you can scale that up.
00:25:38
Speaker
in real time. So imagine having an app: somebody clicks a product, and then you try to retrieve the 10 nearest objects to that; the person would be sitting there for days. So the idea is to do approximate nearest neighbors. And this is what all vector databases really do.
00:25:58
Speaker
And so the idea, at a high level, is that you want to give up accuracy for performance. So you're going to say: maybe I won't be able to retrieve the exact K nearest neighbors, sometimes I'll miss a few of the right neighbors, but I will gain a lot of performance by giving up a small amount of accuracy, a small amount of this recall of the right neighbors.
00:26:24
Speaker
So let's say you give up 5% or 10% of that recall. So 10% of the time, you might not get the correct nearest vectors; you might get an incorrect nearest vector. But you gain a lot for that 5% to 10% of recall. And of course, you can fine-tune that. You can say, well, I want 99% recall, and I'm willing to give up performance for that.
00:26:49
Speaker
But usually the trade-off is not direct. So you give up, let's say, 1% recall, but you gain a lot of performance. So then you can run in real time; you can run thousands of queries per second. And that's what approximate nearest neighbors does. It gives you the ability to increase performance by decreasing recall, decreasing accuracy. There are multiple algorithms that allow you to do this.
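One way to quantify that trade-off is recall@k: the fraction of the true k nearest neighbours that the approximate search actually returned. A minimal sketch:

```python
import numpy as np

# Measuring the recall trade-off described above: what fraction of the true
# k nearest neighbours did the approximate search actually return?
def recall_at_k(true_ids: np.ndarray, approx_ids: np.ndarray) -> float:
    return len(set(true_ids) & set(approx_ids)) / len(true_ids)

# e.g. the approximate index returned 4 of the true top-5 -> recall 0.8
print(recall_at_k(np.array([1, 2, 3, 4, 5]), np.array([1, 2, 3, 4, 9])))
```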
00:27:14
Speaker
But one is probably the most popular, one that supports all of the other functionality that databases also need. Are we saying, just on that accuracy point, is it like if you dropped me into a crowd and said, who are your 10 nearest neighbors? I probably wouldn't personally measure out exactly the 10 nearest, but I'd still get roughly the right ones. I mean, they wouldn't be wildly far away.
00:27:40
Speaker
Yeah. So essentially, what you're saying there is if you look in a vector space, if you think about it in vector space and you draw a bubble around your query vector, and you're saying, I want to draw a bubble that's large enough to encapsulate five nearest neighbors, in brute force search, you'll always get the nearest neighbors because you've kind of
00:28:02
Speaker
from beginning to end, calculated all the distances and sorted them. But here, you don't have the luxury of calculating all the distances. So you need to say, I better be smart about which distances I calculate. And then of the distances that I've calculated, I want to sort them out from lowest to highest.
00:28:19
Speaker
And because I haven't calculated all the distances out, because that would take too long, I might have missed the distances of some nearest neighbors. And so now I'm just overlooking that nearest neighbor, and I'm picking one that I think is the nearest. So yes, they'll be close together. Yes, they'll still be close, but they won't be the closest. That's the idea. But I'm not going to suddenly end up with someone on completely the other side of the room, for instance. No, no, no. So it's highly unlikely that you would,
00:28:47
Speaker
because you're still calculating distances, you're still organizing them to a degree. And

Hierarchical Navigable Small Worlds Model

00:28:52
Speaker
we'll talk about how the approximate nearest neighbors HNSW algorithm works in a second. But the main idea is that you get
00:29:01
Speaker
Let's say you have the nearest neighbor, but you fail to calculate that distance. Now, for all intents and purposes, this neighbor is invisible. So now you're going to say, what is the nearest neighbor that I did calculate? So it's not like you'll get all sorts of wonky distances. You're still going through the same process: sorting them based on distances in ascending order and picking the top five. But this top five might not have the correct five, just the five that you calculated the distances for.
00:29:27
Speaker
Right. So I might get five out of the top 10, for instance. Exactly. Yeah, exactly. OK. That convinces me to give up a little accuracy for a lot of performance. Yeah. So how does it work? So the performance that we're talking about here: you go from a runtime complexity of O of dimensionality times N to now a runtime complexity of logarithm of N. And that's a very, very scalable
00:29:55
Speaker
kind of an algorithm. And how this works is, essentially, intuitively, when you're searching for vectors that are close to your query vector, what you want to do is structure your search such that you make big jumps earlier on. So let's say you have your database of vectors. You come into a random object. And you say, how close is this object to my query?
00:30:25
Speaker
What you want to do is, initially when you come in, you don't want to have all of your vectors being searched over, because that would fall back down to brute force search. You want to structure it nicely. So one way that was proposed to do this is called the hierarchical navigable small worlds model, HNSW. And what this does is it takes your vectors and it makes a graph out of them,
00:30:53
Speaker
and it makes a hierarchy of them. And what it does is when you enter the search, you enter at a top level. And at the top level, there's only large distances that are available. So you take vectors that are very far from each other. So now you're almost like taking a highway from one vector to the other vector, and these vectors are quite far away. So you're saying of these vectors that are really far away,
00:31:20
Speaker
which one is closest to my query vector.
00:31:24
Speaker
So you're almost saying, I want to quickly localize which region or what type of thing my query vector is asking about. So rather than search for every other vector, I'm just going to say, out of these 500 vectors, and I might have, let's say, 10 million vectors, these 500 vectors are farther apart. They allow me to explore vector space efficiently. I want to find out which vector is closest to my query vector. And these are bigger jumps in vector space.
00:31:54
Speaker
I have a mental image that this is a bit like if you're looking for a house in a country, let's say in England, houses aren't uniformly distributed across England. They're clustered together in cities. So I start by indexing which is the nearest city. Exactly.
00:32:09
Speaker
So instead of saying, well, if you're interested in this house, let me show you the next house, the next house, the next house, the next house, instead of searching locally, exhaustively, you search in different neighborhoods and you say, okay, this is one house in this neighborhood. Are you interested in this? This is another house, this is another house. So you find...
00:32:25
Speaker
the global region that you're interested in, and then you dig down deeper within that global region. So you do a coarse search and then a more fine-grained search within that neighborhood, exactly. So the highest hierarchy here is going to perhaps show you, using your analogy, one house from every neighborhood.
00:32:46
Speaker
One of my university lecturers would be proud that I can now see how this starts to become log N. Because you're building a hierarchy that gets gradually more and more detailed. Is it multi-layered?
00:32:59
Speaker
Yes, so you start off at the highest level and then you go down levels, and you can have 15 levels, five levels, however many you want, but the highest level has an exponentially low number of data points. So you can almost think of starting off at the bottom level, and that level has all of your data points:
00:33:21
Speaker
every vector. Let's say you have a hundred million vectors: all of the vectors exist at the bottom level. And then as you go up one level, these data points start to drop off. So by the time you reach the highest level, you only have an exponentially decaying number of data points; they've only survived with a certain probability up there. So you have a very small number of data points to search over. And then as you drill down, you get more and more vectors.
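A heavily simplified sketch of the greedy move at the heart of this kind of search. Real HNSW stacks several proximity graphs (sparse at the top, dense at the bottom) and starts each layer's walk where the previous layer's walk finished; this toy shows just one such walk, under those stated assumptions.

```python
import numpy as np

# Toy greedy walk over a proximity graph: hop to whichever graph neighbour
# is closer to the query, and stop at a local minimum.
def greedy_walk(query, vectors, neighbours, start):
    """neighbours[i] lists the graph neighbours of vector i."""
    current = start
    while True:
        best = min([current, *neighbours[current]],
                   key=lambda i: np.linalg.norm(vectors[i] - query))
        if best == current:   # no neighbour is closer: local minimum reached
            return current
        current = best
```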
00:33:49
Speaker
Yeah, it's the exponential decay on the way up that gives you a logarithmic search on the way down. Yeah, exactly. Okay. Does that mean this is expensive to create the index? Are you then having to... Are you doing that search to pre-calculate all the clusters? Or is it smarter than that?
00:34:08
Speaker
No, so it is a lot more intensive to create the index than it is to search the index. Because searching the index is just logarithm of n. Because you come in with a query vector, and then you pick a random vector at your highest level, and you make these big jumps on which neighborhood are you interested in. And then depending on this neighborhood, let me show you more locality, more locality. You keep on adding vectors, and then you keep performing searches
00:34:37
Speaker
all the way down to the lowest level, where now you've got your nearest neighbors, and you return that. But building this index up is a lot more difficult. So usually, building an index can take anywhere up to hours, whereas with searching, you can perform thousands of searches per second. Okay, let me just try and get a sense of this. So
00:35:02
Speaker
if I've got a large index pre-calculated and I want to add something in, how expensive is that if I add one new document? So adding is relatively quick. So this is one of the plus points of the HNSW algorithm, and this is one of the reasons why Weaviate uses the HNSW algorithm. It's one of the few approximate nearest neighbors algorithms that not only is quick, is log of N,
00:35:30
Speaker
But it also supports insertion and deletion because every database needs CRUD operations. You need to be able to add data points and not have to reconstruct the entire index because then every update or every insertion would be hours long. Yeah. So you can simply add a data point.
00:35:48
Speaker
and that data point gets added at the bottom level because every data point exists at the bottom level. And then again, you have that exponentially decaying probability of whether or not that data point survives at the next level, at the next level, at the next level. So at some point that vector object is going to die off and it's only going to reach up to a certain level into your hierarchy, into your index. So that's all insertion takes.
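The survival rule is easy to sketch. The probability value here is an illustrative assumption, not Weaviate's actual default:

```python
import random

# Sketch of HNSW-style level assignment at insert time: every new vector goes
# on the bottom layer, and "survives" to each successive layer above with a
# fixed probability, producing the exponentially thinning hierarchy described.
def assign_level(p: float = 0.25) -> int:
    level = 0
    while random.random() < p:   # survive to the next layer up with prob. p
        level += 1
    return level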
00:36:11
Speaker
There's one other small detail which I'm going to ask you about. As there's less and less probability of a point surviving, are you saying at the top level we've maybe got, let's say, a dozen surviving points? Are they actual documents, and not, like, averaged centroids?
00:36:29
Speaker
No, no, they're actual documents. Okay. So we can talk about this, but there are algorithms that allow you to approximate vectors using centroids, so using product quantization or using k-means clustering. But just HNSW by itself doesn't average the vectors out. It uses the actual vectors to search over.
00:36:50
Speaker
Okay. I wonder if that means, if your index got sufficiently large, you could shortcut; you wouldn't have to go all the way down in the search. But I guess you're saying it's fast enough anyway. Yeah, because you have this log N... you mean cut it off at a certain point, before it reaches the bottom? Yeah, you get five levels down, and you say, well, that's good enough for now.
00:37:17
Speaker
Yeah, I think you could do that, but I'm not too sure. I guess you could, but you wouldn't need to bother. Yeah, because it's already real time. So even if you have hundreds of millions of documents, you can search hundreds of queries per second. So unless you're trying to push recall to a degree where now you're kind of falling back onto
00:37:41
Speaker
brute force search, then I think you would have to say, okay, maybe we don't go down to the bottom level. But then again, you would have the problem that recall would be hindered as a result of that, right? Okay. Give me an idea of the size of this index once you've built it. Because there's got to be huge compression from the original documents down to the vectors, but then is the index
00:38:10
Speaker
presumably the index is larger than the vector set, because it's got multiple copies at different hierarchies. Yeah. So, if you just look at one vector... if you have 10,000 documents, you'll have 10,000 vectors, but then the index has... yeah, this is a good question. I'm not sure if multiple copies of the vectors are actually stored, or you just say that this vector is also at this level and refer to it. But it also depends on how many vectors you have.
00:38:41
Speaker
And so, for example, when we tested this with the Sphere dataset, which has around 900 million objects, when we vectorized that and we created the index for it, I believe we had thousands of gigabytes. So the index was thousands, or...
00:39:02
Speaker
hundreds of gigabytes large. I can't remember the exact numbers, but we've written a blog on this that people can refer to. So the index was that large. And this is where the compression algorithms that I was talking about come into place because the vector database runs in memory. And so if your data set gets very large, it can get very expensive.
00:39:24
Speaker
So that raises the question, this is all on memory, but how is this stored on disk? Storing lots and lots of lists of floating point numbers, is this some kind of column thing on disk?
00:39:40
Speaker
So on disk... well, all the vectors, when they're indexed and you're searching over them, this is happening in memory. It's all in memory? It's all in memory. There are some algorithms which propose doing read-writes from disk, and in order to compress and reduce memory usage, that's what you would have to do. So we've recently announced this, where you can keep in memory
00:40:10
Speaker
compressed centroid representations of vectors, and then on disk, you store the full representation. So what you do is you do this coarse search over k-means centroid vectors, and then you say, these are my 100 closest vectors that I'm interested in. Now read them all in from disk and perform the finer search over the 100 correct vectors.
00:40:35
Speaker
So that's one way to balance between storing the whole index in memory versus storing a compressed version of the index in memory, using that to identify which centroids I'm interested in and then reading those vectors in and then re-performing or re-scoring those distances so you get back the recall.
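A sketch of that two-stage scheme, with plain k-means centroids standing in for the compressed in-memory representation. Real systems use schemes like product quantization and actual disk reads; this is a toy under those stated assumptions.

```python
import numpy as np

# Two-stage search: coarse ranking against compressed (centroid)
# representations held in memory, then exact re-scoring over the shortlist
# of full vectors (which a real system would fetch from disk).
def two_stage_search(query, centroids, assignments, full_vectors,
                     k=5, shortlist=100):
    # Stage 1: each vector is represented by its centroid.
    coarse = np.linalg.norm(centroids[assignments] - query, axis=1)
    candidates = np.argsort(coarse)[:shortlist]
    # Stage 2: exact distances over the candidates' full vectors.
    exact = np.linalg.norm(full_vectors[candidates] - query, axis=1)
    return candidates[np.argsort(exact)[:k]]
```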
00:40:56
Speaker
Yeah. And presumably there'd be hot pages in that, so you could keep the most important things still in memory. Yeah. Okay. So let's move on to another topic, which you've hinted at. One of the early things I remember seeing in this space was you can do cool things like, you take the vector for... very broadly, I'm hand-waving, you can fill in the details, but you take the vector for bridges.
00:41:24
Speaker
And then you take the vector for bridges in Germany and you take the vector for Germany and you subtract Germany from bridges in Germany and add in Mexico and you get documents about bridges in Mexico. And you've been doing this with like going from images to text to sound files. Tell me about that. Yeah. So this is this is a field that I'm really interested in, this idea of multimodality, because
00:41:55
Speaker
A lot of people are doing vector search over text documents right now, but I think the future is doing this vector search over multimodal data. Because the idea is that there's no reason why we should limit these neural networks to just text data. And in fact, before we had this revolution in natural language processing, we had a revolution in computer vision. The original paper that was published by Hinton in 2012 was on ImageNet.
00:42:23
Speaker
Then he used NVIDIA GPUs to train this AlexNet model that got state-of-the-art on solving ImageNet.
00:42:38
Speaker
These models can understand not just text, not just sentences, not just documents, but also images, video, which are just evolutions of images. They can also understand audio files. They can understand all sorts of data. So why limit the vector representation to just text? We're more comfortable with text, but I think now that people are getting used to vector search, I think the next thing that is going to be very popular is
00:43:08
Speaker
storing images in the form of vectors, storing videos, storing audio files in the form of vectors. And because the vector database, all it's doing is searching over vectors, it doesn't care whether the input was an audio, image, video. A vector is a vector. And I can just as easily build an HNSW index over a vector of audio files as I can a vector of text documents.
00:43:37
Speaker
So

Multimodal Models and Their Potential

00:43:37
Speaker
now the idea is that now that if we have models that can represent this multimodal data as vectors, we can search over them. It gets really cool when you have models that can capture not just one modality. So you might have a model that understands text and a separate model that understands audio, which is interesting because they allow you to perform, you know, text to text search, audio to audio search individually within modality.
00:44:07
Speaker
But what gets me really excited is when you have models that can understand multiple modalities, so a model that could potentially understand images and text, like CLIP from OpenAI, for example. Where now you can search for a concept like "cat" and you get back images of a cat.
00:44:28
Speaker
And there, you're not really matching words with images in terms of metadata, but rather you're matching the vector of the word with the vector of the image. And those two vectors happen to have a lower distance between them, and that's why they get retrieved as nearest neighbors.
00:44:47
Speaker
Right, so they both get indexed into roughly the same space. And presumably if you index lots of these different things, any text query would land you in a place that was surrounded by documents and images, waveforms, videos. But you said earlier it's like,
00:45:08
Speaker
one model will encode things into English and another model encodes them into German, and you can't just mix and match vectors. Yeah. So do I need a neural net that's been trained to do all these things? Yeah, so that's a great question. Right now there are two kinds of
00:45:25
Speaker
developments in this field of multimodal neural network modeling. One development is more practical, and this goes in line with the ones that we've integrated so far. So if you look at CLIP from OpenAI, or if you look at ImageBind from Meta, which was released earlier this year, these types of multimodal models... so CLIP, for example, understands images and text.
00:45:50
Speaker
And ImageBind understands text, images, audio, video. And there's a couple of other modalities which are not as interesting, but those are the four main modalities that it understands. This model, ImageBind, is actually six independent models. So there's one model that's a specialist at identifying images, another one that's a specialist at identifying words. OK.
00:46:18
Speaker
And so you've got six separate models. And now the problem is what you said: each one of these models speaks its own vector language. So if I take the image of a cat, I'm going to get a barcode, and that barcode can be very different from the barcode of the word cat itself.
00:46:36
Speaker
Now the trick is: how do I unify the vector spaces such that the barcodes for the image of a cat, the word cat, maybe the video of a cat, and the cat meowing are all landing in the same kind of approximate vector space? And that is the trick.
00:47:01
Speaker
Yeah, so what they do in the ImageBind paper is they use contrastive learning. So they say, here is one data point... and they do this across modalities. So let's talk a little bit about within-modality contrastive learning first. For example, if I take five images of dogs, and then I have five images of other classes, what I can do is pass these through the same model.
00:47:27
Speaker
And I get vector representations for five dogs and five other classes. So now I can say in my vector space, I'd like the data point for dog to be sufficiently close to the data points for other dogs, but far away from the negative examples, far away from the non-dog data points.
00:47:46
Speaker
So through this contrastive training, I can actually push and pull data points in vector space. So once I've generated them, then I can kind of push and pull to my liking, based on these positive and negative examples. Right. So, hang on. How does that actually translate into vectors? And how does that help?
00:48:11
Speaker
Yeah, so for example, let's say you have a model that understands images. You're going to pass your images through and the model generates the vectors. So one way that you can train this model is just to classify. Let's say you have a model that takes in images and its job is to output a probability per class.
00:48:32
Speaker
And you train this classifier and then you take some slice, some representation of weights in between before it gets to the classification part. And that becomes your vector representation. Okay. So now I'm sort of doing a centroid for dogs. Is that the solution?
00:48:51
Speaker
Yes, so the idea is that if it's trained to correctly classify, to distinguish between the different classes, it has to be able to identify what's unique about dogs, what's different about a dog versus a cat versus a cake.
00:49:10
Speaker
But it has to nudge the weights such that it can identify dogs from cakes from cats. And in doing so, it identifies or it localizes different parts of your training set in different parts of vector space. And then you take that part of the vector space and you say, now these are my embeddings. This is my vector representation for what a dog is. And I'm going to use that with my vector database.
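A toy sketch of that idea in PyTorch; the architecture is illustrative, not a model from the episode. The classifier head is used only during training, and the penultimate activations become the embeddings.

```python
import torch
from torch import nn

# Train an image classifier, then reuse its penultimate activations as the
# embedding that goes into the vector database.
backbone = nn.Sequential(nn.Flatten(), nn.Linear(28 * 28, 512), nn.ReLU(),
                         nn.Linear(512, 128), nn.ReLU())
classifier = nn.Linear(128, 10)         # trained jointly with the backbone

images = torch.randn(4, 1, 28, 28)      # a toy batch of images
embeddings = backbone(images)           # 128-d vectors for the vector DB
logits = classifier(embeddings)         # only needed while training
```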
00:49:34
Speaker
And so if your model can give you this, you can then go forward with this idea of contrastive learning that I talked about. OK. So I'm still not entirely getting how that takes me between different modalities. Yeah. So that's the trick where, let's say you have a model that you've trained independently to identify visual features. It can take in images. You've got another one that understands text and another one that understands audio.
00:50:03
Speaker
If these are all generating vectors of the same dimensionality, or you've kind of modified them to generate vectors of the same dimensionality, now you're getting an image representation of 500 dimensions, a word representation of 500 dimensions. Now what you want to do is, say you take the image representation of a dog and you take its vector. So you get 500 numbers, you get that barcode. What you want to do is then
00:50:28
Speaker
train it further to say this vector representation of the image should be close to the vector representation of the word dog, of a dog barking, of the video of the dog. So you pull together the vectors across modalities in this multidimensional vector space and you push apart the concepts that shouldn't be close together.
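A toy version of that cross-modal contrastive objective, in the style of CLIP's loss: matched image/text pairs in a batch are pulled together, and mismatched pairs are pushed apart. A sketch, not the exact ImageBind training code.

```python
import torch
import torch.nn.functional as F

# CLIP-style contrastive loss: row i of img_vecs is the matching pair of
# row i of txt_vecs; every other pairing in the batch is a negative.
def contrastive_loss(img_vecs, txt_vecs, temperature=0.07):
    img = F.normalize(img_vecs, dim=1)
    txt = F.normalize(txt_vecs, dim=1)
    logits = img @ txt.T / temperature      # pairwise cosine similarities
    targets = torch.arange(len(img))        # the diagonal pairs match
    return (F.cross_entropy(logits, targets) +
            F.cross_entropy(logits.T, targets)) / 2
```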
00:50:50
Speaker
Right, that's the part I'm missing. When you get those five dogs, there are actually five different modalities of dogs. Exactly. So now you've got a concept. You know how babies, when you explain a new concept to them, they're not just taking in that concept like a machine learning model would.
00:51:09
Speaker
they understand the concept across all these modalities. So when you explain to them, that's a dog, they see the dog, they can hear the dog, they can kind of see the fur wave, so they understand the motion dynamics of what happens when the air interacts with the dog. So they understand all of this. They might even get smell.
00:51:27
Speaker
Yeah. So now you've got this multi-dimensional vector space where you can pass in a query and then you can retrieve the nearest neighbors across multiple modalities. And so that's what this image bind model is doing. So you're kind of forming a gestalt of all the different vector types. Exactly. Yeah. Okay.
00:51:48
Speaker
And so this is one approach, and this is the practical approach. So the last module that we released with Weaviate was the ImageBind one, a multi-to-vec ImageBind module, where you can take any type of multimedia that the model understands, it translates from that multimedia type to a vector, and then you can search within this vector space.
00:52:08
Speaker
And you can do some really cool things with this. But the other approach here that I haven't seen work as well, I don't know if it's as advanced as this approach of training one model per data type and then fusing them, unifying the vector spaces, is to, like you said, take a model and just train it from scratch to understand all of these modalities. So there's no different model. It's just one model that understands everything.
00:52:38
Speaker
and it has to learn to differentiate and then group itself. That seems like it would be a lot more difficult.
00:52:45
Speaker
It is a lot more difficult, but it is a lot more scalable because now, think about what would happen earlier. Let's say you want to encode 25 modalities, you have 25 models that you're training, and then you have to fuse them. How do you fuse them? Well, you need to hand curate. These are positive examples, these are negative examples. Pull these closer to this representation, push these concepts away.
00:53:09
Speaker
It doesn't scale. Whereas this one, because one model understands it, you don't need to unify. The model itself in the optimization process unifies for you. But it is a more difficult training process for sure, which is why I don't think we have practical implementations of these second types of multimodal models. There are proposals, but I haven't seen one that works as well as this image bind model from Meta.
00:53:35
Speaker
So this is slightly ahead of the cutting edge. That's the future you're hoping for. Exactly. But I think this is the future, this type of one model that understands all of these modalities, if not all the important modalities like audio, video, image, and text. I think if we can get a model that understands these types of modalities, I think they will function better. But for now, we have this kind of duct tape solution of six different models, unify them so that they each speak the same language, and then now you can perform
00:54:03
Speaker
Very interesting multimodal search, cross-modal search. Given words, you can search for audio files, video files, images. Super cool. Okay. That does sound very cool. So people watching us on YouTube, we'll be able to see there's a lovely painting behind you. And we have a future where I can search for...
00:54:25
Speaker
medieval poetry that's closest to that painting. Exactly. Well, not a future. I mean, you can do it now. So we have a we have a module with Weavie and we've we've actually created a demo of this where we see the database with audio files, with video files, images and.
00:54:45
Speaker
then we can search over it with any of these modalities. So you could say, today, I'm feeling like this song. That could be your query. And you say, here are some paintings that are close to that query, or here are some videos that are close to that query. And you can do that right now. This kind of cross-modal search is completely possible.
00:55:04
Speaker
That seems like the most mad fun I could have since Markov chains mashing together heavy metal lyrics and Alice in Wonderland. Yeah, it's amazing. We're actually having multiple hackathons around this concept of multimodality because once you give people this machine learning model that can understand all these types of data and the ability to scale up to
00:55:28
Speaker
millions or hundreds of millions of these data types, then really your imagination is the limiting factor. What can you put together? Are you going to search for audio over audio, like a Shazam type of application? You can do all sorts of interesting things with this. Okay, this sounds like fun. So to wrap up then, why don't you give me the very high-level overview of how I could do this at home?
00:55:53
Speaker
And I'll allow you a quick plug of Weaviate for this. All my demos are open source. So if you go to Weaviate Tutorials, that repository has a bunch of these implemented. If you go to Weaviate Examples, there's a bunch of implementations of this as well, starting off with basics of how do you search over text documents,
00:56:18
Speaker
all the way up to how do I search over these multimodal documents as well. So you can look at examples there. Also, we're implementing newer and newer modules pretty much every month. So if we see a really new, cool model that we think would be helpful in vectorizing data, like this multimodal image bind one from Meta, we integrate it so that if you take your data, you can simply just point us towards that data.
00:56:45
Speaker
You choose which module and which model we're going to use to vectorize it, and then we store all the vectors. And then you can come along and say, this is a new file, this is an image or an audio file. We handle all of the vectorization: all of the going from data point or data object, to passing it through the model, generating the vector, and then searching over it.
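As a hedged sketch, a semantic query through the older, v3-style Weaviate Python client looks roughly like this. Method names differ between client versions (a newer client is mentioned below), and the class and property names here are illustrative assumptions.

```python
import weaviate

# Connect to a locally running Weaviate instance (v3-style client).
client = weaviate.Client("http://localhost:8080")

# Ask for the 5 objects semantically closest to a natural-language query.
result = (
    client.query
    .get("Document", ["content"])
    .with_near_text({"concepts": ["condiments that go well with a hamburger"]})
    .with_limit(5)
    .do()
)
print(result)
```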
00:57:06
Speaker
So in terms of how many lines of code, it's relatively simple. I think it's only 50 lines of code to create this multimodal search engine, and there's an example of this that you can check out as well. Outside of this, if you're interested, I would join the Slack community. We've got a forum as well, if you have questions when you're playing around with this. So yeah, that's where I would look.
00:57:33
Speaker
We also love getting feedback from the community. So we've recently announced a new Python client. So if you do try to make this multimodal search engine, I would recommend you use the new Python client. And if you have feedback, tell us about it. If you think that we can improve things, definitely reach out with that. And yeah, we'd love to see what people build.
00:57:55
Speaker
Cool. I'll get some links from you offline and we'll stick those in the show notes. Yeah, for sure. And so the last question I wanted to ask you, really brass tacks, is Weaviate. It's one of these models where it's an open source database, plus you can pay us to host it for you. Exactly. That's the very high-level view. Which parts are open source, and what's the license? Just so I know.
00:58:20
Speaker
Yeah, so it's fully open source, and the same source code that paid customers get is the exact same code that you can access on GitHub for Weaviate.
00:58:36
Speaker
The base model, the base code is exactly the same. If you deploy it locally on your computer, it's exactly the same. If you deploy it into your cloud, it's exactly the same. Essentially, what we charge for is managed instances of Weaviate. So if you deploy Weaviate on WCS, Weaviate Cloud Services, then we charge for that.
00:58:57
Speaker
or if we manage your database in your own cloud environment, then we charge for that. So that's what we're charging for, but the code itself is exactly the same. So if you have the capabilities to run the database, then you get the exact same functionality and performance. And also we're improving it daily. So it's not that
00:59:16
Speaker
one version's performance is better than the other. There is only one version of Weaviate; the free and the paid version are exactly the same. Cool. I like that business model. It's like, you can use exactly the same thing at home until it becomes a headache to manage, and then we'll do it for you at a price. Yeah, that's fine. Exactly. Okay. Well, thank you very much. I finally feel like I have an understanding of how the computer science under the hood works.
00:59:39
Speaker
Awesome. Yeah. So this was very interesting because as a data scientist, I understood the k-nearest neighbors and the brute force approach. And then I was wondering myself how this would scale up. And so I did a deep dive. It's been almost a year since I joined the company. But it was very interesting to get an engineering understanding of how this is implemented, which is very new to me. So all of the database implementations and how you do the approximate nearest neighbors, learning all of that was very fun coming from a data science background.
01:00:08
Speaker
Yeah, the technology is moving so fast that not only is the machine learning component moving fast, but the engineering stuff is also moving fast. So we're pushing out updates monthly around upgrading the performance, upgrading the features that you have within the database as well. And then the machine learning world is moving so fast that we're also integrating multiple modules so that you can represent data better and better in these vector formats as well.
01:00:37
Speaker
Well, with the world moving so fast, I should probably leave you to get back to keeping up. Awesome. Zain, thank you very much for joining us. It was great fun. Thank you, Kris. Zain, thank you very much.

Episode Wrap-Up and Exploration Encouragement

01:00:49
Speaker
I've been playing with those ideas for a few months now, but that's the first time I got a sense that I can trace the bytes from the keyboard through the data structures all the way out into the screen again. That sense that you can see how the data is processed every step.
01:01:06
Speaker
And I love that feeling. I love getting a new architecture inside my head, feeling a certain sense of mastery over it, you know? Hopefully, listening at home, you've had a similar sense of revelatory clarity. I certainly hope you have. If you have, I'll just simply take a moment to remind you that the like and rate and share buttons are there waiting for you in your app.
01:01:33
Speaker
And we will be back again next week with another look into the world of programming through another developer's voice. So you might want to click subscribe and notify to make sure you catch it. And until next week, I've been your host, Kris Jenkins. This has been Developer Voices with Zain Hasan. Thanks for listening.