759: Full Encoder-Decoder Transformers Fully Explained, with Kirill Eremenko by @Jon Krohn · Zencastr

Oops!Zencastr was unable to start because Javascript is disabled
To fix this problem, check your browser's settings and enable Javascript

759: Full Encoder-Decoder Transformers Fully Explained, with Kirill Eremenko image

Super Data Science

759: Full Encoder-Decoder Transformers Fully Explained, with Kirill Eremenko

1 Plays

2 months ago

Encoders, cross attention and masking for LLMs: SuperDataScience Founder Kirill Eremenko returns to the SuperDataScience podcast, where he speaks with Jon Krohn about transformer architectures and why they are a new frontier for generative AI. If you’re interested in applying LLMs to your business portfolio, you’ll want to pay close attention to this episode! This episode is brought to you by Ready Tensor, where innovation meets reproducibility (https://www.readytensor.ai/), by Oracle NetSuite business software (netsuite.com/superdata), and by Intel and HPE Ezmeral Software Solutions (http://hpe.com/ezmeral/chatbots). Interested in sponsoring a SuperDataScience Podcast episode? Visit https://passionfroot.me/superdatascience for sponsorship information. In this episode you will learn: • How decoder-only transformers work [15:51] • How cross-attention works in transformers [41:05] • How encoders and decoders work together (an example) [52:46] • How encoder-only architectures excel at understanding natural language [1:20:34] • The importance of masking during self-attention [1:27:08] Additional materials: www.superdatascience.com/759

by

@superdatascience

Recommended

778: Mixtral 8x22B: SOTA Open-Source LLM Capabilities at a Fraction of the Compute image

0

778: Mixtral 8x22B: SOTA Open-Source LLM Capabilities at a Fraction of the Compute

Super Data Science

00:06:52·1 day ago

777: Generative AI in Practice, with Bernard Marr image

0

777: Generative AI in Practice, with Bernard Marr

Super Data Science

01:08:49·4 days ago

776: Deep Utopia: AI Could Solve All Human Problems in Our Lifetime image

0

776: Deep Utopia: AI Could Solve All Human Problems in Our Lifetime

Super Data Science

00:07:36·8 days ago

775: What will humans do when machines are vastly more intelligent? With Aleksa Gordić image

0

775: What will humans do when machines are vastly more intelligent? With Aleksa Gordić

Super Data Science

01:36:41·11 days ago

774: RFM-1 Gives Robots Human-like Reasoning and Conversation Abilities image

0

774: RFM-1 Gives Robots Human-like Reasoning and Conversation Abilities

Super Data Science

00:12:52·15 days ago

773: Deep Reinforcement Learning for Maximizing Profits, with Prof. Barrett Thomas image

0

773: Deep Reinforcement Learning for Maximizing Profits, with Prof. Barrett Thomas

Super Data Science

01:07:40·18 days ago

772: In Case You Missed It in March 2024 image

0

772: In Case You Missed It in March 2024

Super Data Science

00:24:00·22 days ago

771: Gradient Boosting: XGBoost, LightGBM and CatBoost, with Kirill Eremenko image

0

771: Gradient Boosting: XGBoost, LightGBM and CatBoost, with Kirill Eremenko

Super Data Science

01:59:00·25 days ago

770: The Neuroscientific Guide to Confidence image

0

770: The Neuroscientific Guide to Confidence

Super Data Science

00:45:22·29 days ago

769: Generative AI for Medicine, with Prof. Zack Lipton image

0

769: Generative AI for Medicine, with Prof. Zack Lipton

Super Data Science

01:49:12·1 month ago

768: Is Claude 3 Better than GPT-4? image

0

768: Is Claude 3 Better than GPT-4?

Super Data Science

00:12:55·1 month ago

767: Open-Source LLM Libraries and Techniques, with Dr. Sebastian Raschka image

0

767: Open-Source LLM Libraries and Techniques, with Dr. Sebastian Raschka

Super Data Science

01:48:12·1 month ago

766: Vonnegut's Player Piano (1952): An Eerie Novel on the Current AI Revolution image

0

766: Vonnegut's Player Piano (1952): An Eerie Novel on the Current AI Revolution

Super Data Science

00:08:13·1 month ago

765: NumPy, SciPy and the Economics of Open-Source, with Dr. Travis Oliphant image

0

765: NumPy, SciPy and the Economics of Open-Source, with Dr. Travis Oliphant

Super Data Science

01:37:29·1 month ago

764: The Top 10 Episodes of 2023 image

1

764: The Top 10 Episodes of 2023

Super Data Science

00:08:04·1 month ago

763: The Best A.I. Startup Opportunities, with venture capitalist Rudina Seseri image

0

763: The Best A.I. Startup Opportunities, with venture capitalist Rudina Seseri

Super Data Science

01:27:14·1 month ago

762: Gemini 1.5 Pro, the Million-Token-Context LLM image

0

762: Gemini 1.5 Pro, the Million-Token-Context LLM

Super Data Science

00:16:58·1 month ago

761: Gemini Ultra: How to Release an A.I. Product for Billions of Users, with Google's Lisa Cohen image

0

761: Gemini Ultra: How to Release an A.I. Product for Billions of Users, with Google's Lisa Cohen

Super Data Science

01:10:15·2 months ago

760: Humans Love A.I.-Crafted Beer image

0

760: Humans Love A.I.-Crafted Beer

Super Data Science

00:06:31·2 months ago

758: The Mamba Architecture: Superior to Transformers in LLMs image

1

758: The Mamba Architecture: Superior to Transformers in LLMs

Super Data Science

00:08:12·2 months ago