EP21 - Large Language Models and The Power of Scale cover art

EP21 - Large Language Models and The Power of Scale

EP21 - Large Language Models and The Power of Scale

Listen for free

View show details

About this listen

This episode moves from the Transformer architecture to the models that define our era: Large Language Models (LLMs). We explore how the simple act of "next-word prediction," when combined with internet-scale data and massive compute, leads to the surprising "emergent abilities" of models like GPT-4, and we break down the crucial training paradigm of pre-training and fine-tuning.
No reviews yet