Claude Code Cast

Episodes

Better Agent Tools Made Code Review Worse

Jul 14 2026

GitHub gave its code-review agent better tools and watched cost rise while useful findings fell. Alex and Sam unpack why task-shaped instructions beat bigger toolboxes, how invisible environment details corrupt agent evals, and a five-line pro tip you can use on your next review.

18 mins

Your AI Coding Benchmarks Are Lying To You

Jul 3 2026

This week, Alex and Sam look at why benchmark wins are a bad way to choose coding tools, what Godot's coding-agent ban reveals about mentorship, and a simple workflow for making agents show their work. If your team is still asking "which model scored highest?", this episode gives you a better test.

19 mins

The Tiny Local Model That Changes Your Agent Budget

Jun 26 2026

Small, local models are suddenly good enough for real agent chores, but the win is not replacing your smartest model. Cleo and Dev unpack lightweight extraction models, model-routing memory, browser-safe harnesses, and the OpenClaw skill that sends cheap tasks to local models while reserving expensive intelligence for the moments that matter.

18 mins

Your Coding Agent Needs a Bouncer Now

Jun 19 2026

AI coding agents are getting longer runs, more context, and more ways to touch production workflows, but this week made the real bottleneck obvious: authorization. Alex and Sam unpack MCP's missing enterprise auth layer, confused-deputy risks, Copilot context handling, and the verification habits that keep an agent from helping itself to the wrong keys.

19 mins

Verification Is Now Your Coding Agent Bottleneck

Jun 17 2026

Coding agents are getting better at long runs, but this week's news points at the real limit: proof. Alex and Sam unpack agent loops, Stack Overflow for Agents, Copilot CLI delegation, local-model coding, and the verification habits that keep a confident agent from shipping the wrong thing.

12 mins

Cursor's Tokenomics Reckoning Hits Every Coding Agent

Jun 5 2026

Coding agents are no longer just a workflow story; they are a cost, context, and control story. Alex and Sam unpack Cursor's pricing reset, Uber capping Claude Code usage, GitHub's agent-native desktop app, Microsoft Rayfin, and the spending harness every team needs before the next invoice arrives.

17 mins

The Agent Benchmark That Should Scare Managers

May 29 2026

Agentic coding tools are moving into enterprise workflows, but the week's most useful signal is a benchmark where frontier models still struggle below 50% on real IT tasks. Alex and Sam unpack Microsoft Learn grounding, agent deception, Copilot data leaks, and the practical harness every team should build before handing agents production authority.

19 mins

The Workflow Feature That Makes Agents Less Expensive

May 22 2026

Claude Code workflows, enterprise Codex deployments, and rising token costs all point to the same lesson: coding agents need operating systems, not just better prompts. Alex and Sam dig into /workflows, on-prem Codex, CI for agents, and the new decision fatigue of choosing where each task should run.

22 mins

