Why AI Hardware Spending Is Shifting From Training to Inference

Lucas and Luna unpack a major inflection point in AI infrastructure: the shift in hardware spending from training to inference. They examine why NVIDIA's 7.7 percent weekly drop, amid a broader AI hardware selloff, may signal that the market sees the training buildout peaking while inference workloads, and the chips optimized for them, become the next growth frontier.

The hosts walk through the economics: training a single large model can cost over $100 million, but inference (actually running that model millions of times for users) is where the recurring revenue lives. They cite Microsoft's 1.5 percent resilience this week as a sign that software platforms monetizing inference are outperforming pure hardware plays. The episode also explores how startups like Groq, d-Matrix, and Cerebras are challenging NVIDIA with inference-specialized chips, and why the hyperscalers (Amazon, Google, Microsoft) are designing their own inference silicon. A concrete look at why the AI chip narrative is shifting in mid-2026.

#AI #Inference #NVIDIA #AMD #Microsoft #AIHardware #TrainingVsInference #Groq #Cerebras #DMatrix #Hyperscalers #CustomSilicon #Semiconductors #Technology #AIInfrastructure #FexingoBusiness #BusinessPodcast #GenAI

Keep every episode free: buymeacoffee.com/fexingo
