LLMs in Production cover art

LLMs in Production

Engineering AI Applications

Preview
Get this deal Try Premium Plus free
Offer ends December 16, 2025 11:59pm GMT.
Prime members: New to Audible? Get 2 free audiobooks during trial.
Just £0.99/mo for your first 3 months of Audible.
1 bestseller or new release per month—yours to keep.
Listen all you want to thousands of included audiobooks, podcasts, and Originals.
Auto-renews at £8.99/mo after 3 months. Cancel monthly.
Pick 1 audiobook a month from our unmatched collection - including bestsellers and new releases.
Listen all you want to thousands of included audiobooks, Originals, celeb exclusives, and podcasts.
Access exclusive sales and deals.
£8.99/month after 30 days. Renews automatically.

LLMs in Production

By: Christopher Brousseau, Matt Sharp
Narrated by: Christopher Kendrick
Get this deal Try Premium Plus free

£8.99/mo after 3 months. Cancel monthly. Offer ends December 16, 2025 11:59pm GMT.

£8.99/month after 30 days. Renews automatically. See here for eligibility.

Buy Now for £18.99

Buy Now for £18.99

Only £0.99 a month for the first 3 months. Pay £0.99 for the first 3 months, and £8.99/month thereafter. Renews automatically. Terms apply. Start my membership

About this listen

Unlock the potential of Generative AI with this Large Language Model production-ready playbook for seamless deployment, optimization, and scaling. This hands-on guide takes you beyond theory, offering expert strategies for integrating LLMs into real-world applications using retrieval-augmented generation (RAG), vector databases, PEFT, LoRA, and scalable inference architectures. Whether you're an ML engineer, data scientist, or MLOps practitioner, you’ll gain the technical know-how to operationalize LLMs efficiently, reduce compute costs, and ensure rock-solid reliability in production.

What You’ll Learn:

  • Master LLM Fundamentals – Understand tokenization, transformer architectures, and the evolution linguistics to the creation of foundation models.
  • RAG & Vector Databases – Augment model capabilities with real-time retrieval and memory-optimized embeddings.
  • Training vs Fine-tuning – Learn how to train your own model as well as cutting edge techniques like Distillation, RLHF, PEFT, LoRA, and QLoRA for cost-effective adaptation.
  • Prompt Engineering – Discover the quickly evolving world of prompt engineering and go beyond simple prompt and pray methods and learn how to implement structured outputs, complex workflows, and LLM agents.
  • Scaling & Cost Optimization – Deploy LLMs into your favorite cloud of choice, on commodity hardware, Kubernetes clusters, and edge devices.
  • Securing AI Workflows – Implement guardrails for hallucination mitigation, adversarial testing, and compliance monitoring.
  • MLOps for LLMs – Learn all about LLMOps, automate model lifecycle management, retraining pipelines, and continuous evaluation.

Hands-on Projects Include:

• Training a custom LLM from scratch – Build and optimize an industry-specific model.

• AI-Powered VSCode Extension – Use LLMs to enhance developer productivity with intelligent code completion.

• Deploying on Edge Devices – Run a lightweight LLM on a Raspberry Pi or Jetson Nano for real-world AI applications.

PLEASE NOTE: When you purchase this title, the accompanying PDF will be available in your Audible Library along with the audio.

©2024 Manning Publications (P)2025 Manning Publications
Computer Science Machine Theory & Artificial Intelligence Programming & Software Development Programming Languages Management Software Development

Listeners also enjoyed...

AI Engineering cover art
Agentic Artificial Intelligence cover art
AI Agents in Action cover art
Prompt Engineering for Generative AI cover art
Thinking in Systems cover art
Building Microservices cover art
Build a Large Language Model (From Scratch) cover art
Designing Data-Intensive Applications cover art
AI and Machine Learning for Coders cover art
Designing Machine Learning Systems cover art
Effective Software Testing cover art
Clean Code cover art
The Art of Prompt Engineering cover art
Grokking Simplicity: Taming Complex Software with Functional Thinking cover art
Software Architecture: The Hard Parts cover art
$100M Money Models cover art
All stars
Most relevant
Sort of makes a chunk of the book near impossible to follow. Any reference to a fig x. is lost as the listener can’t see this!

No PDF

Something went wrong. Please try again in a few minutes.