• How Data Scientists Use Neural Radiance Fields for 3D Reconstruction
    Jul 1 2026
    Lucas and Luna dive into Neural Radiance Fields (NeRFs), a technique that has reshaped 3D reconstruction from 2D images. They walk through how NeRFs work at a high level—converting sparse photographs into continuous volumetric scene representations—and why this matters for industries like autonomous driving, cultural heritage preservation, and virtual production. The episode anchors on a concrete example: how the Google Research team originally trained a NeRF on 100 images of a single scene to synthesize novel views with photorealistic quality, and how recent advances like Instant NGP have cut training time from hours to seconds. Lucas explains the key algorithmic steps: ray marching through a neural network that outputs color and density per point, then volumetric rendering to produce a pixel value. Luna questions where the bottleneck remains (data capture, not compute) and probes the real-world trade-off between quality and speed. The conversation stays grounded in tools and techniques data scientists actually use—no math beyond a brief mention of positional encoding—and closes by asking what happens when NeRFs meet generative AI for full scene editing. #NeuralRadianceFields #NeRF #3DReconstruction #ComputerVision #DeepLearning #InstantNGP #VolumetricRendering #RayMarching #GoogleResearch #PositionalEncoding #AutonomousDriving #VirtualProduction #CulturalHeritage #DataScience #Technology #FexingoBusiness #BusinessPodcast #MachineLearning Keep every episode free: buymeacoffee.com/fexingo
    Show More Show Less
    11 mins
  • How Data Scientists Use Diffusion Models for Image Generation
    Jul 1 2026
    In this episode of The Data Science Podcast, Lucas and Luna explore how data scientists are using diffusion models — the technology behind tools like DALL-E and Stable Diffusion — for image generation. They break down the core idea of gradually denoising random pixels into coherent images, discuss training and inference costs, and contrast diffusion models with GANs and autoregressive models. Using a concrete example from a mid-size e-commerce company that used a fine-tuned diffusion model to generate product images in underrepresented categories, they walk through the practical pipeline: dataset preparation, conditioning on text prompts, and handling hallucination artifacts. Lucas explains why diffusion models have become the dominant paradigm in generative image AI since 2022, and Luna questions whether the compute cost will limit adoption for smaller teams. They also touch on ethical considerations around deepfakes and copyright. The episode is grounded in real numbers: training a latent diffusion model from scratch can cost upwards of $600,000 in compute, but fine-tuning an existing open-source model can be done for under $5,000. #DiffusionModels #ImageGeneration #GenerativeAI #DeepLearning #StableDiffusion #DALLE #ComputerVision #MachineLearning #Technology #DataScience #AIEthics #ComputeCost #FineTuning #TextToImage #DenoisingDiffusion #LatentDiffusion #FexingoBusiness #BusinessPodcast Keep every episode free: buymeacoffee.com/fexingo
    Show More Show Less
    9 mins
  • How Data Scientists Use Transfer Learning for Few-Shot Image Classification
    Jun 30 2026
    In this episode, Lucas and Luna explore how data scientists apply transfer learning to solve image classification problems with very little labeled data. They break down the concrete steps: taking a pre-trained model like ResNet-50 trained on ImageNet's 14 million images, freezing early layers, fine-tuning later layers on a new task with as few as 50 images per class. Lucas shares a case study from a medical startup that used this approach to classify skin lesions from dermoscopic images with 94% accuracy using only 200 labeled samples. The hosts discuss practical gotchas including domain mismatch, learning rate selection, and the trade-off between freezing and fine-tuning. If today's conversation gave you a concrete technique you can use, consider supporting the show at buy me a coffee dot com slash fexingo. #TransferLearning #FewShotLearning #ImageClassification #DeepLearning #ResNet #ImageNet #FineTuning #FeatureExtraction #MedicalImaging #Dermatology #DomainAdaptation #PreTrainedModels #DataScience #MachineLearning #Technology #FexingoBusiness #BusinessPodcast #AI Keep every episode free: buymeacoffee.com/fexingo
    Show More Show Less
    7 mins
  • How Data Scientists Use Bayesian A-B Testing in Marketing
    Jun 30 2026
    Lucas and Luna dive into Bayesian A/B testing—why it's replacing frequentist methods for marketing experiments. They walk through a real case from a mid-size e-commerce company that used Bayesian inference to compare email subject lines, reaching a decision in half the time with clearer probability statements. The episode covers the core difference: instead of a p-value, you get a direct probability that version A beats version B. They explain how priors work, the role of Monte Carlo simulation, and a concrete example where a company saved thousands by avoiding a false negative. By the end, you'll understand why more data teams are adopting Bayesian methods for faster, more intuitive decision-making—and how to explain it to stakeholders without the math. #BayesianA/BTesting #DataScience #MachineLearning #MarketingAnalytics #Experimentation #ABTesting #BayesianInference #MonteCarlo #Priors #PosteriorProbability #Ecommerce #EmailMarketing #StatisticalMethods #DataDrivenMarketing #ConversionRate #Frequentist #Technology #FexingoBusiness #BusinessPodcast Keep every episode free: buymeacoffee.com/fexingo
    Show More Show Less
    9 mins
  • How Data Scientists Use Federated Learning for Privacy
    Jun 29 2026
    Federated learning is reshaping how organisations train machine learning models on sensitive data without ever centralising it. In this episode, Lucas and Luna break down a real-world case: how a consortium of six European hospitals used federated learning to train a diagnostic model for rare paediatric cancers — achieving accuracy comparable to a centralised model while keeping each hospital's patient data behind its own firewall. They walk through the technical architecture: the role of a coordination server, how model updates are aggregated using FedAvg, and what happens when non-IID data distributions cause client drift. Luna pushes back on the communication cost argument, and Lucas explains how compression techniques and asynchronous updates are making federated learning practical at scale. They also touch on the regulatory angle — why GDPR and HIPAA are driving adoption faster than any technical breakthrough. Whether you're a data scientist evaluating privacy-preserving ML or just curious how Apple trains Siri without reading your keystrokes, this episode gives you the concrete mechanics behind a paradigm shift in distributed machine learning. #FederatedLearning #PrivacyPreservingML #DataScience #Technology #HealthcareAI #GDPR #HIPAA #FedAvg #FexingoBusiness #BusinessPodcast #MachineLearning #DistributedLearning #ModelAggregation #NonIIDData #ClientDrift #Siri #Apple #RareCancerDiagnosis Keep every episode free: buymeacoffee.com/fexingo
    Show More Show Less
    9 mins
  • How Data Scientists Use Shapley Values for Model Interpretability
    Jun 29 2026
    Episode 80 of The Data Science Podcast dives into Shapley values — a game-theoretic approach to explaining model predictions. Lucas walks through the core intuition: how Shapley values fairly distribute prediction contributions among features, using a concrete example from a credit approval model. Luna asks about the practical trade-offs, including computational cost with high-dimensional data. The hosts discuss real-world usage at a mid-sized fintech lender that reduced model risk by 30 percent after implementing Shapley-based explanations. They also touch on open-source libraries like SHAP and its Python implementation. The episode avoids dry math in favor of conceptual clarity, making it accessible to data scientists and business analysts alike. By the end, listeners understand why Shapley values are becoming the gold standard for regulatory compliance and stakeholder trust. #ShapleyValues #ModelInterpretability #ExplainableAI #XAI #GameTheory #SHAP #FeatureImportance #CreditModeling #Fintech #DataScience #MachineLearning #ModelRisk #Python #OpenSource #Interpretability #Technology #FexingoBusiness #BusinessPodcast Keep every episode free: buymeacoffee.com/fexingo
    Show More Show Less
    9 mins
  • How Data Scientists Use Synthetic Control for Causal Impact
    Jun 28 2026
    Episode 79 of The Data Science Podcast explores synthetic control — a causal inference method that estimates what would have happened to a treated unit if the intervention never occurred. Lucas and Luna break down a real-world case: how a ride-hailing company used synthetic control to measure the impact of a surge-pricing algorithm change on driver supply in Austin, Texas. They walk through building a synthetic control from a weighted combination of similar cities, interpreting the gap between actual and synthetic outcomes, and running placebo tests to assess statistical significance. The hosts also discuss when to choose synthetic control over difference-in-differences, the importance of having a strong donor pool, and how this method is gaining traction in policy evaluation and A/B testing for large-scale platform changes. No clickbait, just a practical, concrete guide to a powerful causal technique. #SyntheticControl #CausalInference #DataScience #MachineLearning #Experimentation #RideHailing #PolicyEvaluation #ABTesting #DifferenceInDifferences #PlaceboTest #Counterfactual #SurgePricing #Austin #DonorPool #CausalImpact #Technology #FexingoBusiness #BusinessPodcast Keep every episode free: buymeacoffee.com/fexingo
    Show More Show Less
    7 mins
  • How Data Scientists Use Conformal Prediction for Reliable Uncertainty Estimates
    Jun 28 2026
    In this episode, Lucas and Luna dive into conformal prediction, a model-agnostic framework that gives machine learning models reliable uncertainty estimates without sacrificing coverage guarantees. They discuss how it works — using a calibration set to produce prediction sets with a user-specified confidence level — and walk through a concrete example from medical imaging where a model flags skin lesions. They contrast it with Bayesian methods and softmax probabilities, and explore why it's gaining traction in regulated industries like healthcare and finance. No prior knowledge of conformal prediction required; just a curious mind about making AI more trustworthy. If today's tech conversation gave you something usable, consider supporting the show at buy me a coffee dot com slash fexingo — keeping it free from ads so we can focus on substance. #ConformalPrediction #UncertaintyQuantification #MachineLearning #DataScience #AI #TrustworthyAI #HealthcareAI #PredictiveModeling #HypothesisTesting #ModelInterpretability #Technology #Podcast #FexingoBusiness #BusinessPodcast #DataDriven #MLOps #AIEthics #Calibration Keep every episode free: buymeacoffee.com/fexingo
    Show More Show Less
    11 mins