
From Model to Production: Deploying Deep Learning Without the Drama

Last Updated on October 4, 2025 by Editorial Team

Author(s): Van Khanh Do

Originally published on Towards AI.

Creating a model is the rehearsal; deploying it is the live concert — with no room for off-key notes.

To successfully roll deep learning models out in the real world, we need to consider many things: the model's quality as measured on the test dataset, how different the data in the wild is from the model's training data, and the fact that environments aren't frozen in time. As time goes by, the data feeding our model will inevitably drift.

Jeremy Howard, who wrote Deep Learning for Coders with fastai and PyTorch, came up with something he calls the Drivetrain Approach — a way to make sure our models don’t just work in theory, but deliver real value in practice.

“Many accurate models are of no use to anyone, and many inaccurate models are highly useful,” Jeremy wrote.

So, What Is The Drivetrain Approach?

Simply put, the Drivetrain Approach starts with your problem's objective, then asks what actions you can take to meet that objective. Next, identify the data that supports those actions, and finally, build a model that uses that data to produce the best outcome against your initial objective.

Deep Learning for Coders with fastai and PyTorch — The Drivetrain Approach.

Think about autonomous driving. The objective isn’t “build a car that recognizes stop signs” — that’s just one piece of the puzzle. The real goal is safe and efficient travel. So, we start with the objective (safety + efficiency), then list the possible actions the car can take (brake, accelerate, steer). Next, we ask what data supports those actions (cameras, LiDAR, radar, maps). Finally, we build models that use this data to guide the best action in each situation. That’s the Drivetrain Approach in motion.

At the end of the day, the Drivetrain Approach is what keeps us focused on what truly matters. By grounding everything in objectives, actions, and data, we give our models a real chance to solve meaningful problems — whether it’s safer driving, better healthcare, or smarter businesses.

Forget Linear, Go Spiral

Chasing the “perfect” dataset is a trap. Plenty of students, researchers, and even pros have wasted months, even years, searching for it. Meanwhile, the ones who just start are already three iterations ahead, learning and improving. So, what's the secret? Think end-to-end. Don't get lost fine-tuning models, polishing GUIs, or hand-labeling data. Ship the whole pipeline, see what breaks, and focus on the parts that truly shape the outcome. That's how real progress happens.

Photo by digitale.de on Unsplash

By iterating end-to-end, you'll quickly gain insight into how much data is truly necessary. For example, you might discover that 200 labeled samples are enough to achieve the performance your application requires, something you can't know until you actually run the whole pipeline end to end.
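As a toy illustration of that idea, suppose each end-to-end run logs a (label count, validation accuracy) pair. A tiny heuristic (a hypothetical helper, not from the book) can spot where extra labeling stops paying off:

```python
def enough_data(history, tolerance=0.01):
    """history: list of (n_labels, val_accuracy) pairs, one per end-to-end run.

    Returns the smallest label count whose accuracy is within `tolerance`
    of the best run, i.e. the point where more labeling stops paying off.
    """
    best = max(acc for _, acc in history)
    for n, acc in sorted(history):
        if best - acc <= tolerance:
            return n

# Four iterations of the full pipeline with growing label budgets:
runs = [(50, 0.70), (100, 0.85), (200, 0.90), (400, 0.905)]
print(enough_data(runs))  # 200: already within 1% of the best run
```

Only by actually running the pipeline several times do you get the numbers to feed a check like this.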

Always Double-Check Your Data?

Models are only as good as the data they're trained on, and real-world data is usually packed with hidden biases. Take this example: imagine you want to build an app to detect healthy skin. You scrape search results for “healthy skin” and what do you get? The following figure shows the results for that search query.

Deep Learning for Coders with fastai and PyTorch — Data for a healthy skin detector?

Not diversity, not reality, just endless images of young white women touching their faces. Train your model on that data and you won't get a healthy skin detector; you'll get a “young white woman touching her face” detector instead.

That’s why it’s critical to think ahead about the kinds of data your application will encounter in practice, and to make sure your training data truly represents them. Otherwise, your model will reflect not the truth — but the bias baked into its data.

It’s Very Likely That You’ve Used the Wrong Data Augmentation Techniques

Data augmentation is all about giving your training data a makeover — changing its appearance without changing its meaning. For images, that could mean rotating, flipping, tweaking brightness or contrast, or even warping the perspective.

But wait — data augmentation isn’t all sunshine and accuracy boosts. Some techniques can actually hurt your model if used carelessly. Here are some examples:

Deep Learning for Coders with fastai and PyTorch — Data Augmentation with Crop method.

Data augmentation with cropping, as shown in the figure above, can chop out crucial details. If you're classifying dog or cat breeds, you might lose the very face or body part that distinguishes one breed from another.

Deep Learning for Coders with fastai and PyTorch — Data Augmentation with Squish method.

Squishing or stretching, as shown in the figure above, creates shapes that don't exist in the real world. As a result, the model learns warped versions of reality and performs worse.

Deep Learning for Coders with fastai and PyTorch — Data Augmentation with Pad method.

Finally, padding, as shown in the figure above, might seem harmless, but it comes with a hidden cost. All that extra empty space is nothing but wasted computation for your model. Even worse, it reduces the effective resolution of the part of the image that really matters: the actual content you want the model to learn from.

So, what’s the right way to handle this? Instead of cropping, padding or stretching, the usual practice is to randomly crop different parts of an image during training. Each time the model sees an image, it gets a slightly different view. Over many passes through the dataset, this teaches the model to pay attention to different features — just like in real life, where the same object might show up in photos framed in slightly different ways.

It’s important to remember that a fresh neural network starts from zero. It doesn’t even know that an object rotated by a tiny angle is still the same thing. By training it on images where objects appear at slightly different sizes or positions, we help it learn the basic idea of what an object is and how it can look under different conditions.

So, how is this smarter approach any different from the “bad crop” we talked about earlier?

  • Naive cropping: You slice off parts of the image and hope for the best. But in doing so, you might lose the very clues that matter — a dog’s face, a cat’s ears, the detail that separates one breed from another. The model ends up handicapped because it never gets to see the whole story.
  • Random, repeated cropping during training: Instead of permanently cutting things out, you give the model a fresh view each time. One pass, it might see the top of the image. Next pass, the bottom. Over time, it sees the whole object, just framed in different ways. This mirrors real life — objects shift, zoom, and sometimes only show partly in photos. The result? A model that’s tougher, smarter, and far better at recognizing what truly matters.
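The core of that random, repeated cropping can be sketched in a few NumPy lines. This is an illustration of the idea only, not fastai's implementation (fastai's RandomResizedCrop also handles resizing and interpolation):

```python
import numpy as np

def random_crop(image, crop_h, crop_w, rng=None):
    """Return a randomly positioned crop_h x crop_w window of an H x W x C image.

    Applied fresh every epoch, so the model sees a different view of the
    same object on each pass rather than one permanently cropped version.
    """
    rng = rng or np.random.default_rng()
    h, w = image.shape[:2]
    top = int(rng.integers(0, h - crop_h + 1))
    left = int(rng.integers(0, w - crop_w + 1))
    return image[top:top + crop_h, left:left + crop_w]

# Five "epochs", five potentially different 4x4 views of the same 8x8 image:
img = np.arange(8 * 8 * 3).reshape(8, 8, 3)
views = [random_crop(img, 4, 4) for _ in range(5)]
```

Each call draws a fresh window, so over many passes the model eventually sees the whole object, just framed differently each time.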

Your Model Isn’t Just a Learner, It’s a Cleaner

Suppose we're training a bear classifier and have already prepared all our data. Time to train the model. Note: we'll use the random-cropping augmentation technique mentioned above.

Now we evaluate its performance using a confusion matrix, shown below:

Deep Learning for Coders with fastai and PyTorch — Confusion Matrix for Bear detector.

This handy chart doesn’t just tell us how many predictions were right or wrong — it shows us where the mistakes are happening. That’s crucial, because errors can come from two very different sources:

  • Dataset problem — for example, images that aren’t bears at all, or are labeled incorrectly.
  • Model problem — perhaps the model isn’t handling images taken with unusual lighting, or from a different angle, etc.
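fastai builds this chart for you (via its ClassificationInterpretation class), but the underlying structure is just a count table. A minimal sketch of how such a table is assembled:

```python
import numpy as np

def confusion_matrix(y_true, y_pred, n_classes):
    """Rows are actual classes, columns are predicted classes;
    cell [i, j] counts images of class i that the model called class j."""
    cm = np.zeros((n_classes, n_classes), dtype=int)
    for actual, predicted in zip(y_true, y_pred):
        cm[actual, predicted] += 1
    return cm

# 0 = grizzly, 1 = black bear: one grizzly was mistaken for a black bear.
cm = confusion_matrix([0, 0, 1, 1], [0, 1, 1, 1], n_classes=2)
print(cm)  # [[1 1]
           #  [0 2]]
```

Off-diagonal cells are the mistakes, and which cell they land in tells you which classes the model confuses.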

To dig deeper, we sort the images by their loss. We won't cover how the loss is calculated here; just know that the loss is higher when the model is incorrect and confident in its wrong answer, or when it's correct but not confident in its right answer.

Deep Learning for Coders with fastai and PyTorch — Top images with highest loss.

As shown in the figure above, the very first high-loss image tells the story. The model swears it's a grizzly, but our dataset insists it's a black bear. One look, and you can tell the model's probably right. By re-labeling this image, we instantly make the dataset cleaner. While most people assume data cleaning must happen before training, this example shows that a trained model can actually point out the messy parts of our dataset, faster and more reliably than manual inspection. That's why a practical strategy is:

  1. Train a quick and simple model first.
  2. Use it to help us with data cleaning.
  3. Retrain our model.

So, the takeaway is clear: the model doesn’t just learn from the data — it helps improve the data.
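In fastai, this ranking is exposed through ClassificationInterpretation.plot_top_losses; the ranking itself is easy to sketch for a cross-entropy loss (illustrative code, not the library's implementation):

```python
import numpy as np

def top_losses(probs, labels, k=3):
    """Rank samples by cross-entropy loss, highest first.

    probs: (N, C) array of predicted class probabilities.
    labels: (N,) array of integer class labels.
    Confident-but-wrong predictions land at the top of the list.
    """
    eps = 1e-12  # avoid log(0)
    losses = -np.log(probs[np.arange(len(labels)), labels] + eps)
    order = np.argsort(-losses)[:k]
    return order, losses[order]

# Sample 0 is confidently wrong, sample 2 is unsure, sample 1 is confidently right.
probs = np.array([[0.9, 0.1], [0.2, 0.8], [0.5, 0.5]])
labels = np.array([1, 1, 0])
order, losses = top_losses(probs, labels)
print(order)  # [0 2 1]
```

Reviewing the top of that list first is what makes model-assisted cleaning so much faster than scanning the whole dataset.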

How To Avoid Disaster?

When you’re building regular software, you can trace every line, inspect every step, and know exactly how the system will behave. Neural networks? Not so much. Their “logic” isn’t hard-coded — it emerges from the patterns in the training data. And that’s where things can get messy.

Imagine rolling out a bear detection system for campsites in national parks. Sounds great, right? But if we used a model trained on a neat little dataset scraped from the internet, we'd run into all kinds of problems in practice, such as:

  • Working with video data instead of images
  • Handling nighttime images, which may not appear in this dataset
  • Dealing with low-resolution camera images
  • Ensuring results are returned fast enough to be useful in practice
  • Recognizing bears in positions that are rarely seen in photos that people post online (for example from behind, partially covered by bushes, or a long way away from the camera)

Deep learning models are powerful but unpredictable, and that unpredictability can spell disaster. So, how do we avoid it? Keep these crucial steps in mind when deploying deep learning models:

  • Collect real data — don’t rely only on curated internet sets
  • Start manual and parallel — let humans double-check the model’s output
  • Limit scope — run small, time-boxed trials before scaling
  • Roll out slowly — expand step by step, not all at once
  • Monitor smartly — track metrics, watch for sudden shifts, and design reports that expose failure modes
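As a toy example of the “monitor smartly” point, a deployment could compare live prediction confidence against a baseline window and alert on a sudden shift. This is a hypothetical helper, far simpler than a real monitoring stack:

```python
import statistics

def drift_alert(baseline_scores, live_scores, threshold=0.1):
    """Alert when mean model confidence moves more than `threshold`
    away from the baseline window: a crude proxy for input drift."""
    shift = abs(statistics.mean(live_scores) - statistics.mean(baseline_scores))
    return shift > threshold

baseline = [0.91, 0.89, 0.90]   # confidence on curated validation data
night_cam = [0.58, 0.62, 0.60]  # confidence on nighttime camera frames
print(drift_alert(baseline, night_cam))  # True: time to investigate
```

A real system would track several metrics over rolling windows, but even a check this crude catches the “nighttime images the model never saw” failure mode early.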

Unforeseen Consequences and Feedback Loops — Rolling out a model doesn’t just predict behavior — it can change it. Take predictive policing: algorithms flag “high-crime” areas, police flood those neighborhoods, more arrests get recorded, and the cycle reinforces itself. As one study put it: these models aren’t predicting crime, they’re predicting future policing.

How to Stay Safe — Before launching any ML system, ask: What if it works really, really well? Who benefits? Who suffers? What’s the worst-case scenario? Then build in careful rollout plans, strong monitoring, and — most importantly — human oversight that actually has power to intervene.

Final thoughts

In the end, deploying deep learning models isn’t just about accuracy — it’s about responsibility. From biased data and domain shifts to feedback loops that can spiral out of control, the risks are real. But with careful rollout, constant monitoring, and meaningful human oversight, we can turn these risks into manageable challenges.

If you’d like to dive deeper, these insights come from Deep Learning for Coders with fastai and PyTorch by Jeremy Howard and Sylvain Gugger.

By the way, I wrote my first post introducing this book and why it matters for anyone interested in practical AI. You can read it here: Deep learning for coders with fastai and PyTorch — Build your AI applications without a PhD. I'd love your thoughts and feedback.

📘 Thanks for reading! I’m Khanh Do Van (k-dovan), writing about AI, NLP, and Data with a focus on practical insights and real-world applications.
⭐ If you found this useful, follow me here on Medium for more deep dives, tutorials, and lessons learned.
💻 You can also check out my projects and code on GitHub: github.com/k-dovan.
