Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Read by thought-leaders and decision-makers around the world. Phone Number: +1-650-246-9381 Email: [email protected]
228 Park Avenue South New York, NY 10003 United States
Website: Publisher: https://towardsai.net/#publisher Diversity Policy: https://towardsai.net/about Ethics Policy: https://towardsai.net/about Masthead: https://towardsai.net/about
Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Founders: Roberto Iriondo, , Job Title: Co-founder and Advisor Works for: Towards AI, Inc. Follow Roberto: X, LinkedIn, GitHub, Google Scholar, Towards AI Profile, Medium, ML@CMU, FreeCodeCamp, Crunchbase, Bloomberg, Roberto Iriondo, Generative AI Lab, Generative AI Lab Denis Piffaretti, Job Title: Co-founder Works for: Towards AI, Inc. Louie Peters, Job Title: Co-founder Works for: Towards AI, Inc. Louis-François Bouchard, Job Title: Co-founder Works for: Towards AI, Inc. Cover:
Towards AI Cover
Logo:
Towards AI Logo
Areas Served: Worldwide Alternate Name: Towards AI, Inc. Alternate Name: Towards AI Co. Alternate Name: towards ai Alternate Name: towardsai Alternate Name: towards.ai Alternate Name: tai Alternate Name: toward ai Alternate Name: toward.ai Alternate Name: Towards AI, Inc. Alternate Name: towardsai.net Alternate Name: pub.towardsai.net
5 stars – based on 497 reviews

Frequently Used, Contextual References

TODO: Remember to copy unique IDs whenever it needs used. i.e., URL: 304b2e42315e

Resources

Unlock the full potential of AI with Building LLMs for Production—our 470+ page guide to mastering LLMs with practical projects and expert insights!

Publication

What’s It Like to Work in Applied AI?
Latest   Machine Learning

What’s It Like to Work in Applied AI?

Last Updated on July 26, 2023 by Editorial Team

Author(s): Branden Lisk

Originally published on Towards AI.

Understanding the role of a Machine Learning Engineer from an early-career perspective

Source: Photo by Possessed Photography on Unsplash

Introduction

Disclosure and Accuracy

This article is written from the perspective of an early-career stage: 2+ years in the tech space, 1+ years of applied AI/ML experience, with an (expected) bachelor’s level degree in engineering. The bulk of experience lies in start-up, small, and medium-sized environments, which are inherently less structured than established organizations.

The author strives to maintain accurate information that is useful to a general audience. However, note that information on this topic evolves quickly, and practices can be wildly different from organization to organization, so there is an abundance of conflicting opinions from various sources.

Intended Audience

This article is intended for other early-career engineers in the field of applied AI, software engineers interested in a career transition, and anyone else with interest in this topic.

What is Applied AI?

There are plenty of existing resources to dig deeper into this, but in summary, applied AI is:

“…the branch of artificial intelligence that brings it out of the lab and into the real world…” [1]

In essence, applied AI is taking scientific research and applying it to business problems.

What is a Machine Learning Engineer?

It would be helpful to start with a definition of machine learning. It’s defined as:

“…the process of using mathematical models of data to help a computer learn without direct instruction. This enables a computer system to continue learning and improving on its own, based on experience.” [2]

Essentially, “AI” is the idea of an “intelligent” machine. Machine learning is the process used to develop that “intelligence”. So in this context, a Machine Learning (ML) Engineer designs and builds the infrastructure needed to enable the process of machine learning.

We’ll discuss more on the particular responsibilities of an ML Engineer later.

Expectation vs. Reality

How does this differ from how we traditionally understand the role of an engineer who works in the field of “Machine Learning” and “Artificial Intelligence”? It’s likely we think of someone who does the following:

  • a lot of research and reading into published papers,
  • approaching new and novel problems,
  • planning and conducting experiments to invent new methods to solve these problems,
  • publishing new papers.

These responsibilities are actually fulfilled by someone we’d call a “Research Scientist” or even a “Data Scientist”. We’re falsely attributing theory to practice.

In reality, this is actually what we can expect to see in practice:

Source: Image by the author, designed using resources from Flaticon and Freepik

We can see that staying informed of current research and approaches to problems by reading papers is unchanged. However, each other point has an analogous responsibility more reflective of a machine learning engineer’s day-to-day.

We’ll talk more about these distinctions a bit later. First, we have to discuss some industry jargon that is thrown around in the “Applied AI” space.

Convoluted Separation of Responsibilities

Job Titles

This is the area where you’ll experience the most conflicting opinions. The separation of responsibilities according to different job titles is not consistent between organizations or even sub-teams within the same organization. Trying to understand different descriptions and convoluted diagrams is largely a waste of time and will make you even more confused about the distinctions. Trust me, when I tried for this article, I started to question what my own job even is. You’ll hear some subset of the following titles:

  • Data Engineer
  • Data Scientist
  • Applied Data Scientist
  • Data Analyst
  • Business Analyst
  • ML Engineer
  • MLOps
  • Data Infrastructure Engineer
  • DevOps

Organizations describing job postings, as well as individuals describing past experiences, use many of these terms interchangeably, so it’s very hard to find consistent data.

At the risk of adding more confusion to this argument, I’ll define my own generic titles that have no major significance outside of this article, according to how they fit into the data science process.

Data Science Process

Standard data science process. Black arrows represent the main flow, grey dashed arrows represent decision points in which we return to a previous step. (Source: Image by the author, inspired by [3] and [4])

We can see five distinct “roles” in the above diagram:

  • Business: This stage involves understanding and formulating the business problem and defining the goal of the project. This role would typically be called a “Business (and/or Data) Analyst”.
  • Data: This stage involves iterating until the desired amount and quality of data is achieved for feature engineering. These include understanding potential data sources that could be used to solve the business problem (assessing key factors such as availability, usability, quality, and cost), collecting said data, and cleaning (mitigating outliers, resampling, etc.) the collected data. This would be assigned to a combination of the “Data Engineer” and “Data Labeler”. Building the data pipeline would typically be assigned to the “Data Infrastructure Engineer”.
  • Model: This stage involves two parts: feature engineering and modeling. The collected data is used to perform feature engineering. A model is selected, built, and trained. The model is then evaluated against the metrics set in the “Business” stage. If it does not achieve the required metrics, we have to assess the problem and return to a previous step. This role has the most inconsistency in terms of job title. It can be any of: “Data Scientist” (the most incorrect), “Applied Data Scientist,” or “Data Analyst”.
  • Operations: This stage involves deploying, serving, and monitoring the completed model. Monitoring is a continuous action for the lifetime of the model. This role would traditionally be called “DevOps” (software development and IT operations) but is typically called “MLOps” (machine learning operations) for machine learning projects.
  • Maintenance: Finally, if monitoring detects the model is no longer achieving the desired metrics, we must revert to a previous stage in the process.

Covering the missing title from above, the responsibilities of an “ML Engineer” are Data, Model, Operations, and Maintenance, typically with an emphasis on Data and Model.

Early-Career Perspective

Now that we understand what “applied AI” is from a high-level and the common roles you’ll hear in the field, what does this mean for you? In this section, we’ll discuss what you can expect as an early-career ML Engineer.

Source: Photo by Saulo Mohana on Unsplash

Common Misconceptions

We’ll start by walking through some misconceptions about the field.

Machine learning is heavy in statistics. You need deep statistical knowledge to work in the field.

If you tell people you work in machine learning, you’ll often hear, “wow, you must be really good with statistics and math,” and my response is usually, “Maybe? But it’s not incredibly relevant for my job”.

This is to say that this statement is true to an extent but heavily exaggerated. At its backbone, machine learning is just a new application of old mathematics: statistics, linear algebra, and calculus. It’s important to understand the fundamentals to understand how to apply machine learning truly effectively.

However, this only goes so far. As an ML Engineer, will you ever need to implement backpropagation from scratch? Probably not. Modern ML frameworks abstract all of this fundamental understanding away (for better or for worse). Will you need to understand the math used in research papers (at a high level)? Absolutely. This is required knowledge to adapt their approach to your own business problem effectively.

You need a graduate-level degree to work in the machine learning field.

This is untrue from an early-career perspective in the applied AI field. Many that work in this field do have advanced degrees, but that doesn’t imply that you need an advanced degree. Many that I have worked with prove that this isn’t necessarily true (at least anecdotally).

Practical machine learning engineering is quite a bit more traditional software engineering than you’d think.

Machine learning is all research.

Machine learning engineers are engineers at their core.

“Engineers are individuals who combine knowledge of science and mathematics to solve technical problems that confront society.” [5]

ML engineers utilize scientific research to solve business problems. They care more about how to make the solution work in practice. Most problems are solved problems, and understanding and adapting existing solutions to a business use case is the primary goal.

Where do you fit in?

As an early-career machine learning engineer, where in this process do you fit? This can heavily depend on the organization, but I’ll describe this as generally as possible.

Referring to the “Data Science Process” diagram, I’ll break this down into individual sections.

  • Business: Generally, you will not be involved in this stage. You may be approached to discuss technical feasibility and, in some rare cases (specifically start-ups), may be involved in prioritizing business problems.
  • Data: You’ll be quite involved in this stage—specifically, understanding data sources and cleaning data. Data collection is often outsourced in larger organizations (if it’s needed at all). In rare cases (again, start-ups), you’ll collect the data yourself. This is generally the most lengthy (and most important) stage.
  • Model: You’ll be very involved in this stage. Feature engineering and modeling are generally your full responsibility.
  • Operations: You’ll likely not be involved in this stage. Most organizations have a dedicated DevOps team that will handle this. The exception would be configuring model monitoring tools specific to your application.
  • Maintenance: You’ll be responsible for future maintenance of your deployed models. I’d suggest you make this process as easy as possible for yourself (if you’re wondering what could go wrong, check out a previous post).

Conclusion

We’ve seen what “Applied AI” is and the job titles of those who work in the field. We also saw the various stages of the data science process and where the “Machine Learning Engineer” fits in. Using this knowledge as a foundation, we explored the key responsibilities of an early-career machine learning engineer and some common misconceptions about the field.

I hope this article paints a realistic picture of the early-career environment of the applied AI field and helps guide you on the next step of your career.

This is just the beginning! If you enjoyed the article, follow me to get notified of the next ones! I appreciate the support U+2764️

Branden Lisk — Medium

Read writing from Branden Lisk on Medium. Machine Learning R&D Engineer U+1F916 U+007C Passionate about building cool and…

medium.com

Sources

[1] “Applied Artificial Intelligence,” Cognizant Technology Solutions. [Online]. Available: https://www.cognizant.com/us/en/glossary/applied-ai. [Accessed: 24-Jul-2022].

[2] “Artificial Intelligence vs. Machine Learning,” Microsoft Azure. [Online]. Available: https://azure.microsoft.com/en-us/solutions/ai/artificial-intelligence-vs-machine-learning/#introduction. [Accessed: 24-Jul-2022].

[3] C. Nantasenamat, “The Data Science Process,” Towards Data Science, 27-Jul-2020.

[4] A. Burkov, Machine Learning Engineering. Québec, Canada: True Positive Inc., 2020.

[5] D. Wang, “The Engineering Profession,” in ECE 190, 2018.

Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming a sponsor.

Published via Towards AI

Feedback ↓