Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Read by thought-leaders and decision-makers around the world. Phone Number: +1-650-246-9381 Email: [email protected]
228 Park Avenue South New York, NY 10003 United States
Website: Publisher: https://towardsai.net/#publisher Diversity Policy: https://towardsai.net/about Ethics Policy: https://towardsai.net/about Masthead: https://towardsai.net/about
Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Founders: Roberto Iriondo, , Job Title: Co-founder and Advisor Works for: Towards AI, Inc. Follow Roberto: X, LinkedIn, GitHub, Google Scholar, Towards AI Profile, Medium, ML@CMU, FreeCodeCamp, Crunchbase, Bloomberg, Roberto Iriondo, Generative AI Lab, Generative AI Lab Denis Piffaretti, Job Title: Co-founder Works for: Towards AI, Inc. Louie Peters, Job Title: Co-founder Works for: Towards AI, Inc. Louis-François Bouchard, Job Title: Co-founder Works for: Towards AI, Inc. Cover:
Towards AI Cover
Logo:
Towards AI Logo
Areas Served: Worldwide Alternate Name: Towards AI, Inc. Alternate Name: Towards AI Co. Alternate Name: towards ai Alternate Name: towardsai Alternate Name: towards.ai Alternate Name: tai Alternate Name: toward ai Alternate Name: toward.ai Alternate Name: Towards AI, Inc. Alternate Name: towardsai.net Alternate Name: pub.towardsai.net
5 stars – based on 497 reviews

Frequently Used, Contextual References

TODO: Remember to copy unique IDs whenever it needs used. i.e., URL: 304b2e42315e

Resources

Take our 85+ lesson From Beginner to Advanced LLM Developer Certification: From choosing a project to deploying a working product this is the most comprehensive and practical LLM course out there!

Publication

Top 10 reasons Why 87% of Machine learning projects fail?
Machine Learning

Top 10 reasons Why 87% of Machine learning projects fail?

Last Updated on September 18, 2020 by Editorial Team

Author(s): Prajeen M V

Machine Learning, Opinion

A lesson on the shocking reasons for your AI adoption disaster.

Photo by Jonathan Borba onΒ Unsplash

We see news about Machine learning everywhere. Indeed, there is a lot of potential in Machine learning. According to Gartner’s predictions, β€œThrough 2020, 80% of AI projects will remain alchemy, run by wizards whose talents will not scale in the organization” and Transform 2019 of VentureBeat predicted that 87% of AI projects will never make it into production.

Why is it like that? Why do so many projectsΒ fail?

Not Enough Expertise

One of the reasons is that the technology is still new to a large audience. In addition, most of the organizations are still unfamiliar with the software tools and the required hardware.

It seems that today, anyone who has worked in data analytics or software development who has done some sample data science projects are labeling themselves as data scientists after taking a short courseΒ online.

The fact is that experienced data scientists are needed to handle most of the machine learning and AI projects, especially when it comes to defining the success criteria, final deployment, and continuous monitoring of theΒ model.

A disconnect between Data Science and traditional Software development

A disconnect between Data Science and traditional Software development is another major factor. Traditional software development tends to be more predictable and measurable.

However, Data science is still part-research and part-engineering.

Data science research moves ahead with multiple iterations and experimentation. Sometimes, the whole project will have to loop back from the deployment phase to the planning phase since the metric that was picked is not driving user behavior.

Traditional Agile based project deliveries may not be expected from a Data science project. This will cause large scale confusion for the leader who has been working with clear deliveries at the end of each task cycles for normal software development projects.

Volume and Quality ofΒ Data

Everyone knows that the larger the dataset, the better the prediction from the AI system. Apart from the direct implications of the higher volumes, as the size of the data increases, a lot of new challenges arise.

In many such cases, you will have to merge data from multiple sources. Once you start doing it, you will realize that they are not in sync many times. This will result in a lot of confusion. Sometimes you will end up merging data that were not supposed to merge, which will result in having data points with the same name but different meanings.

Bad data at best will produce results that aren’t actionable or insightful. Bad data can also lead to misleading results.

Source

Labeling ofΒ data

The unavailability of labeled data is another challenge that stalls many of the machine learning projects. According to the MIT Sloan Management Review,

76% of the people combat this challenge by attempting to label and annotate training data on their own and 63% go so far as to try to build their own labeling and annotation automation technology.

This means that a huge percentage of expertise of those data scientists are lost for the labeling process. This is a major challenge for the effective execution of an AIΒ project.

This is the reason many of the companies are outsourcing the labeling task to other companies. However, it is a challenge to outsource the labeling task if it requires enough domain knowledge. Companies will have to invest in formal and standardized training of annotators if they need to maintain quality and consistency across datasets.

Another option is to develop their own data labeling tool if the data to be labeled complex. However, this often requires more engineering overhead than the Machine learning taskΒ itself.

Organizations areΒ Siloed

Data is the most important entity of a machine learning project. In most organizations, these data would reside in different places with different security constraints and in different formatsβ€Šβ€”β€Šstructured, unstructured, video files, audio files, text, andΒ images.

Having these data in different places in the different format itself is a challenge to handle. However, the challenge doubles when the organization is siloed, and responsible individuals are not collaborating with eachΒ other.

Photo by Dmitry Demidov onΒ Pexels

Lack of collaboration

Lack of collaboration between different teams such as Data Scientists, Data engineers, data stewards, BI specialists, DevOps, and engineering, is another major challenge. This is especially important for the teams in the engineering scheme of things to Data science since there are a lot many differences in the way they work and the technology they use to fulfill theΒ project.

It is the engineering team who is going to implement the machine learning model and take it to the production. So, there needs to be a proper understanding and strong collaboration betweenΒ them.

Technically Infeasible Projects

Since the cost of Machine learning projects tends to be extremely expensive, most of the enterprises tend to target a hyper-ambitious β€œmoon-shot” project that will completely transform the company or the product and give oversized return or investment.

Such projects will take forever to complete and will push the data science team to theirΒ limits.

Ultimately, the business leaders will lose confidence in the project and stop the investment.

It is always best to focus on a single, achievable project with the proper scope and target a discrete business challenge.

Alignment problem between Technical and BusinessΒ teams

Many times, ML projects are started without a clear alignment on expectations, goals, and success criteria of the project between the business and data scienceΒ teams.

These kinds of projects will forever stay in the research stage itself because they never know if they are making any progress since it was never clear what the objective was.

Here, the data science team will be focused mainly on accuracy, whereas the business team will be more interested in metrics such as financial benefits or business insights. In the end, the business team ends up not accepting the outcome from the Data ScienceΒ team.

Source

Lack of DataΒ Strategy

According to the MIT Sloan Management Review, only 50% of large enterprises with more than 100,000 employees are most likely to have a Data strategy. Developing a solid data strategy before you start the Machine learning project is critical.

You need to have a clear understanding of the following as part of Data strategy,

  • The total data you have in theΒ company
  • How much of that data is really required for the projects?
  • How will the required individuals have access to these data, and how easily those individuals can accessΒ them?
  • Specific strategy on how to bring all these data from different sourcesΒ together
  • How to clean up and transform theseΒ data.

Most of the companies start without a plan or don’t start thinking that they don’t have theΒ data.

Lack of Leadership support

It is easy to think that β€œyou just need to throw some money and technology at the problem and the result would come automatically”

We do not see the right support from the leadership to make sure of the needed conditions for success. Sometimes business leaders do not have confidence in the models developed by the data scientists.

This could be because of the combination of a lack of understanding of AI of the business leader and the inability of the data scientist to communicate the business benefits of the model to the leadership.

Ultimately, leaders need to understand how Machine learning works and what AI really means for the organization.


Top 10 reasons Why 87% of Machine learning projects fail? was originally published in Towards AIβ€Šβ€”β€ŠMultidisciplinary Science Journal on Medium, where people are continuing the conversation by highlighting and responding to this story.

Published via Towards AI

Feedback ↓