Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Read by thought-leaders and decision-makers around the world. Phone Number: +1-650-246-9381 Email: [email protected]
228 Park Avenue South New York, NY 10003 United States
Website: Publisher: https://towardsai.net/#publisher Diversity Policy: https://towardsai.net/about Ethics Policy: https://towardsai.net/about Masthead: https://towardsai.net/about
Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Founders: Roberto Iriondo, , Job Title: Co-founder and Advisor Works for: Towards AI, Inc. Follow Roberto: X, LinkedIn, GitHub, Google Scholar, Towards AI Profile, Medium, ML@CMU, FreeCodeCamp, Crunchbase, Bloomberg, Roberto Iriondo, Generative AI Lab, Generative AI Lab Denis Piffaretti, Job Title: Co-founder Works for: Towards AI, Inc. Louie Peters, Job Title: Co-founder Works for: Towards AI, Inc. Louis-François Bouchard, Job Title: Co-founder Works for: Towards AI, Inc. Cover:
Towards AI Cover
Logo:
Towards AI Logo
Areas Served: Worldwide Alternate Name: Towards AI, Inc. Alternate Name: Towards AI Co. Alternate Name: towards ai Alternate Name: towardsai Alternate Name: towards.ai Alternate Name: tai Alternate Name: toward ai Alternate Name: toward.ai Alternate Name: Towards AI, Inc. Alternate Name: towardsai.net Alternate Name: pub.towardsai.net
5 stars – based on 497 reviews

Frequently Used, Contextual References

TODO: Remember to copy unique IDs whenever it needs used. i.e., URL: 304b2e42315e

Resources

Take our 85+ lesson From Beginner to Advanced LLM Developer Certification: From choosing a project to deploying a working product this is the most comprehensive and practical LLM course out there!

Publication

No ML Algorithms Cheat Sheet, Please
Machine Learning

No ML Algorithms Cheat Sheet, Please

Last Updated on June 16, 2020 by Editorial Team

Author(s): Venkat Raman

Machine Learning, Opinion

Image source: https://unsplash.com/

What is a Cheat SheetΒ ?

Wikipedia defines cheat sheets as a concise set of notes used for quick reference. Now the word that needs to be emphasized here is β€˜quick reference’.

In programming, cheat sheets are OK because no one can remember all the syntax of a programming language. Especially if the programming language constantly evolves (like Python) or if the programmer finds himself/herself transitioning in and out of different programming languages.

A quick reference like a cheat sheet helps the programmer save time and focus on the largerΒ problem.

Data ScientistΒ , What are you in a hurry forΒ ?

Image source: Pinterest

Machine learning algorithm learning and implementation are never supposed to be a 100 M dash. Each machine learning implementation is supposed to be mulled over, thought through carefully, and then implemented. Data science solution takes time, it is an exploratory and experimental endeavor.

Following some cheat sheet makes you less experimental and you fail to explore all the options. The β€˜Dive-straight into the problem’ attitude might help you win some Kaggle competitions but it won’t take you far in real-life machine learning useΒ cases.

OK, now let’s get to the crux of theΒ matter…

Why ML algorithms cheat sheets are a badΒ idea?

Data and Assumptions

Even within a company, one department’s business problem varies from the other. On a case to case basis, the data variety & complexity are too vast that no one single approach could be prescribed. But ML cheat sheet does exactlyΒ that.

For e.g. If data < 1k, choose algorithm X: Else If data > 1k, choose algorithm Y

Coming to the assumptions, there are multitudes of assumptions considered for every machine learning algorithm. Starting from, assumptions about the data generation process to assumptions about the model. These assumptions are simply not studied or evaluated inΒ detail.

Cheat sheets put you on a path with no U-turns orΒ detours

Image source:Β Pixabay

Much like hard coding in programming, cheat sheets for ML algorithms constrain your options. They put you on a path in which you merrily thread and once when you do realize that the path that you are on is wrong, it is often tooΒ late!

No opportunity toΒ Innovate

If you go by cheat sheets, you are not taking the road less traveled. Needless to say, Innovation happens by taking the road less traveled. The cheat sheets don’t tell you to apply learning from one domain to another. Transfer learning ain’t happening here. Neither does it tell you to try some ensemble technique or to try some amalgamation of different algorithmic techniques. One is more or less like a horse with blinkers.

We (Data Scientists) stand on the shoulders of giants. Be it OLS from Legendre or Geoffrey Hinton’s various Deep learning techniques, none of them were invented by following cheatΒ sheets.

Cheat sheet makes your Decision making β€˜machine-like’

Well, coding machine learning algorithms does not mean you become the machine itselfΒ !! Cheat sheets often make your decisions binary at everyΒ stage.

For e.g

Voila, you have your clusters.. or doΒ you?

A naive Data scientist or an aspiring Data Scientist would just be happy with clusters he/she got and just moveΒ on.

But here is theΒ catch…

One of the pitfalls k means algorithm has is that it will cluster almost anything. Because we now have clusters, it does not mean we have accomplished anything! It is just a Pyrrhic victory. I would urge the readers to read this excellent blog by David Robinson on the drawbacks of kΒ means.

So one can clearly see that, while a cheat sheet has led the data scientist down the path of K means algorithm, it gives a false sense of task completion when further probing is required.

No free lunch theoryβ€Šβ€”β€ŠThe final nail in theΒ coffin

Perhaps β€˜No free lunch theory’ is the final nail in the coffin for ML cheat sheets. No free lunch theory statesΒ that

β€œThere is no one model that works best for every problem. The assumptions of a great model for one problem may not hold for another problem”.

If there is no one model or algorithm which works best for every problem, then does it really make sense to have an ML algorithm cheatΒ sheet?

Image source:Β Giphy

So, please refrain from using ML algorithm cheat sheets. Try to arrive at a solution organically. Let your mind intuit and connect theΒ dots!

Your comments and opinions areΒ welcome.

You can reach out to meΒ on

Linkedin

Twitter


No ML Algorithms Cheat Sheet, Please was originally published in Towards AIβ€Šβ€”β€ŠMultidisciplinary Science Journal on Medium, where people are continuing the conversation by highlighting and responding to this story.

Published via Towards AI

Feedback ↓