Linear Regression from Scratch
Author(s): Nunzio Logallo Originally published on Towards AI. Photo by William Daigneault on Unsplash If you started to learn data science or machine learning, you have also heard about linear regression, but what is it and when should we use it? Linear …
Predicting Sales using R programming
Author(s): Suyash Maheshwari Originally published on Towards AI. In this article, I will forecast the sales of a multinational retail corporation. The dataset for this can be found on Kaggle. We have been provided with weekly sales data on which we train …
How and Why to Implement Stemming and Lemmatization from NLTK
Author(s): Manmohan Singh Originally published on Towards AI. In this article, we try to solve one of NLPβs problems by implementing Stemming and Lemmatization Source: pixxabay.com The English language has more than a million words in its vocabulary. Around 170k are in …
5 Different Ways to Build ML Models!
Author(s): Deepak Sekar Originally published on Towards AI. We have come across data science platforms and ML offerings targeted for expert audiences who have Python/ R/ Matlab..etc skills and who understand algorithms/ kernels..etc. But, what if people who understand data very well …
Mismatch Between Academic and Real-world Data Science Projects
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Capstone projects in academic data science training should be a semester or two long, and should prioritize group projects over individual projects Photo by Mimi Thian on Unsplash Academic data science programs …
12 Free NLP Datasets to Work on to Tide you Through this Pandemic
Author(s): Timothy Tan Originally published on Towards AI. Photo by Luke Chesser on Unsplash You probably have seen how fast COVID-19 spreads if it left unchecked. As much as we all want our normal lives back again, the best thing we can …
Introduction to the Pandas Library
Author(s): Saiteja Kura Originally published on Towards AI. Source β Nimble Coding Before beginning, I would suggest you read my previous article on NumPy here. Although NumPyβs arrays are better than Pythonβs data structures several limitations hinder its usage.1. NumPyβs high dimensional …
Implementation of Principal Component Analysis from scratch
Author(s): Navoneel Chakrabarty Originally published on Towards AI. Letβs Get Started Real-time data may have a vast number of attributes, which often makes essential Exploratory Data Analytics very difficult. Such data are known as highly Multi-Dimensional Data in which each and every …
Letβs see the sales in a Big Mart Store
Author(s): Saikat Biswas Originally published on Towards AI. Image by Anastasia Dulgier on Unsplash Predicting the sales of each product in a particular store We as Data Analysts and Data Science Practitioners are often challenged to predict the factors that have an …
R Script for Data Science Coding Exercise
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Some coding challenge problems would specify that a formal project report be submitted together with a Jupyter notebook or R Script file Photo by Safar Safarov on Unsplash The take-home challenge problem …
Who Can Become a Data Scientist?
Author(s): Chan Naseeb Originally published on Towards AI. What does it take to be a data scientist? Questions and Answers Photo by Luke Chesser on UnsplashBachelor in Business Administration (BBA) holderPetroleum engineer,Psychologist,Anyone! Here is my take on this type of questions: Keep …
Marketing Analytics (Part 1)
Author(s): Mishtert T Originally published on Towards AI. Simple Linear Regression Customer Lifetime Value (CLV) In marketing, customer lifetime value (CLV or often CLTV), lifetime customer value (LCV), or lifetime value (LTV) is a prediction of the net profit attributed to the …