Predicting Sales using R programming
Author(s): Suyash Maheshwari Originally published on Towards AI. In this article, I will forecast the sales of a multinational retail corporation. The dataset for this can be found on Kaggle. We have been provided with weekly sales data on which we train …
How and Why to Implement Stemming and Lemmatization from NLTK
Author(s): Manmohan Singh Originally published on Towards AI. In this article, we try to solve one of NLPβs problems by implementing Stemming and Lemmatization Source: pixxabay.com The English language has more than a million words in its vocabulary. Around 170k are in …
5 Different Ways to Build ML Models!
Author(s): Deepak Sekar Originally published on Towards AI. We have come across data science platforms and ML offerings targeted for expert audiences who have Python/ R/ Matlab..etc skills and who understand algorithms/ kernels..etc. But, what if people who understand data very well …
Mismatch Between Academic and Real-world Data Science Projects
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Capstone projects in academic data science training should be a semester or two long, and should prioritize group projects over individual projects Photo by Mimi Thian on Unsplash Academic data science programs …
Checking the Sentiment of a Tweet Using Machine Learning
Author(s): Saikat Biswas Originally published on Towards AI. Image Source: Unsplash Letβs see some tweets and classify them as positive or negative sentiment When we hear the word Twitter, what does it ring to our ears? Well, for some, its a source …
12 Free NLP Datasets to Work on to Tide you Through this Pandemic
Author(s): Timothy Tan Originally published on Towards AI. Photo by Luke Chesser on Unsplash You probably have seen how fast COVID-19 spreads if it left unchecked. As much as we all want our normal lives back again, the best thing we can …
Introduction to the Pandas Library
Author(s): Saiteja Kura Originally published on Towards AI. Source β Nimble Coding Before beginning, I would suggest you read my previous article on NumPy here. Although NumPyβs arrays are better than Pythonβs data structures several limitations hinder its usage.1. NumPyβs high dimensional …
Implementation of Principal Component Analysis from scratch
Author(s): Navoneel Chakrabarty Originally published on Towards AI. Letβs Get Started Real-time data may have a vast number of attributes, which often makes essential Exploratory Data Analytics very difficult. Such data are known as highly Multi-Dimensional Data in which each and every …
Letβs see the sales in a Big Mart Store
Author(s): Saikat Biswas Originally published on Towards AI. Image by Anastasia Dulgier on Unsplash Predicting the sales of each product in a particular store We as Data Analysts and Data Science Practitioners are often challenged to predict the factors that have an …
R Script for Data Science Coding Exercise
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Some coding challenge problems would specify that a formal project report be submitted together with a Jupyter notebook or R Script file Photo by Safar Safarov on Unsplash The take-home challenge problem …
Data Scientist Types
Author(s): Chan Naseeb Originally published on Towards AI. What kind of data scientists do you come across in the market? Photo by Franki Chamaki on Unsplash I have seen different types of people around carrying out and using the title of a …