From Raw to Refined: A Journey Through Data Preprocessing β Part 3: Duplicate Data
Author(s): Shivamshinde Originally published on Towards AI. This article will explain how to identify duplicate records in the data and, the different ways to deal with the problem of having duplicate records. Photo by Pineapple Supply Co. on Unsplash Why the presence …
What does βGarbage in, garbage outβ mean in solving real business problems?
Author(s): Zijing Zhu Originally published on Towards AI. and how to avoid it with a practical workflow Photo by Gary Chan on Unsplash This member-only story is on us. Upgrade to access all of Medium. In today's business landscape, relying on accurate …
From Raw to Refined: A Journey Through Data Preprocessing β Part 2: Missing Values
Author(s): Shivamshinde Originally published on Towards AI. Photo by Holly Stratton on Unsplash Before going through this article, please check out the previous article in the series on feature engineering. From Raw to Refined: A Journey Through Data Preprocessing β Part 1: …
From Raw to Refined: A Journey Through Data Preprocessing β Part 1
Author(s): Shivamshinde Originally published on Towards AI. Photo by Edward Howell on Unsplash Sometimes, the data we receive for our machine learning tasks isnβt in a suitable format for coding with Scikit-Learn or other machine learning libraries. As a result, we have …
From Raw to Refined: A Journey Through Data Preprocessing β Part 1: Feature Scaling
Author(s): Shivamshinde Originally published on Towards AI. Photo by Edward Howell on Unsplash Sometimes, the data we receive for our machine learning tasks isnβt in a suitable format for coding with Scikit-Learn or other machine learning libraries. As a result, we have …
The Complete Guide to Data Preprocessing (Part 2)
Author(s): Dr. Roi Yehoshua Originally published on Towards AI. Loading the Data Set In the first part of this article, we described the data preprocessing process and showed how to handle missing values, categorical data, outliers and skewed data. In this part …
The Complete Guide to Data Preprocessing (Part 1)
Author(s): Dr. Roi Yehoshua Originally published on Towards AI. Data preprocessing is the process of cleaning, transforming, and organizing your data set in order to prepare it for data analysis and modeling. It aims to improve the quality, integrity, and reliability of …
Understanding Dimensionality Reduction
Author(s): Vikas K Solegaonkar (ThinkPro Systems) Originally published on Towards AI. Python Code Top highlight We all understand that more data means better AI. That sounds great! But, with the recent blast of information, we often end in a problem of too …