10 Basic Feature Engineering Techniques to Prepare Your Data
Last Updated on November 4, 2024 by Editorial Team
Author(s): Muhammad Ihsan
Originally published on Towards AI.
This member-only story is on us. Upgrade to access all of Medium.
Photo by Josh Redd on UnsplashNot a member? Click here to see the full story
When we have data, we will certainly think about extracting valuable values ββfrom it; the goal may be to help decision-making or predict future trends.
So how do we get that?
At times like this, we need feature engineering. Feature Engineering is the process of modifying raw data into more informative features. In this article, we will learn ten basic feature engineering techniques, a brief intuition, and implementation examples.
Before we get started, we need to import additional libraries to ensure our code works well. Here are the libraries used in this article:
import pandas as pd #data manipulationimport numpy as np #numerical computationimport matplotlib.pyplot as plt #data visualizationimport gensim.downloader as api #to download corpus provided in gensimfrom gensim.models import Word2Vec #word embeddingfrom sklearn.pipeline import Pipeline #chaining multiple data processing stepsfrom sklearn.decomposition import PCA #pca implementationfrom sklearn.datasets import load_iris # load iris datasetfrom sklearn.impute import SimpleImputer # imputationfrom sklearn.compose import ColumnTransformer #applying transformation to datasetfrom sklearn.feature_extraction.text import TfidfVectorizer #tf-idf implementation classfrom sklearn.preprocessing import MinMaxScaler, StandardScaler #scaling data
Letβs start by learning about Imputation. This term refers to the process of… Read the full blog for free on Medium.
Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming aΒ sponsor.
Published via Towards AI