Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Read by thought-leaders and decision-makers around the world. Phone Number: +1-650-246-9381 Email: [email protected]
228 Park Avenue South New York, NY 10003 United States
Website: Publisher: https://towardsai.net/#publisher Diversity Policy: https://towardsai.net/about Ethics Policy: https://towardsai.net/about Masthead: https://towardsai.net/about
Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Founders: Roberto Iriondo, , Job Title: Co-founder and Advisor Works for: Towards AI, Inc. Follow Roberto: X, LinkedIn, GitHub, Google Scholar, Towards AI Profile, Medium, ML@CMU, FreeCodeCamp, Crunchbase, Bloomberg, Roberto Iriondo, Generative AI Lab, Generative AI Lab Denis Piffaretti, Job Title: Co-founder Works for: Towards AI, Inc. Louie Peters, Job Title: Co-founder Works for: Towards AI, Inc. Louis-François Bouchard, Job Title: Co-founder Works for: Towards AI, Inc. Cover:
Towards AI Cover
Logo:
Towards AI Logo
Areas Served: Worldwide Alternate Name: Towards AI, Inc. Alternate Name: Towards AI Co. Alternate Name: towards ai Alternate Name: towardsai Alternate Name: towards.ai Alternate Name: tai Alternate Name: toward ai Alternate Name: toward.ai Alternate Name: Towards AI, Inc. Alternate Name: towardsai.net Alternate Name: pub.towardsai.net
5 stars – based on 497 reviews

Frequently Used, Contextual References

TODO: Remember to copy unique IDs whenever it needs used. i.e., URL: 304b2e42315e

Resources

Take the GenAI Test: 25 Questions, 6 Topics. Free from Activeloop & Towards AI

Publication

Encoding Categorical Data: A Step-by-Step Guide
Data Analysis   Data Science   Latest   Machine Learning

Encoding Categorical Data: A Step-by-Step Guide

Last Updated on September 3, 2024 by Editorial Team

Author(s): Souradip Pal

Originally published on Towards AI.

This member-only story is on us. Upgrade to access all of Medium.

Imagine you’re baking a cake, but instead of sugar, flour, and eggs, you have words like β€œvanilla,” β€œchocolate,” and β€œstrawberry” on your countertop. As much as you’d like to start, there’s a problem β€” your recipe can only follow numeric measurements, not words. This is exactly what happens when you try to feed categorical data into a machine-learning model. The model needs numbers to work its magic, not strings of text.

Image generated by Dall-E

In this hands-on tutorial, we’ll unravel the mystery of encoding categorical data so your models can process it with ease. We’ll break down the types of categorical data, discuss when and why each encoding method is used, and dive into Python code examples that show exactly how to get the job done.

Before we start transforming data, let’s get our definitions straight. In the world of data, you generally have two types: numerical and categorical. Machine learning models can easily understand numbers β€” no surprise there! But when it comes to words or labels, we need to convert these into numbers to help our models β€œunderstand” the data.

Ordinal Data:Ordinal data is like your favorite Netflix ranking list… Read the full blog for free on Medium.

Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming aΒ sponsor.

Published via Towards AI

Feedback ↓