Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Read by thought-leaders and decision-makers around the world. Phone Number: +1-650-246-9381 Email: [email protected]
228 Park Avenue South New York, NY 10003 United States
Website: Publisher: https://towardsai.net/#publisher Diversity Policy: https://towardsai.net/about Ethics Policy: https://towardsai.net/about Masthead: https://towardsai.net/about
Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Founders: Roberto Iriondo, , Job Title: Co-founder and Advisor Works for: Towards AI, Inc. Follow Roberto: X, LinkedIn, GitHub, Google Scholar, Towards AI Profile, Medium, ML@CMU, FreeCodeCamp, Crunchbase, Bloomberg, Roberto Iriondo, Generative AI Lab, Generative AI Lab Denis Piffaretti, Job Title: Co-founder Works for: Towards AI, Inc. Louie Peters, Job Title: Co-founder Works for: Towards AI, Inc. Louis-François Bouchard, Job Title: Co-founder Works for: Towards AI, Inc. Cover:
Towards AI Cover
Logo:
Towards AI Logo
Areas Served: Worldwide Alternate Name: Towards AI, Inc. Alternate Name: Towards AI Co. Alternate Name: towards ai Alternate Name: towardsai Alternate Name: towards.ai Alternate Name: tai Alternate Name: toward ai Alternate Name: toward.ai Alternate Name: Towards AI, Inc. Alternate Name: towardsai.net Alternate Name: pub.towardsai.net
5 stars – based on 497 reviews

Frequently Used, Contextual References

TODO: Remember to copy unique IDs whenever it needs used. i.e., URL: 304b2e42315e

Resources

Unlock the full potential of AI with Building LLMs for Productionβ€”our 470+ page guide to mastering LLMs with practical projects and expert insights!

Publication

Exploratory Data Analysis in Few Seconds
Data Analysis

Exploratory Data Analysis in Few Seconds

Last Updated on February 21, 2021 by Editorial Team

Author(s): Juhi Sharma

Data Analysis

EDA through Pandas VisualΒ Analysis

Photo by lynn lynum onΒ Unsplash

EDA is a way to understand what the data is all about. It is very important as it helps us to understand the outliers, relationship of features within the data with the help of graphs andΒ plots.

EDA is a time taking process as we need to make visualizations between different features using libraries like Matplot, seaborn,Β etc.

There is a way to automate this process by a single line of code using the library Pandas Visual Analysis.

About Pandas VisualΒ Analysis

  1. It is an open-source python library used for Exploratory Data Analysis.
  2. It creates an interactive user interface to visualize datasets in Jupyter Notebook.
  3. Visualizations created can be downloaded as images from the interface itself.
  4. It has a selection type that will help to visualize patterns with and without outliers.

Implementation

  1. Installation

We will start by installing pandas visual analysis using pipΒ install.

!pip install pandas_visual_analysis

2. Importing Dataset

The data set that I am using here is already present in the seaborn library which can be easily loaded using the code givenΒ below.

import seaborn as sns
df=sns.load_dataset('tips')
df.head()
Dataset(Sourceβ€Šβ€”β€ŠByΒ author)

3. EDA using Pandas VisualΒ Analysis

In this step, we will be importing pandas visual analysis and use it for exploratory data analysis of the dataset we areΒ using.

from pandas_visual_analysis import VisualAnalysis
VisualAnalysis(df)

This will create a user interface containing all visualizations of our Dataset that too in just a few seconds. Here you can clearly visualize different attributes of theΒ dataset.

Visualization(Source byΒ author)

Understanding Output

Let’s understand the different sections in the user interfaceΒ :

  1. Statistical Analysis: This section will show the statistical properties like Mean, Median, Mode, and Quantiles of all numerical features.
  2. Scatter Plot-It shows the Distribution between 2 different features with the help of a scatter plot. you can choose features to be plotted on the X and Y axis from the dropdown.
  3. Histogram-It shows the distribution between 2 Different features with the help of a Histogram.

There are 3 types of selection for the data for which Visualizations will be Shown. By using the Subtract selection type you can drag and select the data points for which you want to analyze the visualizations and by using the Addition selection type you can again drag and add those data points which were excluded.

This feature is very useful if you want to see how visualizations between different features change with and without outliers in fewΒ seconds.

Here you can see that I have excluded outliers marked by pen by choosing subtraction selection and it changed visualizations in all other sections asΒ well.

This is how you can use pandas visual analysis for EDA. One of the main advantages of using this library over other libraries is that you can customize your data point selection.

Before YouΒ Go

Thanks for reading! If you want to get in touch with me, feel free to reach me at [email protected] or my LinkedIn Profile. Also, you can find the code for this article and some really useful data science projects on my GitHubΒ account.


Exploratory Data Analysis in Few Seconds was originally published in Towards AI on Medium, where people are continuing the conversation by highlighting and responding to this story.

Published via Towards AI

Feedback ↓