One-Line, Magical Code to Perform EDA!
Author(s): Daksh Trehan
Machine Learning, Exploratory Data Analysis
One line solves all your problems!
“Data is the new oil” ~ Clive Humby
Data is an integral part of our life and unlike other resources it is inexhaustible but here comes a catch, it is only useful to your organization if you know how to mend it and get its gist.
Data Science is a process that includes: Collecting, Storing, Processing, Describing, and Modeling.
Processing, Describing/EDA(Exploratory Data Analysis) can be referred to as a lifecycle to get introduced to the data by finding relations among each variable and visualizing them to find hidden trends.
EDA accounts for a large amount of time and effort to clean and explore our data. Though in the field of data processing we still expect some advancements for Data Exploration there have been astounding improvements. Several open-source libraries have come up with a no-code or low-code method to help ease the exploration.
D-Tale is one such library, it is the combination of Flask back-end and React Front-end that brings up an interactive way to visualize and explore pandas data frame.
D-Tale makes sure you aren’t sick of performing df.head() recursively!
- Install D-Tale: Like any other Python Library you can easily install D-Tale by using “pip install dtale” in your command line.
2. Importing Relevant Libraries: Use Seaborn to load dataset and D-tale to visualize and explore.
3. Chose a dataset: From pre-defined dataset in Seaborn chose any one.
4. Take basic insights of data: Use describe() method for the same.
5. Use D-Tale: Load the data using D-Tale library and get interactive insights.
Tap on play button and chose “Open in New Tab”:
Features of D-Tale
All you Data in clean and elegant way!
Look for Statistical Property of your dataset using “Describe” button.
Summarize the data of relevant rows and columns.
Remove Duplicate Values
Check the Correlation in your data for easy creation of charts.
The library helps you with agnostic score that helps to determine linear and non-linear relationship.
The most anticipating feature of D-Tale is its ability to create charts seamlessly.
It offers a wide variety of charts with data cleaning options that are personalized to each chart type.
Another highlighting feature of D-Tale is that it allows you to create chart from the dashboard and directly import the code for the same.
Heat maps can also be employed either on whole data or on particular columns.
You can also check statistical values for particular column and perform column analysis.
Perform Variance analysis on each column.
The user can also change the data of any particular record.
The above mentioned code can be found at: dakshtrehan/D-Tale-Exploration (github.com)
The article helped us to throw a light on extremely powerful EDA reporting tool: D-Tale. We saw how D-Tale make it super easy to create appealing visuals and explore the data.
Feel free to connect:
Portfolio ~ https://www.dakshtrehan.com
LinkedIn ~ https://www.linkedin.com/in/dakshtrehan
Medium ~ https://medium.com/@dakshtrehan
Want to learn more?
Are You Ready to Worship AI Gods?
Detecting COVID-19 Using Deep Learning
The Inescapable AI Algorithm: TikTok
GPT-3 Explained to a 5-year old.
Tinder+AI: A perfect Matchmaking?
An insider’s guide to Cartoonization using Machine Learning
Reinforcing the Science Behind Reinforcement Learning
Decoding science behind Generative Adversarial Networks
Understanding LSTM’s and GRU’s
Recurrent Neural Network for Dummies
Convolution Neural Network for Dummies
Published via Towards AI