This Pandas Trick Will Blow Your Mind As a Data Scientist!
Last Updated on November 1, 2024 by Editorial Team
Author(s): Gencay I.
Originally published on Towards AI.
Automate Data Analysis with Pandas
Image created with Abidin Dino AI. To access it, consider becoming a paid subscriber to LearnAIWithMe.
Pandas is one of the most powerful data science libraries, but what if I told you that you could automate data analysis and complete your work with just a click?
In this article, we'll explore how to do this, but first, let's look at what the final script will look like.
We will need 8 steps to achieve this.
Screenshot of the output
Here, you can upload any CSV file you want and, after just one click, see:
- First rows
- Last rows
- Data types
- Statistical summary
- Missing values
- Correlation matrix
You can also see the following for the columns you have selected:
- Value counts
- Unique values
- Histogram
- Box plot
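To make these views concrete, here is a minimal sketch of how each one could be produced with plain pandas calls. It is not the article's final script: the helper names `overview` and `column_details` and the tiny sample dataset are made up for illustration.

```python
import io

import pandas as pd


def overview(df: pd.DataFrame, n: int = 5) -> dict:
    """Collect the six dataset-level views into a dict (hypothetical helper)."""
    # Correlation is only defined for numeric columns, so select them first.
    numeric = df.select_dtypes(include="number")
    return {
        "first_rows": df.head(n),
        "last_rows": df.tail(n),
        "data_types": df.dtypes,
        "statistical_summary": df.describe(),
        "missing_values": df.isna().sum(),
        "correlation_matrix": numeric.corr(),
    }


def column_details(df: pd.DataFrame, column: str) -> dict:
    """Collect the per-column views for one selected column (hypothetical helper)."""
    return {
        "value_counts": df[column].value_counts(),
        "unique_values": df[column].unique(),
    }


# Made-up sample data standing in for an uploaded CSV file.
csv_text = "city,price,rooms\nOslo,100,2\nRome,150,3\nOslo,,4\n"
df = pd.read_csv(io.StringIO(csv_text))

report = overview(df)
details = column_details(df, "city")

for title, view in {**report, **details}.items():
    print(f"--- {title} ---")
    print(view)
```

For the two plots, `df[column].plot.hist()` and `df[column].plot.box()` are one possible route once matplotlib is installed. The original script may draw them differently, for example with seaborn.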
Before doing that, let's set up the environment.
pip install pandas numpy ipywidgets matplotlib seaborn
Now, let's import the libraries and create the widget you will use to upload the dataset.
import pandas as pd
import numpy as np
import ipywidgets as widgets
from ipywidgets import FileUpload
from IPython.display import display, clear_output
import matplotlib.pyplot as plt
import seaborn as sns
import io

# Output area for displaying results
output = widgets.Output()

# File upload widget to upload CSV files
upload_widget = FileUpload(
    accept='.csv',
    multiple=False,
    description='Upload CSV File'
)

# Display the widget
display(upload_widget)
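Once a file has been uploaded, its bytes have to be turned into a DataFrame. The sketch below shows one way that could work and is not necessarily how the full article does it. The helper names `first_upload_bytes` and `csv_bytes_to_frame` are hypothetical. It assumes that ipywidgets 8 exposes `FileUpload.value` as a tuple of dict-like entries and ipywidgets 7 as a dict keyed by filename, with the raw file under a `"content"` key in both cases. The fake values at the bottom only imitate those shapes so the snippet runs outside a notebook.

```python
import io

import pandas as pd


def first_upload_bytes(value) -> bytes:
    """Return the raw bytes of the first uploaded file (hypothetical helper).

    Assumption: `value` is either a tuple of dict-like entries (ipywidgets 8)
    or a dict keyed by filename (ipywidgets 7); both store the file under
    the "content" key.
    """
    if not value:
        raise ValueError("no file uploaded yet")
    entries = list(value.values()) if isinstance(value, dict) else list(value)
    # bytes() also accepts the memoryview that newer versions hand back.
    return bytes(entries[0]["content"])


def csv_bytes_to_frame(raw: bytes) -> pd.DataFrame:
    """Parse CSV bytes into a DataFrame via an in-memory buffer."""
    return pd.read_csv(io.BytesIO(raw))


# Fake upload values imitating the two assumed shapes (not real widget output).
fake_v8 = ({"name": "a.csv", "content": memoryview(b"x,y\n1,2\n")},)
fake_v7 = {"a.csv": {"metadata": {}, "content": b"x,y\n1,2\n"}}

frame_v8 = csv_bytes_to_frame(first_upload_bytes(fake_v8))
frame_v7 = csv_bytes_to_frame(first_upload_bytes(fake_v7))
print(frame_v8)
```

In a notebook, the same helper would be called as `first_upload_bytes(upload_widget.value)`. One option for triggering it on upload is `upload_widget.observe(handler, names="value")`, where `handler` is a callback you define yourself.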
At this step, we will define the… Read the full blog for free on Medium.
Published via Towards AI