Data Visualization β An Underrated Art
Author(s): Saiteja Kura Originally published on Towards AI. 1. Data-Ink Ratio It is essential to choose the right one! (Source β Data Science Central) Raise your hands if you ever came across the terms data visualization and data visualization tools like Tableau …
Implementation of Principal Component Analysis from scratch
Author(s): Navoneel Chakrabarty Originally published on Towards AI. Letβs Get Started Real-time data may have a vast number of attributes, which often makes essential Exploratory Data Analytics very difficult. Such data are known as highly Multi-Dimensional Data in which each and every …
Data Science Tools to Get Started in 2020
Author(s): Slava Vaniukov Originally published on Towards AI. Data science is a discipline that is gaining importance as the world becomes increasingly dependent on vast stores of digital information. It is the study of data involving methods of collecting, storing, and analyzing …
Efficient Pandas: Using Chunksize for Large Data Sets
Author(s): Lawrence Alaso Krukrubo Originally published on Towards AI. Question One: Data Science professionals often encounter very large data sets with hundreds of dimensions and millions of observations. There are multiple ways to handle large data sets. We all know about the …
Top 10 Tech Skills in 2020 Worldwide
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Top 10 tech skills in 2020 worldwide, and in the United States. Skills related to data are of the highest demand. Photo by Benjamin O. Tayo As we enter a new year …
Feature Selection and Dimensionality Reduction Using Covariance Matrix Plot
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. 1. Import Necessary Libraries Top highlight This article will discuss how the covariance matrix plot can be used for feature selection and dimensionality reduction. A machine learning algorithm (such as classification, clustering …
Machine Learning Model for Stochastic Processes
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Abstract: Using the loan_timing.csv dataset provided, we built a simple model using the Monte Carlo simulation for predicting the fraction of loans that will default after the 3-year duration of the loan. …
Tutorial on Data Visualization: Weather Data
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Weather data analysis and visualization using Pythonβs Matplotlib Data Visualization is more of an Art than Science. To produce a good visualization, you need to put several pieces of code together for …
Outlier Detection and Treatment: A Beginner's Guide
Author(s): Swetha Lakshmanan Originally published on Towards AI. Nature of Outliers: Top highlight One of the most important steps in data pre-processing is outlier detection and treatment. Machine learning algorithms are very sensitive to the range and distribution of data points. Data …
A Gaussian Approach to the Detection of Anomalous Behavior in Server Computers
Author(s): Navoneel Chakrabarty Originally published on Towards AI. Letβs detect the anomalyβ¦ Anomaly Detection is a different variant of Machine Learning Problems that falls under Semi-Supervised Learning. It is Semi-Supervised because, in Anomaly Detection (also popularly known as Outlier Detection), models often …