Tutorial on Data Wrangling: College Towns Dataset
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Data wrangling is the process of converting data from its raw form to the tidy form ready for analysis. Data wrangling is an important step in data preprocessing and includes several processes …
Several Ways for Machine Learning Model Serving (Model as a Service)
Author(s): Edward Ma Originally published on Towards AI. Using Model as a Service (MaaS) on Cloud Platforms Top highlight Photo by Edward Ma on Unsplash No matter how well you build a model, no one knows it if you cannot ship model. …
Why Precision and Recall metric?
Author(s): Jalal Mansoori Originally published on Towards AI. What is a Class-imbalanced dataset? Image by Author Before answering the above question let me tell you my experience when I was learning about the evaluation of learning algorithms in classification problems. Currently, I …
Data Science 101 β A Short Course on Medium Platform with R and Python Code Included
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Data Science 101 is intended for individuals that have some prior exposure or knowledge in data science concepts and are interested in practical applications beyond what is offered in most introductory-level data …
New Model for Word Embeddings which are Resilient to Misspellings (MOE)
Author(s): Edward Ma Originally published on Towards AI. Photo by Edward Ma on Unsplash Traditional word embeddings are good at solving lots of natural language processing (NLP) downstream problems such as documentation classification and named-entity recognition (NER). However, one of the drawbacks …
4 Tips To Write Scalable Apache Spark Code
Author(s): ___ Originally published on Towards AI. In this article, I will share some tips on how to write scalable Apache Spark code. The examples presented here are actually based on the code I encountered in the real world. So, by sharing …
How does Data Noising Help to Improve your NLP Model?
Author(s): Edward Ma Originally published on Towards AI. The objective of injecting data noising Photo by Edward Ma on Unsplash Introducing data noising to neural network aims at improving model generalization and performance. Xie et al. proposes several ways to generate more …
How You Should Save the Output of your Spark ETL Jobs (If you are not Writing to a Database)
Author(s): ___ Originally published on Towards AI. In this article, I will share my thoughts on the best way to save the output of Spark ETL jobs so that it is easier to do analytical work later. The code to reproduce the …
15 Months of Writing Data Science Articles on Medium β Accomplishments and Lessons Learned
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. I joined Medium over a year ago. I published my first medium article Using sapply() function in R to generate a table on July 7, 2018. This article was written mostly for …
Review: DUNet β Deformable U-Net for Retinal Vessels Segmentation (Biomedical Image Segmentation)
Author(s): Sik-Ho Tsang Originally published on Towards AI. Technical Review of DUNet U+007C Towards AI U-Net+DCN, Outperforms U-Net & DCN In this story, DUNet, by Tianjin University, Linkoping University, and, is briefly reviewed. DUNet, Deformable U-Net: exploits the retinal vesselsβ local features …
How R-CNN Works on Object Detection?
Author(s): Edward Ma Originally published on Towards AI. Introduction to Region with Convolutional Neural Networks (R-CNNs) Photo by Edward Ma on Unsplash Region with Convolutional Neural Network (R-CNN) is proposed by Girshick et al. in 2013. It changed the object detection field …
Six Reasons Why We Need to Take a Collaborative Approach to AI Development
Author(s): Rudradeb Mitra Originally published on Towards AI. Courtesy: Flickr (This article is co-authored with Yang Gao and Michael Burkhardt) A century ago, in 1906, statistician Francis Galton showed the power of the crowd. At the International Exhibition in London, he asked …
Review: IEF β Iterative Error Feedback (Human Pose Estimation)
Author(s): Sik-Ho Tsang Originally published on Towards AI. A Review on IEF U+007C Towards AI Outperforms Tompson NIPSβ14, and Tompson CVPRβ15 Getting Better and Better From Left to Right With Iterative Error Feedback (IEF) In this story, IEF (Iterative Error Feedback), by …
Training a Machine Learning Model on a Dataset with Highly-Correlated Features
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. 1. Import necessary libraries In the previous article (Feature Selection and Dimensionality Reduction Using Covariance Matrix Plot), weβve shown that a covariance matrix plot can be used for feature selection and dimensionality …