Data Science Curriculum
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Recommended curriculum for intro-level data science self-study. Photo by Kelly Sikkema on Unsplash. As a data science educator, lots of people interested in getting into data science have contacted me …
Microsoft Data Science Interviews
Author(s): Kenny Kim. Originally published on Towards AI. Understanding the interview process from the interviewer's perspective from ex-Microsoft and ex-Expedia. Photo by Andrew Mantarro on Unsplash. A couple of weeks ago, I saw a few posts about the Microsoft interview process. Both …
Productivity Tools for Large-scale Data Science Projects
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Analyzing different productivity tools used for real-world, industrial-type projects. Photo by imgix on Unsplash. Basic productivity tools for data science such as Jupyter Notebook and RStudio are good tools for …
Multi-lingual Language Model Fine-tuning
Author(s): Edward Ma. Originally published on Towards AI. The Problem of Low-resource Languages. Photo by Chloe Evans on Unsplash. English is one of the best-resourced languages in the natural language processing field. Lots of state-of-the-art NLP models support English natively. To tackle multi-lingual …
Efficient Pandas: Using Chunksize for Large Data Sets
Author(s): Lawrence Alaso Krukrubo. Originally published on Towards AI. Question One: Data science professionals often encounter very large data sets with hundreds of dimensions and millions of observations. There are multiple ways to handle large data sets. We all know about the …
Why is Python the Ideal Programming Language for AI and Data Science?
Author(s): Kristy Hill. Originally published on Towards AI. Image by Geralt at Pixabay. Is Python the preferred programming language for AI and Data Science today? If yes, then let's learn the logical reasons in this article, along with why it is an …
Titanic Challenge – Machine Learning for Disaster Recovery
Author(s): Bindhu Balu. Originally published on Towards AI. GitHub repo: https://github.com/BindhuVinodh/Titanic-Data-Visualization The Titanic challenge hosted by Kaggle is a competition in which the goal is to predict the survival or the death of a given passenger based on a set of …
Machine Learning Resources from Sebastian Raschka
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Open-source resources from a bestselling author and machine learning expert that will enable you to unlock deeper insights into cutting-edge machine learning and deep learning. Photo by Christopher Gower on Unsplash. In …
Sources of Error in Machine Learning
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. The solution to a machine learning problem is not unique. The predictive power of a model depends on the experience of the data scientist in dealing with sources of error. …
Summary of 2019 Accomplishments as a Data Science Blogger
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. Reviewing 2019 accomplishments and lessons learned as a Medium writer, as well as discussing goals for 2020. Photo by Nicolas Tissot on Unsplash. The year 2019 has been a remarkable year for …
Start Data Science Blogging in 2020
Author(s): Benjamin Obi Tayo Ph.D. Originally published on Towards AI. How to Start with Blogging in Data Science. Photo by Corinne Kutz on Unsplash. Medium is now considered one of the most popular blogging sites. Medium is a platform specifically designed for …