Is it possible to do Text Classification on unlabeled data? (Feat. Zero-Shot Classification) [Experiment]
Author(s): Ala Alam Falaki Originally published on Towards AI. A couple of months ago, I did an experiment to answer the question: Is it possible to do sentiment analysis if we do not have the target …
Finding the Needle in the Haystack: How to Train a Dense Passage Retriever
Author(s): Thilina Rajapakse Originally published on Towards AI. Let’s see how we can train a model to perform dense-passage retrieval with Transformer models using Simple Transformers. Passage retrieval is a conceptually simple task where a system …
An Intuitive Introduction to Document Vector(Doc2Vec)
Author(s): Manish Nayak Originally published on Towards AI. Introduction: Doc2Vec is an extension of Word2vec that encodes entire documents as opposed to individual words. You can read about Word2Vec in my previous post. Doc2Vec vectors represent …
Multi-Label Text Classification Using Scikit-multilearn: a Case Study with StackOverflow Questions
Author(s): Avishek Nag Originally published on Towards AI. Designing a multi-label text classification model that helps tag stackoverflow.com questions with different topics. Every day, users of stackoverflow.com post many technical questions, and all of them get tagged with different topics. In this article, …
Demystifying the Architecture of Long Short Term Memory (LSTM) Networks
Author(s): Manish Nayak Originally published on Towards AI. Introduction: In my previous article, I explained the architecture of RNNs. RNNs are not perfect, and they mainly suffer from two major issues: exploding gradients and vanishing gradients. Exploding gradients …
New Model for Word Embeddings which are Resilient to Misspellings (MOE)
Author(s): Edward Ma Originally published on Towards AI. Traditional word embeddings are good at solving many downstream natural language processing (NLP) problems, such as document classification and named-entity recognition (NER). However, one of the drawbacks …
How does Data Noising Help to Improve your NLP Model?
Author(s): Edward Ma Originally published on Towards AI. The objective of injecting data noising. Introducing data noising to a neural network aims to improve model generalization and performance. Xie et al. propose several ways to generate more …
NTM: Neural Turing Machines
Author(s): Sherwin Chen Originally published on Towards AI. A detailed walk-through of Neural Turing Machines. We discuss the Neural Turing Machine (NTM), an architecture proposed by Graves et al. at DeepMind. NTMs are designed to solve …
DNC: Differential Neural Network
Author(s): Sherwin Chen Originally published on Towards AI. A detailed walk-through of DNC. In the previous article, we discussed Neural Turing Machines (NTMs), which introduced an external memory to maintain information for later retrieval. In this post, …
Emoticon and Emoji in Text Mining
Author(s): Dhilip Subramanian Originally published on Towards AI. Converting emoticons and emojis into word form using Python. In today’s online communication, emojis and emoticons are becoming the primary language that allows us to communicate with anyone globally when you need …
Generate Quotes with Web Scrapping, Glove Embeddings, and LSTM in Pytorch
Author(s): Lakshmi Narayana Santha Originally published on Towards AI. Introduction: With the advances in NLP research, especially in language models, text generation is a classical machine learning task that can be solved using recurrent networks. In this article we walk through …
Sentiment Prediction of Google Play Store Reviews with Tensorflow 2.0
Author(s): Juan Manuel Ciro Torres Originally published on Towards AI. How to train your own algorithm from scratch to classify the sentiment of your app's reviews. You spend hours and hours creating your app, …