66 | Towards AI

The Rise of Vector Databases: Understanding Vector Search and RAG Pipeline

21 likes

June 4, 2024

Author(s): Shwetha Acharya Originally published on Towards AI. What is a Vector? Vector is an object that possesses both magnitude and direction. It is represented as an array of numbers that define its dimensionality. Here is an example of how vectors — …

Technical Post-Mortem of a Data Migration Event

ifttt-user

17 likes

June 4, 2024

Author(s): Vishnu Regimon Nair Originally published on Towards AI. Key Objectives of Data Migration. Image by Author In this data-driven landscape, extracting the maximum value from data is crucial for success. As data volumes grow exponentially, organizations face considerable pressure to optimize …

Artificial Intelligence Latest Machine Learning

The Architecture of Mistral’s Sparse Mixture of Experts (S〽️⭕E)

ifttt-user

17 likes

June 4, 2024

Author(s): JAIGANESAN Originally published on Towards AI. Exploring Feed Forward Networks, Gating Mechanism, Mixture of Experts (MoE), and Sparse Mixture of Experts (SMoE). Photo by Ticka Kao on Unsplash Introduction:🥳 In this article, we’ll dive deeper into the specifics of Mistral’s SMoE …

Latest Machine Learning

Unsupervised Clustering: Can We Identify Clusters in the Descriptions of Sounds in Music?

ifttt-user

19 likes

June 3, 2024

Author(s): Greg Postalian-Yrausquin Originally published on Towards AI. The data used is tricky because it is a list of Spotify songs, which are assigned values that describe the sounds in them. At this point, the goal is to see if those descriptions …

Latest Machine Learning

How To Use Target Encoding in Machine Learning Credit Risk Models — Part 1

ifttt-user

17 likes

June 3, 2024

Author(s): Varun Nakra Originally published on Towards AI. Target encoding, also known as mean encoding or likelihood encoding, is a technique used to convert categorical variables into numerical values based on the target variable in supervised learning tasks. This method is particularly …

Latest Machine Learning

Web scraping & NLP

ifttt-user

18 likes

June 3, 2024

Author(s): Greg Postalian-Yrausquin Originally published on Towards AI. In this example, I extract data from a Wikipedia list of the most grossing movies go into each of the links and fetch the text of the movie’s article. Then I use BERTopic (which …

Latest Machine Learning

Using NLP (Doc2Vec) and Neural Networks (with Keras): Removing Hate Speech and Offensive Tweets

ifttt-user

16 likes

June 3, 2024

Author(s): Greg Postalian-Yrausquin Originally published on Towards AI. This is a great example of how more than one ML step can be used to achieve a goal. In this exercise, I will combine NLP (Doc2Vec) with binary classification to extract offensive and …

Latest Machine Learning

Perfect Answer to Deep Learning Interview Question — Why Not Quadratic Cost Function?

ifttt-user

14 likes

June 1, 2024

Author(s): Varun Nakra Originally published on Towards AI. One of the most common question asked during deep learning knowledge interviews is — “Why can’t we use a quadratic cost function to train a Neural Network?”. We will delve deep into the answer …

Latest Machine Learning

How Do Diffusion Models Work? Simple Explanation: No Mathematical Jargon, Promised!

ifttt-user

15 likes

May 31, 2024

Author(s): Suhaib Arshad Originally published on Towards AI. Background Knowledge Essentially, there are 3 common types of generative models: Generative Adversarial Networks (GANs), Variational Autoencoder, and Flow-based models. Although they have proven their spot as high-quality image-generating models, they fall short on …

Latest Machine Learning

Data Science Interview Question: Creating ROC & Precision-Recall Curves From Scratch

ifttt-user

16 likes

May 30, 2024

Author(s): Varun Nakra Originally published on Towards AI. This is one of the popular data science interview questions which requires one to create the ROC and similar curves from scratch, i.e., no data on hand. For the purposes of this story, I …

Frequently Used, Contextual References

Resources

The Rise of Vector Databases: Understanding Vector Search and RAG Pipeline

Technical Post-Mortem of a Data Migration Event

The Architecture of Mistral’s Sparse Mixture of Experts (S〽️⭕E)

Unsupervised Clustering: Can We Identify Clusters in the Descriptions of Sounds in Music?

How To Use Target Encoding in Machine Learning Credit Risk Models — Part 1

Web scraping & NLP

Using NLP (Doc2Vec) and Neural Networks (with Keras): Removing Hate Speech and Offensive Tweets

Perfect Answer to Deep Learning Interview Question — Why Not Quadratic Cost Function?

How Do Diffusion Models Work? Simple Explanation: No Mathematical Jargon, Promised!

Data Science Interview Question: Creating ROC & Precision-Recall Curves From Scratch

Recent Posts

Part 20: Data Manipulation in Multi-Dimensional Aggregation

A Fundamental Introduction to Genetic Algorithm -Part Two

TAI #200: Anthropic’s Mythos Capability Step Change and Gated Release

From Notebook to Production: Running ML in the Real World (Part 4)

Sqribble’s Template‑Driven Document Automation

Anthropic Just Shipped the Layer That’s Already Going to Zero

The L1 Loss Gradient, Explained From Scratch

Your Postcode Is Deciding Your Care. I Built a Pipeline to Prove It.

Comprehensive AI Engineering and AI for Work certifications

Company

CONTACT US

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Frequently Used, Contextual References

Resources

Recent Posts

Comprehensive AI Engineering and AI for Work certifications

Company

CONTACT US

GDPR CCPA Statement