What I Learned Today About Apache Spark Architecture
Author(s): Abinaya Subramaniam Originally published on Towards AI. Apache Spark often feels magical when we first start using it. We write a few lines of PySpark code, hit run, and suddenly terabytes of data are being processed in seconds. But behind this …
RAG Text Chunking Strategies: Optimize LLM Knowledge Access
Author(s): Abinaya Subramaniam Originally published on Towards AI. If retrieval is the search engine of your RAG system, chunking is the foundation the search engine stands on. Even the strongest LLM fails when the chunks are too long, too short, noisy, or …
Resampling Methods in Action: How Bootstrap and Jackknife Improve our Estimates
Author(s): Abinaya Subramaniam Originally published on Towards AI. Imagine trying to understand a population based on a small sample. We calculate a statistic, maybe the mean test score of students, the average income of households, or the correlation between two variables. But …
Retrieval Augmented Generation (RAG) Explained: Why AI Needs It
Author(s): Abinaya Subramaniam Originally published on Towards AI. Large Language Models (LLMs) have rapidly become the engine behind intelligent applications, from chatbots to document assistants to sophisticated automation tools. Their ability to understand context, reason through text, and generate human-like responses often …
Building an Advisory Expert System to Find the Best Apartments in Colombo
Author(s): Abinaya Subramaniam Originally published on Towards AI. In today’s fast paced urban lifestyle, finding the perfect apartment in a bustling city like Colombo can be overwhelming. With countless options differing in location, price, developer, amenities, and overall quality, prospective buyers often …
Cookiecutter Data Science: A Standardized, Flexible Approach for Modern Data Projects
Author(s): Abinaya Subramaniam Originally published on Towards AI. In the ever-evolving world of data science, one of the biggest challenges isn’t the algorithms or tools, it’s project organization. If you are working solo or collaborating with a team, maintaining a clean, reproducible, …
Breaking Down YOLO: How Real Time Object Detection Works Step by Step
Author(s): Abinaya Subramaniam Originally published on Towards AI. Object detection is one of the most interesting areas of computer vision. It is the process of identifying and locating objects in an image. Popular examples include detecting cars on a road, identifying products …
Understanding Multimodal LLMs: The Next Evolution of AI
Author(s): Abinaya Subramaniam Originally published on Towards AI. Discover how multimodal LLMs are transforming AI by combining text, images, audio, and video into a single reasoning system. Learn how they work, real-world applications, challenges, and why they’re the next evolution beyond text-only …
Inside World Models and V-JEPA: Building AI That Predicts Reality
Author(s): Abinaya Subramaniam Originally published on Towards AI. Artificial intelligence has dazzled the world with its ability to generate text, images, and even music. Large Language Models (LLMs) like GPT and multimodal systems that combine text, vision, and audio have pushed the …
DoodlAI- Build a Real-Time Doodle Recognition AI with CNN
Author(s): Abinaya Subramaniam Originally published on Towards AI. Have you ever wondered if a computer could recognize your doodles of cats, trees, cars, or even clocks, as you draw them? That’s exactly what DoodlAI does. In this blog, I’ll take you step …
Building an Employee Onboarding Chatbot with RAG, FastAPI, and AI
Author(s): Abinaya Subramaniam Originally published on Towards AI. Learn how to build a smart employee onboarding assistant using Retrieval-Augmented Generation (RAG), FastAPI, and LLMs. Step-by-step guide with code, vector embeddings, and conversation memory to create a context-aware AI chatbot for your company …
Understanding Logistic Regression: Theory, Intuition, and Applications
Author(s): Abinaya Subramaniam Originally published on Towards AI. Logistic Regression — Image by Author You may already be familiar with linear regression, which models a straight line relationship between input features and a continuous output. However, when the output variable is categorical, …
Handling Imbalanced Datasets in Machine Learning: SMOTE, Oversampling & Undersampling Explained
Author(s): Abinaya Subramaniam Originally published on Towards AI. Imbalanced Datasets — Image by Author What are imbalanced Datasets? In many real-world classification problems, the number of samples in each class is not balanced. This is called an imbalanced dataset. For example, in …
The Power of Prolog: A Comprehensive Guide to Logic Programming for AI
Author(s): Abinaya Subramaniam Originally published on Towards AI. Programming in Logic — Image by Author In procedural or object-oriented programming (OOP), such as languages like C, Java, or Python, the focus is on how to do things. In these languages, we write …