Kaushik Rajan | Towards AI

LLM Benchmarks Are Junk Science

69 likes

April 1, 2026

Author(s): Kaushik Rajan Originally published on Towards AI. An Oxford review of 445 benchmarks found 84% lack basic statistical testing. Models score 90% on standard tests but 2% on unseen problems. A 5-question smell test for any benchmark claim. Over the past …

Artificial Intelligence Latest Machine Learning

AI Bots Formed a Cartel. No One Told Them To.

Kaushik Rajan

14 likes

February 22, 2026

Author(s): Kaushik Rajan Originally published on Towards AI. Inside the research that shows algorithmic price-fixing isn’t a bug in the code. It’s a feature of the math. A sealed-bid auction. Six participants: three buyers, three sellers. An optional messaging channel (think WhatsApp, …

Artificial Intelligence Latest Machine Learning

When AI Agents Forget What They Saw: The Goal Drift Problem in Video Research

Kaushik Rajan

14 likes

January 19, 2026

Author(s): Kaushik Rajan Originally published on Towards AI. Why more autonomy doesn’t always mean better performance, and what the first video deep research benchmark reveals about the limits of agentic AI. You’re watching a museum tour video. Someone asks: “What’s the registration …

Artificial Intelligence Latest Machine Learning

The Prism Hypothesis: Why AI Vision Systems Have Been Looking at the World Wrong

Kaushik Rajan

13 likes

December 25, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. Vision models either understand images or generate them well. A frequency-based view dissolves the trade-off. Here’s a puzzle that has quietly haunted computer vision for years: Contrastive Language-Image Pre-training (CLIP), OpenAI’s model that learns …

Artificial Intelligence Latest Machine Learning

Thinking with Video: The Next Leap in Multimodal AI Reasoning

Kaushik Rajan

17 likes

December 3, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. How video generation models like Sora-2 are bridging the gap between static images and dynamic understanding. I still remember the first time I saw a Vision Language Model (VLM) describe a complex image. It …

Artificial Intelligence Latest Machine Learning

Teaching AI to Say “I Don’t Know”

Kaushik Rajan

23 likes

October 3, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. A deep dive into TruthRL, a new reinforcement learning method making large language models more honest. I once asked an early AI model for a biography of a niche historical figure. It confidently spun …

Artificial Intelligence Latest Machine Learning

LLMs Don’t Just Need to Be Smart — They Need to Be Specific. Here’s How.

Kaushik Rajan

31 likes

September 24, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. How a new technique called “Test-Time Deliberation” teaches AI to think before it speaks. I spend a lot of my time wrestling with Large Language Models (LLMs). The goal is always the same: how …

Artificial Intelligence Latest Machine Learning

Researchers put AI in a Room with Regulators and a Game of Trust. It Didn’t Go Well.

Kaushik Rajan

27 likes

September 19, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. A new study uses game theory to simulate how AI agents, developers, and users interact. I’ve spent countless hours thinking about AI safety. It’s the kind of topic that keeps you up at night. …

Artificial Intelligence Latest Machine Learning

We’ve Been Measuring AI Reasoning All Wrong. Here’s How to Fix It.

Kaushik Rajan

43 likes

September 11, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. A new research paper reveals how we can teach language models to actually think, not just guess the right answer. Imagine a math student who consistently aces every test. You’re impressed. But one day, …

Artificial Intelligence Latest Machine Learning

Why Your AI Is a Fluent Liar

Kaushik Rajan

39 likes

September 8, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. A deep dive into the research that explains why AI hallucinations are an inherent feature of Large Language Models, not just a bug. You’ve probably seen it before. You ask an AI chatbot a …

Artificial Intelligence Latest Machine Learning

Beyond Text-to-Speech: The Next Wave of Generative Audio

Kaushik Rajan

21 likes

August 29, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. How Step-Audio 2 is changing the game for creators and developers. For years, AI-generated audio has felt like a technology perpetually on the verge of a breakthrough. We’ve all heard it: the robotic voice …

Artificial Intelligence Computer Vision Latest Machine Learning

From Pixels to Understanding: A Better Way for AI to See

Kaushik Rajan

127 likes

August 28, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. How a new “denoising” technique is making on-device computer vision faster, smarter, and ready for your next app. Computer vision on mobile devices is a quiet miracle. It powers the face-unlock on your phone, …

Artificial Intelligence Computer Vision Latest Machine Learning

From Pixels to Understanding: A Better Way for AI to See

Kaushik Rajan

106 likes

August 28, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. How a new “denoising” technique is making on-device computer vision faster, smarter, and ready for your next app. Computer vision on mobile devices is a quiet miracle. It powers the face-unlock on your phone, …

Artificial Intelligence Latest Machine Learning

I Built an AI Rock Identifier App in a Weekend With SwiftUI & Gemini

Kaushik Rajan

20 likes

August 28, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. How I took a simple idea from concept to the App Store in just 48 hours. You’re on a hike and find a stone with mesmerizing, deep-purple crystals. Is it amethyst? Fluorite? For most …

Artificial Intelligence Latest Machine Learning

How I Built an Adaptive Concept Explainer Using Hugging Face Models

Kaushik Rajan

25 likes

March 29, 2025

Author(s): Kaushik Rajan Originally published on Towards AI. Demystifying Complex Ideas Through Multi-Level Explanations. (Image credit: Generative AI, ChatGPT 4o.) Have you ever found yourself trying to wrap your head around a complex concept, or having to break down something technical for someone who …



