The Video Frontier: When AI Stopped Watching and Started Understanding
Author(s): Ampatishan Sivalingam Originally published on Towards AI. Part IV of the Multimodal Intelligence Series · The model learned to see. Then it learned to remember what it saw. This stack did not exist in 2023. The U-Net diffusion models that produced …
Latent Space: The Most Important Place That Doesn’t Exist
Author(s): Ampatishan Sivalingam Originally published on Towards AI. How AI navigates invisible dimensions to understand reality, and why you should care Every time you prompt an AI to create a “cyberpunk cat playing jazz,” you are navigating a multi-dimensional map you cannot …
What Are World Models? The Blueprint for the Next Decade of AI
Author(s): Ampatishan Sivalingam Originally published on Towards AI. We built machines that can talk. Now we’re building machines that can think, plan, and imagine, before they ever act. A toddler reaches for a stack of wooden blocks. She doesn’t just see the …
The Death of CNNs: How Vision Transformers Rewrote Computer Vision in 3 Years (Part 1: The CNN Era)
Author(s): Ampatishan Sivalingam Originally published on Towards AI. From AlexNet’s 2012 revolution to ResNet’s dominance, and why it all became obsolete overnight In 2012, a neural network called AlexNet won the ImageNet challenge by a margin so absurd that researchers initially thought …
I Gave Moltbot Access to My Computer for 7 Days: Here’s What Actually Happened (And Who Should Try It)
Author(s): Ampatishan Sivalingam Originally published on Towards AI. An honest account of living with an autonomous AI agent that can actually do things, from the thrilling wins to the terrifying security moments At 2:47 AM on a Tuesday morning, I woke up …