I Gave Qwen3.7-Plus a Screenshot and It Found the Exact Pixel to Click for $0.40
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. I Gave Qwen3.7-Plus a Screenshot and It Found the Exact Pixel to Click for $0.40 I uploaded a messy AWS console screenshot and asked one question: which pixel do I …
Moonshot Cracked Claude Code’s Playbook with an MIT Terminal Agent and a $0.60 Model
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. Why this matters right now A Chinese lab just shipped a terminal coding agent that does almost everything Claude Code does, released the entire thing under the MIT license, and …
Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn’t Be This Good
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. Google Shrank Gemma 4 by 72% and Unsloth Fixed the 4-Bit Bug Nobody Else Caught on One 4090, and 4-Bit Shouldn't Be This Good A 26-billion-parameter model has no business …
I Deleted 95% of My AI Agent’s Skills and Accuracy Jumped From 77% to 97%
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. How an “agent skill” actually is A DX engineer at WorkOS named Nick Nisi did something that sounds like sabotage. He took a 10,000-line library of auto-generated “skills” he had …
MiniMax M3 Decodes 1M Tokens 15x Faster — and It Shouldn’t Be This Cheap
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. MiniMax M3 Decodes 1M Tokens 15x Faster — and It Shouldn’t Be This Cheap On June 1, a Shanghai lab quietly shipped a model that decodes a 1-million-token context 15.6x …
I Ran a 1.5B-Active Model on My Laptop That Embarrassed a 26B by 46 Points
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. I Ran a 1.5B-Active Model on My Laptop That Embarrassed a 26B by 46 Points I did not expect a model that activates 1.5 billion parameters to walk all over …
NVIDIA’s 550B Nemotron Embarrassed Every US Open Model — and It Shouldn’t Run This Fast
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. NVIDIA's 550B Nemotron Embarrassed Every US Open Model — and It Shouldn't Run This Fast NVIDIA just shipped a 550B-parameter open model that scores 48 on the Artificial Analysis Intelligence …
I Ran Claude Code on My MacBook With vllm-mlx — It Embarrassed llama.cpp by 87%
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. I Ran Claude Code on My MacBook With vllm-mlx — It Embarrassed llama.cpp by 87% I did something this week that I assumed would be a slow, frustrating downgrade: I …
Microsoft Just Embarrassed Browser Web Agents — 1,000 Lines Made GPT-5.4 Beat Opus 4.6 on 200 Web Tasks
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. Microsoft Just Embarrassed Browser Web Agents — 1,000 Lines Made GPT-5.4 Beat Opus 4.6 on 200 Web Tasks A Microsoft Research lab spent the last few weeks watching every other …
Sebastian Raschka’s New Repo Builds a DeepSeek-R1 Clone in 8 Chapters — and It Shouldn’t Be This Simple
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. Sebastian Raschka's New Repo Builds a DeepSeek-R1 Clone in 8 Chapters — and It Shouldn't Be This Simple For the last year I have treated reasoning models the way most …
Two HTML Attributes Now Turn Your Website Into an AI Agent Tool — Inside Chrome’s WebMCP
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. Two HTML Attributes Now Turn Your Website Into an AI Agent Tool — Inside Chrome's WebMCP At Google I/O 2026, buried under Gemini 3.5 and a press release bragging about …
Merve Noyan Stopped Writing Training Scripts — Her Agent Just Fine-Tuned 18 Models Solo for $11.40
Author(s): Chew Loong Nian – AI ENGINEER Originally published on Towards AI. The 17,300-view AI Engineer Singapore talk that quietly killed half my MLOps job I watched Merve Noyan’s “Your Agent Can Now Train Models” talk three times this week. It went …