Diffuse and Disperse: Image Generation with Representation Regularization (Paper Review)
Author(s): Hira Ahmad Originally published on Towards AI. Diffuse and Disperse: Image Generation with Representation Regularization (Paper Review) Diffusion models have redefined the frontiers of generative AI, capable of transforming noise into highly structured, realistic images. But as these models grow, a …
Continual Learning via Sparse Memory Finetuning (Paper Review)
Author(s): Hira Ahmad Originally published on Towards AI. Continual Learning via Sparse Memory Finetuning (Paper Review) Modern large language models learn vast amounts of knowledge; yet when we try to teach them something new, they tend to forget what they already know. …
DeepSeek-OCR: Contexts Optical Compression (Paper Review)
Author(s): Hira Ahmad Originally published on Towards AI. The Shift from Recognition to Understanding From recognizing letters to reasoning through meaning, DeepSeek-OCR redefines what it means for machines to read. Source ImageDeepSeek-OCR revolutionizes optical character recognition by integrating comprehension and contextual reasoning …
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models (Paper Review)
Author(s): Hira Ahmad Originally published on Towards AI. Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models (Paper Review) Intelligence is not in thinking long, it’s in thinking right.In the race to make machines reason like humans, we’ve trained models …
Less is More: Recursive Reasoning with Tiny Networks (Paper Review)
Author(s): Hira Ahmad Originally published on Towards AI. Less is More: Recursive Reasoning with Tiny Networks (Paper Review) Modern AI often chases scale: deeper layers, more attention heads, and billions of parameters. But hidden beneath this race lies a quieter revolution: recursive …
The Evolving Vision: From Block World to Intelligent Perception
Author(s): Hira Ahmad Originally published on Towards AI. The Evolving Vision: From Block World to Intelligent Perception In the vast history of artificial intelligence, vision has remained one of its most profound and persistent pursuits not merely to capture what humans see, …
When Transformers Multiply Their Heads: What Increasing Multi-Head Attention Really Does
Author(s): Hira Ahmad Originally published on Towards AI. When Transformers Multiply Their Heads: What Increasing Multi-Head Attention Really Does Transformers have become the backbone of many AI breakthroughs, in NLP, vision, speech, etc. A central component is multi-head self-attention: the notion that …
LLM Evaluation Methods: Integrating Binary Evals with Score Evals
Author(s): Hira Ahmad Originally published on Towards AI. LLM Evaluation Methods: Integrating Binary Evals with Score Evals Evaluating large language models (LLMs) is a bit like checking a student’s exam paper, you can grade by impression or you can check each answer …
Deep Reflection: How Modern LLMs are Redefining The Meaning of Writing
Author(s): Hira Ahmad Originally published on Towards AI. Beyond Function: The Search for Understanding Sometimes I wonder if we truly understand what we’ve created. These language models, magnificent in design and frightening in consequence, have begun to speak like us, yet they …
AI Roadmap: Foundation Models and Beyond
Author(s): Hira Ahmad Originally published on Towards AI. AI Roadmap: Foundation Models and Beyond Artificial Intelligence has evolved into an ecosystem of frameworks, architectures, and methodologies that together define how we build and understand intelligent systems today. Whether you’re beginning your journey …
From Words to Worlds: Rethinking Embeddings and Ranking in Retrieval
Author(s): Hira Ahmad Originally published on Towards AI. To choose the right model for semantic search, consider the trade-offs between a bi-encoder’s speed, a cross-encoder’s precision, and ColBERT’s balance of both. Words alone are insufficient to capture communication; the full message is …
Hugging Face Transformers: The Framework Redefined Modern AI
Author(s): Hira Ahmad Originally published on Towards AI. Introduction: What We See Is Only the Surface Most people know Hugging Face as a library that helps load models like BERT, GPT, or T5 in a few lines of code. But that’s barely …
Building the Practical Foundation of Fine-Tuning Large Language Models (LLMs)
Author(s): Hira Ahmad Originally published on Towards AI. Building the Practical Foundation of Fine-Tuning Large Language Models (LLMs) Large Language Models (LLMs) like GPT, LLaMA, and Falcon have changed how machines understand and generate human-like text. Yet, their true power emerges not …
Inter-GPU Communication
Author(s): Hira Ahmad Originally published on Towards AI. Introduction Every major leap in AI over the past decade has been powered by scaling. Models have grown to billions or even trillions of parameters, datasets span millions of examples, and GPUs are deployed …