Have o1 Models Solved Human Reasoning?
Author(s): Nehdiii Originally published on Towards AI. Image Generated By ChatGPT OpenAI made waves in the AI community with the release of their o1 models. As the excitement settles, I feel it’s the perfect time to share my thoughts on LLMs’ reasoning …
DeepSeek-V3 Part 3: Auxiliary-Loss-Free Load Balancing
Author(s): Nehdiii Originally published on Towards AI. This is the third article in our DeepSeek-V3 series, where we explore another key architectural breakthrough in DeepSeek [1, 2, 3] models related to Mixture-of-Experts (MoE): Auxiliary-Loss-Free Load Balancing [5]. Vegapunk №03 One Piece Character …
DeepSeek-V3 Part 2: DeepSeekMoE
Author(s): Nehdiii Originally published on Towards AI. This article marks the second entry in our DeepSeek-V3 series, focusing on a pivotal architectural breakthrough in the DeepSeek models [1, 2, 3]: DeepSeekMoE [4]. Vegapunk №02 One Piece Character Generated with ChatGPT In this …
DeepSeek-V3 Explained, Part 1: Understanding Multi-Head Latent Attention
Author(s): Nehdiii Originally published on Towards AI. Vegapunk No.01 One Piece Character Generated with ChatGPT This is the first article of our new series “DeepSeek-V3 Explained”, where we will try to demystify DeepSeek-V3 [1, 2], the latest model open-sourced by DeepSeek. In …
Extracting Actionable Rules from Raw Data
Author(s): Nehdiii Originally published on Towards AI. Image by DALL-E 3 When working with products, we often encounter situations where introducing certain “rules” becomes necessary. Let me clarify what I mean by “rules” through some practical examples: Imagine we’re facing a surge …
🧠 From CLIP to the Future: A Deep Dive into Vision-Language Models for Vision Tasks
Author(s): Nehdiii Originally published on Towards AI. From recognizing faces in photos to detecting objects in real-time videos, computer vision has revolutionized the way machines “see” the world. Tasks like image classification, object detection, segmentation, and even person re-identification (ReID) have seen …