Linear Function Approximation in Reinforcement Learning
Author(s): Shivam Mohan Originally published on Towards AI. This member-only story is on us. Upgrade to access all of Medium. In reinforcement learning (RL), a key challenge is estimating the value function, which predicts future rewards based on the current state. In …
Understanding Chain-of-Thought (CoT) Reasoning: The Core Behind OpenAIβs o1 Model
Author(s): Shivam Mohan Originally published on Towards AI. This member-only story is on us. Upgrade to access all of Medium. Chain-of-Thought (CoT) reasoning is an approach that significantly enhances the reasoning abilities of large language models (LLMs) by breaking down complex problems …
Retrieval-Augmented Generation (RAG): LLMs with Real-Time Knowledge
Author(s): Shivam Mohan Originally published on Towards AI. This member-only story is on us. Upgrade to access all of Medium. Artificial Intelligence (AI) constantly evolves, and Retrieval-Augmented Generation (RAG) is at the forefront of this revolution. By merging the text generation capabilities …
Mastering Prompt Engineering: A Beginnerβs Guide to AI Interaction
Author(s): Shivam Mohan Originally published on Towards AI. This member-only story is on us. Upgrade to access all of Medium. Introduction Prompt engineering is an essential skill for anyone venturing into the world of artificial intelligence (AI). By designing effective prompts, we …