Fine-Tuning DeepSeek R1 on Reasoning Task with Unsloth [Part 1]
Author(s): Youssef Hosni Originally published on Towards AI. Hands-On Fine-Tuning DeepSeek on Medical Reasoning Dataset This member-only story is on us. Upgrade to access all of Medium. DeepSeek company recently released DeepSeek-R1, the next step in their work on reasoning models. Itβs …
Comparing DeepSeek-R1 Models: 32B vs 70B vs R1
Author(s): Lorentz Yeung Originally published on Towards AI. DeepSeek has made waves in the AI world. They offer multiple models at the same time, so which one should we choose? This member-only story is on us. Upgrade to access all of Medium. …
Stop Paying for AI! How to Run DeepSeek Locally for Free
Author(s): Prisca Ekhaeyemhe Originally published on Towards AI. This member-only story is on us. Upgrade to access all of Medium. Are you tired of API costs and privacy concerns when using AI models? What if you could run a powerful AI like …
Month in 4 Papers (January 2025)
Author(s): Ala Falaki, PhD Originally published on Towards AI. This member-only story is on us. Upgrade to access all of Medium. How Language Models Learn to Think, Judge, and Scale: From Code Evaluation to Memory-Efficient Reasoning. This series of posts is designed …
DeepSeekβs Disruptive Debut: AI Winter or Efficiency Revolution?
Author(s): RSD Studio.ai Originally published on Towards AI. This member-only story is on us. Upgrade to access all of Medium. Source: Google A Seismic Shift in AI For some years, the AI industry operated under an unshakable assumption: progress would demand exponential …
Benchmarking ChatGPT, Qwen, and DeepSeek on Real-World AI Tasks
Author(s): HarshVardhan jain Originally published on Towards AI. This member-only story is on us. Upgrade to access all of Medium. Photo by Aidin Geranrekab on Unsplash The wealthy tech giants in the U.S. once dominated the AI market but DeepSeekβs release caused …
Reinforcement Learning-Driven Adaptive Model Selection and Blending for Supervised Learning
Author(s): Shenggang Li Originally published on Towards AI. Inspired by Deepseeker: Dynamically Choosing and Combining ML Models for Optimal Performance This member-only story is on us. Upgrade to access all of Medium. Photo by Agence Olloweb on Unsplash Machine learning model selection …
DeepSeek AI β The Future is Here
Author(s): M. Haseeb Hassan Originally published on Towards AI. The rise of AI is massively affecting the society. It is not something that will happen someday, it has happened already. AI is being discussed in various sectors like healthcare, banking, education, manufacturing, …
How Does Deepseek AI Actually Work? What Makes It Different?
Author(s): Sakshi Shruti Originally published on Towards AI. β οΈ Insanely cheap AI model is disrupting the market This member-only story is on us. Upgrade to access all of Medium. image generated from Leonardo AI I know you might have already come across …
AI Hallucinations: Why Large Language Models Make Up Information and How to Address It
Author(s): Rohan Rao Originally published on Towards AI. This member-only story is on us. Upgrade to access all of Medium. Photo by julien Tromeur on Unsplash I was going through a few basic topics of AI and suddenly I found βAI hallucinationsβ. …
Fine-tuning DeepSeek R1 to respond like Humans using Python!
Author(s): Krishan Walia Originally published on Towards AI. Learn to Fine-Tune Deep Seek R1 to respond as humans, through this beginner-friendly tutorial! Krishan Walia This member-only story is on us. Upgrade to access all of Medium. Letβs make DeepSeek R1 respond like …
Hands-On: Prompt Engineering with Ollama and Google Colab
Author(s): Sayanteka Chakraborty Originally published on Towards AI. This member-only story is on us. Upgrade to access all of Medium. Prompt Engineering is like giving instructions to an AI model to get the best possible answers or results. The way you phrase …
Actor Critic β Deep Reinforcement Learning
Author(s): Sarvesh Khetan Originally published on Towards AI. I have introduced the problem statement here wherein we are trying to build an agent capable of playing Ping Pong Atari game. Before reading this article I would suggest building up foundational knowledge by …
How to Explain Black-Box Deep Learning Models in Computer Vision and NLP
Author(s): Chien Vu Originally published on Towards AI. Explaining a black box Deep learning model is an essential but difficult task for engineers in an AI project. Letβs explore how to use the OmniXAI package in Python to examine and understand how …
Important LLMs Papers for the Week from 20/01 to 26/01
Author(s): Youssef Hosni Originally published on Towards AI. Stay Updated with Recent Large Language Models Research This member-only story is on us. Upgrade to access all of Medium. Large language models (LLMs) have advanced rapidly in recent years. As new generations of …