From Training Language Models to Training DeepSeek-R1
Author(s): Akhil Theerthala Originally published on Towards AI. Reasoning Models #1 β An overview of trainingFrom RNNs to LLMs, a comprehensive overview of how training regimes changed. This member-only story is on us. Upgrade to access all of Medium. You probably already …
7 Practical PyTorch Tips for Smoother Development and Better Performance
Author(s): Akhil Theerthala Originally published on Towards AI. Things I wish someone listed down. This member-only story is on us. Upgrade to access all of Medium. PyTorch Logo. Source: Internet. PyTorch is a surprisingly simple language that allows you to train a …