How Nvidia trained Nemotron, better agents, and more #31
Author(s): Towards AI Editorial Team
Originally published on Towards AI.
Good morning, AI enthusiasts!
We are excited to announce that βBuilding LLMs for Productionβ is now also available to readers across the globe on the O-Reilly learning platform. But thatβs not all. We are also working on more exciting collaborations with OβReilly to bring even more value and resources to our community (we will share more about this soon!).
For over 45 years, OβReilly has been one of the biggest platforms for providing comprehensive learning resources. It offers exclusive live training, interactive learning experiences, certification programs, books, videos, and more.
If you are a subscriber of the platform, you can read it directly on the OβReilly learning platform or sign up for a 10-day free trial to access the book.
If you are like us and prefer a physical book, you can also find it as a paperback, high-quality colored hardcover on Amazon. If you have already grabbed your copy, help fellow AI community members discover the book by leaving a review on Amazon.
Whatβs AI Weekly
In todayβs video, I dive into key learnings from Nvidiaβs Nemotron family of models and insights for training an LLM using synthetic data. Training large language models is such a massive challenge due to the enormous need for high-quality data. But getting that data is incredibly tough. While many people have tried to solve this problem in various ways, synthetic data is one of the most promising approaches. Itβs less expensive than other methods but has a major drawback: the lack of diversity. Recently, Nvidiaβs new LLMs from their Nemotron family of models have addressed this issue. Theyβve shared a pipeline for generating synthetic data thatβs used for training and refining large language models (LLMs). Watch the video or read the article version!
β Louis-FranΓ§ois Bouchard, Towards AI Co-founder & Head of Community
This issue is brought to you thanks to Ai4:
Join the industryβs leading AI conference β free passes available!
Ai4, the worldβs largest gathering of artificial intelligence leaders in business, is coming to Las Vegas β August 12β14, 2024. Join 4500+ attendees, 350+ speakers, and 150+ AI exhibitors from 75+ countries at the epicenter of AI innovation.
Donβt wait β passes are going fast. Apply today for a complimentary pass, or register now for 41% off final prices.
Learn AI Together Community section!
Featured Community post from the Discord
Craenius just launched a demo of their latest agent, NotDevin. Their experiments show that NotDevin was able to replace Devin and Googleβs Project IDX. You can sign up here to get on the waitlist and support a fellow community member. Share your feedback in the Discord thread!
AI poll of the week!
For everyone planning to buy the book, now is a great time! We now have it as an e-book, paperback, hardcover on Amazon, and the OβReilly learning platform. For those who donβt like books, do you all want a more bite-sized version of the most important takeaways? Tell us in the Discord thread!
Collaboration Opportunities
The Learn AI Together Discord community is flooding with collaboration opportunities. If you are excited to dive into applied AI, want a study partner, or even want to find a partner for your passion project, join the collaboration channel! Keep an eye on this section, too β we share cool opportunities every week!
1. Tanishk0619 is looking for a couple of people to learn ML with and be accountability partners. If you are also looking for a disciplined learning journey, contact him in the thread!
2. Nitin01652 is pursuing deep reinforcement learning courses from HuggingFace. He is looking for partners to discuss assignments and share resources for the next courses on other topics. If you want to try it, connect with him in the thread!
3. Baadror is starting his LLM learning journey with hands-on projects. If you want to start learning and are looking for other learners too, reach out to him in the thread!
Meme of the week!
Meme shared by rucha8062
TAI Curated section
Article of the week
KAN (Kolmogorov-Arnold Networks): A Starter Guide
Inspired by the Kolmogorov-Arnold representation theorem, KANs emerge as promising alternatives to Multi-Layer Perceptrons (MLPs). Unlike traditional neural networks, KANs place activation functions along the connections between nodes, not at the nodes themselves. This innovative approach opens doors for further enhancing deep learning models that heavily rely on MLPs. The goal of this article is to give some basic understanding of KAN and explore the parts or building blocks of KAN in this Article.
Our must-read articles
1. Named Entity Recognition in E-commerce Industry β Custom model [Github Repo] β 03/07/24
From e-commerce to customer support, all businesses require some kind of NER model to process large amounts of texts from users. Businesses require NER models to extract relevant and important entities from text. This article explains how to build NER from scratch.
2. How NVIDIA Nim Can Revolutionize the Deployment of Generative AI applications?
Enterprises absolutely need control of things like logging, monitoring, and security while also striving to integrate AI into their established infrastructure. Going for in-house manufacturing might not be feasible as it requires specialized knowledge, tools, and resources. This is when NVIDIA NIM comes into the picture; explore more in this article.
3. Stable Face-Mask Detection Using Adapted Eye Cascader
In this insightful article, Jan Werth dives into stable face-mask detection using an adapted eye cascader. The article explains how the adapted eye cascader works, providing step-by-step details on detecting eyes and creating face-bounding boxes.
If you are interested in publishing with Towards AI, check our guidelines and sign up. We will publish your work to our network if it meets our editorial policies and standards.
Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming aΒ sponsor.
Published via Towards AI