How Nvidia trained Nemotron, better agents, and more #31

Author(s): Towards AI Editorial Team

Originally published on Towards AI.

Good morning, AI enthusiasts!

We are excited to announce that ‘Building LLMs for Production’ is now also available to readers across the globe on the O-Reilly learning platform. But that’s not all. We are also working on more exciting collaborations with O’Reilly to bring even more value and resources to our community (we will share more about this soon!).

For over 45 years, O’Reilly has been one of the biggest platforms for providing comprehensive learning resources. It offers exclusive live training, interactive learning experiences, certification programs, books, videos, and more.

If you are a subscriber of the platform, you can read it directly on the O’Reilly learning platform or sign up for a 10-day free trial to access the book.

If you are like us and prefer a physical book, you can also find it as a paperback, high-quality colored hardcover on Amazon. If you have already grabbed your copy, help fellow AI community members discover the book by leaving a review on Amazon.

What’s AI Weekly

In today’s video, I dive into key learnings from Nvidia’s Nemotron family of models and insights for training an LLM using synthetic data. Training large language models is such a massive challenge due to the enormous need for high-quality data. But getting that data is incredibly tough. While many people have tried to solve this problem in various ways, synthetic data is one of the most promising approaches. It’s less expensive than other methods but has a major drawback: the lack of diversity. Recently, Nvidia’s new LLMs from their Nemotron family of models have addressed this issue. They’ve shared a pipeline for generating synthetic data that’s used for training and refining large language models (LLMs). Watch the video or read the article version!

— Louis-François Bouchard, Towards AI Co-founder & Head of Community

This issue is brought to you thanks to Ai4:

Join the industry’s leading AI conference — free passes available!

Ai4, the world’s largest gathering of artificial intelligence leaders in business, is coming to Las Vegas — August 12–14, 2024. Join 4500+ attendees, 350+ speakers, and 150+ AI exhibitors from 75+ countries at the epicenter of AI innovation.

Don’t wait — passes are going fast. Apply today for a complimentary pass, or register now for 41% off final prices.

Learn AI Together Community section!

Featured Community post from the Discord

Craenius just launched a demo of their latest agent, NotDevin. Their experiments show that NotDevin was able to replace Devin and Google’s Project IDX. You can sign up here to get on the waitlist and support a fellow community member. Share your feedback in the Discord thread!

AI poll of the week!

For everyone planning to buy the book, now is a great time! We now have it as an e-book, paperback, hardcover on Amazon, and the O’Reilly learning platform. For those who don’t like books, do you all want a more bite-sized version of the most important takeaways? Tell us in the Discord thread!

Collaboration Opportunities

The Learn AI Together Discord community is flooding with collaboration opportunities. If you are excited to dive into applied AI, want a study partner, or even want to find a partner for your passion project, join the collaboration channel! Keep an eye on this section, too — we share cool opportunities every week!

1. Tanishk0619 is looking for a couple of people to learn ML with and be accountability partners. If you are also looking for a disciplined learning journey, contact him in the thread!

2. Nitin01652 is pursuing deep reinforcement learning courses from HuggingFace. He is looking for partners to discuss assignments and share resources for the next courses on other topics. If you want to try it, connect with him in the thread!

3. Baadror is starting his LLM learning journey with hands-on projects. If you want to start learning and are looking for other learners too, reach out to him in the thread!

Meme of the week!

Meme shared by rucha8062

TAI Curated section

Article of the week

KAN (Kolmogorov-Arnold Networks): A Starter Guide

Inspired by the Kolmogorov-Arnold representation theorem, KANs emerge as promising alternatives to Multi-Layer Perceptrons (MLPs). Unlike traditional neural networks, KANs place activation functions along the connections between nodes, not at the nodes themselves. This innovative approach opens doors for further enhancing deep learning models that heavily rely on MLPs. The goal of this article is to give some basic understanding of KAN and explore the parts or building blocks of KAN in this Article.

Our must-read articles

1. Named Entity Recognition in E-commerce Industry — Custom model [Github Repo] — 03/07/24

From e-commerce to customer support, all businesses require some kind of NER model to process large amounts of texts from users. Businesses require NER models to extract relevant and important entities from text. This article explains how to build NER from scratch.

2. How NVIDIA Nim Can Revolutionize the Deployment of Generative AI applications?

Enterprises absolutely need control of things like logging, monitoring, and security while also striving to integrate AI into their established infrastructure. Going for in-house manufacturing might not be feasible as it requires specialized knowledge, tools, and resources. This is when NVIDIA NIM comes into the picture; explore more in this article.

3. Stable Face-Mask Detection Using Adapted Eye Cascader

In this insightful article, Jan Werth dives into stable face-mask detection using an adapted eye cascader. The article explains how the adapted eye cascader works, providing step-by-step details on detecting eyes and creating face-bounding boxes.

If you are interested in publishing with Towards AI, check our guidelines and sign up. We will publish your work to our network if it meets our editorial policies and standards.

Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming a sponsor.

Published via Towards AI

Frequently Used, Contextual References

Resources

Publication

How Nvidia trained Nemotron, better agents, and more #31

Author(s): Towards AI Editorial Team

What’s AI Weekly

Learn AI Together Community section!

Featured Community post from the Discord

AI poll of the week!

Collaboration Opportunities

Meme of the week!

TAI Curated section

Article of the week

Our must-read articles

Feedback ↓ Cancel reply

Popular posts

Best Laptops for Deep Learning, Machine Learning (ML), and Data Science for 2023

Best Workstations for Deep Learning, Data Science, and Machine Learning (ML) for 2022

Descriptive Statistics for Data-driven Decision Making with Python

Best Machine Learning (ML) Books - Free and Paid - Editorial Recommendations for 2022

Best Data Science Books - Free and Paid - Editorial Recommendations for 2022

Updates

Recent Posts

I Used ChatGPT to Count My Calories

Resource-Efficient Fine-Tuning of DeepSeek-R1

TAI #138: OpenAI’s o3-Mini and Deep Research: A New Era of Reasoning Powered Agents?

Text Preprocessing for NLP: A Step-by-Step Guide to Clean Raw Text Data

DeepSeek AI — The Future is Here

The World’s Leading AI and Technology Publication.

Company

CONTACT US

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Frequently Used, Contextual References

Resources

Publication

How Nvidia trained Nemotron, better agents, and more #31

Author(s): Towards AI Editorial Team

What’s AI Weekly

Learn AI Together Community section!

Featured Community post from the Discord

AI poll of the week!

Collaboration Opportunities

Meme of the week!

TAI Curated section

Article of the week

Our must-read articles

Related posts

Feedback ↓ Cancel reply

Popular posts

Updates

Recent Posts

The World’s Leading AI and Technology Publication.

Company

CONTACT US

GDPR CCPA Statement