Falcon-40B: A Fully OpenSourced Foundation LLM
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. Each Contributor hereby grants to You a perpetual, worldwide, non-exclusive, irrevocable copyright license to reproduce, prepare Derivative Works of, publicly display, publicly perform, sublicense, and distribute the Work and such …
Fine Tune GPT Models Using Lit-Parrot by Lightning-AI
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. BYOD: Bring Your Own Data! And let's train on your GPU. Lit-Parrot is a hackable implementation of state-of-the-art open-source large language models: StabilityAI StableLM, EleutherAI Pythia, Together RedPajama-INCITE, and TII UAE Falcon. Lit-Parrot has been …
WizardLM: Fully Open-source Automated Instruction Data Generator
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. Automate tedious steps of instruction-based training data generation. Instruction tuning on open-domain LLMs (LLaMA, MPT, Falcon) has worked fantastically! But manually creating instruction data is really time-consuming and humans are lazy …
AlphaDev: Sorting Algorithm “Hold My Beer”
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. AlphaDev uncovered a faster algorithm for sorting, a method for ordering data. Billions of people use these algorithms every day without realizing it. Just when you started to think as ChatGPT …
NVIDIA: LLM That Predicts Patient Readmission
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. Pretraining datasets (NYU Notes, NYU Notes–Manhattan, NYU Notes–Brooklyn). TL;DR: Nearly 15% of hospital patients in the U.S. are readmitted within 30 days of their initial discharge, which is often associated with …
How Do 8 Smaller Models in GPT4 Work?
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. The secret “Model of Experts” is out; let's understand why GPT4 is so good! In recent years, deep learning models have been all the buzz. Every company is developing it. And …
GPT-4 Lost This Battle 449 to 28
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. After GDPR, Europe’s push for Safe and Transparent AI will change the LLM landscape significantly. Source: Stanford.edu. We have enjoyed foundation models being developed left and right in the last 2–3 …
How Do 8 Smaller Models in GPT-4 Work?
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. The secret “Model of Experts” is out; let's understand why GPT4 is so good! In recent years, deep learning …
Better than GPT-4 for SQL queries: NSQL (Fully OpenSource)
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. NSQL is a new family of open-source large foundation models (FMs) designed specifically for SQL generation tasks. Raise your hand if you have tried to use ChatGPT or any of the …
NSQL: First Ever Fully OpenSource SQL Foundation Model
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. NSQL is a new family of open-source large foundation models (FMs) designed specifically for SQL generation tasks. Raise your hand if you have tried to use ChatGPT or any of the …
Meet Fully OpenSource Foundation Model By Salesforce XGen-7B
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. This model allows long sequences of up to 8K tokens, completely free. Salesforce has a reputation for doing good work in the open-source world, including their image encoder models like BLIP …
Meet MPT-30B: A Fully OpenSource LLM that Outperforms GPT-3
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. Forget LAMP Stack: LLM stack is here! The community has run with MPT-7B, which was downloaded over 3M times. Within a month, the community has created: LLaVA-MPT adds vision understanding to …
Orca 13B: Imitating GPT-4 the “Right” Way
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. Limited diversity. The paper argues that all other small models are imitating GPT-4 the wrong way (style) but not the right way (reasoning). Lack of rigorous testing makes these small models look …
Forget LAMP Stack: LLM Stack is Here!
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. Huggingface has positioned itself as the new standard stack in the NLP/LLM ecosystem. Now the companies are asking for an LLM stack. In May of last year, Huggingface …
Meet Gorilla: A Fully OpenSource LLM Tuned For API Calls
Author(s): Dr. Mandar Karhade, MD. PhD. Originally published on Towards AI. UC Berkeley and Microsoft Research together came up with Gorilla, which specializes in API calls. It is a 7B-parameter model, which means consumer GPUs are in business. Let’s take …