Inside Code Llama: Meta AIβs Entrance in the Code LLM Space
Author(s): Jesus Rodriguez Originally published on Towards AI. The new family of models builds on the Llama 2 foundation to match state-of-the-art performance across different code generation tasks. Created Using Midjourney I recently started an AI-focused educational newsletter, that already has over …
Meet SeamlessM4T: Meta AIβs New Foundation Model for Speech Translation
Author(s): Jesus Rodriguez Originally published on Towards AI. The model provides a unique architecture and breakthrough performance across different speech translation tasks. Created Using Midjourney I recently started an AI-focused educational newsletter, that already has over 160,000 subscribers. TheSequence is a no-BS …
Inside AVIS: Googleβs New Visual Information Seeling LLM
Author(s): Jesus Rodriguez Originally published on Towards AI. The new model combines LLMs with web search, computer vision, and image search to achieve remarkable results. Created Using Midjourney I recently started an AI-focused educational newsletter, that already has over 160,000 subscribers. TheSequence …
Inside XGen-Image-1: How Salesforce Research Built, Trained, and Evaluated a Massive Text-to-Image Model
Author(s): Jesus Rodriguez Originally published on Towards AI. One of the most efficient training processes for text-to-image models ever implemented. Image Credit: Salesforce Research I recently started an AI-focused educational newsletter, that already has over 160,000 subscribers. TheSequence is a no-BS (meaning …
Googleβs Symbol Tuning is a New Fine-Tuning Technique that In-Context Learning in LLMs
Author(s): Jesus Rodriguez Originally published on Towards AI. The new method can become the foundation of new fine-tuning techniques. Created Using Midjourney I recently started an AI-focused educational newsletter, that already has over 160,000 subscribers. TheSequence is a no-BS (meaning no hype, …
Meet WebAgent: DeepMindβs New LLM that Follows Instructions and Complete Tasks on Websites
Author(s): Jesus Rodriguez Originally published on Towards AI. The model combines language understanding and web navigation. Created Using Midjourney I recently started an AI-focused educational newsletter, that already has over 160,000 subscribers. TheSequence is a no-BS (meaning no hype, no news, etc) …
Inside SDXL 1.0: Stability AI New Text-to-Image Super Model
Author(s): Jesus Rodriguez Originally published on Towards AI. The new release represents a major improvement over previous versions and matches state-of-the-art models. This member-only story is on us. Upgrade to access all of Medium. Image Credit: Stability AI I recently started an …
StyleGAN3: Allias-Free GAN
Author(s): Albert Nguyen Originally published on Towards AI. Have you ever thought of a movie that is totally generated by AI? Recent advancements in generative AI have shown promising results in controllable image generation. Generators like StyleGAN2 can produce realistic images. See …
Inside DINOv2: Meta AIβs New Self-Supervised Learning Model for Computer Vision
Author(s): Jesus Rodriguez Originally published on Towards AI. The model uses a novel architecture to remove the dependencies on text fine tuning and exhibits interesting emerging properties. I recently started an AI-focused educational newsletter, that already has over 150,000 subscribers. TheSequence is …
LLMs and Memory is Definitely All You Need: Google Shows that Memory-Augmented LLMs Can Simulate Any Turing Machine
Author(s): Jesus Rodriguez Originally published on Towards AI. A major breakthrough in LLM research. Top highlight Created Using Midjourney I recently started an AI-focused educational newsletter, that already has over 160,000 subscribers. TheSequence is a no-BS (meaning no hype, no news, etc) …
Foundation Models and the Path Towards a Universal Learning Algorithm
Author(s): Jesus Rodriguez Originally published on Towards AI. Can foundation models validate the theory of a master algorithm for all human knowledge? Created with: DALL-E I recently started an AI-focused educational newsletter, that already has over 150,000 subscribers. TheSequence is a no-BS …
The AI Behind Claude, the ChatGPT Competitor That Has Raised Over $1 Billion
Author(s): Jesus Rodriguez Originally published on Towards AI. The new chatbot follows traditional reinforcement learning with human feedback approach with a special twist. Image Credit: Anthropic I recently started an AI-focused educational newsletter, that already has over 150,000 subscribers. TheSequence is a …
Googleβs Chain of Thought Prompting is One of the Most Exciting Techniques in Generative AI
Author(s): Jesus Rodriguez Originally published on Towards AI. The technique is likely to be one of the hallmarks of the LaMDA model. Created with Midjourney I recently started an AI-focused educational newsletter, that already has over 150,000 subscribers. TheSequence is a no-BS …
Paper Review: Multimodal Chain of Thought Reasoning
Author(s): Building Blocks Originally published on Towards AI. Language Models improve with Visual Features One of the cool emergent features of Large Language Models (LLMs) is their ability to perform better on reasoning tasks such as arithmetic problems, common sense reasoning, etc., …
Hugging Faceβs LoRA is a Simple Framework for Fine-Tuning Text-to-Image Models
Author(s): Jesus Rodriguez Originally published on Towards AI. The framework is integrated into the Diffusers library and maintains compatibility with Dreambooth. Image Credit: Hugging Face I recently started an AI-focused educational newsletter, that already has over 150,000 subscribers. TheSequence is a no-BS …