Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Read by thought-leaders and decision-makers around the world. Phone Number: +1-650-246-9381 Email: [email protected]
228 Park Avenue South New York, NY 10003 United States
Website: Publisher: https://towardsai.net/#publisher Diversity Policy: https://towardsai.net/about Ethics Policy: https://towardsai.net/about Masthead: https://towardsai.net/about
Name: Towards AI Legal Name: Towards AI, Inc. Description: Towards AI is the world's leading artificial intelligence (AI) and technology publication. Founders: Roberto Iriondo, , Job Title: Co-founder and Advisor Works for: Towards AI, Inc. Follow Roberto: X, LinkedIn, GitHub, Google Scholar, Towards AI Profile, Medium, ML@CMU, FreeCodeCamp, Crunchbase, Bloomberg, Roberto Iriondo, Generative AI Lab, Generative AI Lab Denis Piffaretti, Job Title: Co-founder Works for: Towards AI, Inc. Louie Peters, Job Title: Co-founder Works for: Towards AI, Inc. Louis-François Bouchard, Job Title: Co-founder Works for: Towards AI, Inc. Cover:
Towards AI Cover
Logo:
Towards AI Logo
Areas Served: Worldwide Alternate Name: Towards AI, Inc. Alternate Name: Towards AI Co. Alternate Name: towards ai Alternate Name: towardsai Alternate Name: towards.ai Alternate Name: tai Alternate Name: toward ai Alternate Name: toward.ai Alternate Name: Towards AI, Inc. Alternate Name: towardsai.net Alternate Name: pub.towardsai.net
5 stars – based on 497 reviews

Frequently Used, Contextual References

TODO: Remember to copy unique IDs whenever it needs used. i.e., URL: 304b2e42315e

Resources

Take our 85+ lesson From Beginner to Advanced LLM Developer Certification: From choosing a project to deploying a working product this is the most comprehensive and practical LLM course out there!

Publication

Google’s “Jarvis” AI Could Soon Run Your Browser for Everyday Tasks
Artificial Intelligence   Latest   Machine Learning

Google’s “Jarvis” AI Could Soon Run Your Browser for Everyday Tasks

Last Updated on October 31, 2024 by Editorial Team

Author(s): Get The Gist

Originally published on Towards AI.

Welcome to Get The Gist, where every weekday, we share an easy-to-read summary of the latest and greatest developments in AI — news, innovations, and trends — all delivered in under 5 minutes! ⏱

In today’s edition:

  • Google’s “Jarvis” AI Could Soon Run Your Browser for Everyday Tasks
  • Musk’s xAI Grok Now Analyzes Images and Cracks Jokes
  • Meta Launches Open Source Alternative to Google’s NotebookLM
  • Google is Preparing to Launch Gemini 2.0
  • And more AI news….

1. Google’s “Jarvis” AI Could Soon Run Your Browser for Everyday Tasks

Image by: Google

The Gist: Google is reportedly working on “Jarvis,” an AI that could operate a web browser autonomously to streamline routine tasks, with a preview potentially coming in December.

Key Details:

  • Jarvis uses frequent screenshots to interpret a user’s screen, then acts on commands like clicking buttons or typing text, handling activities like research, shopping, and travel bookings.
  • The AI is designed specifically for web browsers, with a focus on Chrome, offering users direct assistance within a familiar platform.
  • This project is part of Google’s larger push into AI, alongside new features in its Gemini AI and expanded language support for Gemini Live.
  • Google’s move comes shortly after Anthropic’s Claude AI introduced similar “computer-using” skills, which are now in a public beta.

2. Musk’s xAI Grok Now Analyzes Images and Cracks Jokes

Image by: X Corp.

The Gist: Elon Musk’s xAI has introduced an image understanding feature to its Grok chatbot, allowing paid X users to upload images and ask Grok questions about them, even for humor analysis.

Key Details:

  • The new update lets Grok interpret images, with the potential to analyze jokes within visuals, as Musk noted in a post on X.
  • Grok’s image understanding feature is in its early phase but is expected to improve quickly with further development.
  • This follows Grok-2’s August release, which introduced image generation via Black Forest Labs’ FLUX.1 model, with promises for future multimodal capabilities.
  • Musk hinted at upcoming document interpretation abilities for Grok, aiming to overcome current limitations with file formats like PDFs.

3. Meta’s NotebookLlama: An Open Source Alternative to Google’s NotebookLM

Image by: Unsplash

The Gist: Meta has launched NotebookLlama, an open-source tool to create podcast-style summaries from text files, designed as an alternative to Google’s NotebookLM.

Key Details:

  • NotebookLlama generates conversational summaries from uploaded text, using Meta’s Llama models to process PDFs and create engaging podcast scripts.
  • The process includes several steps: pre-processing text, creating transcripts, dramatizing the script, and converting it to audio with tools like Parler-TTS and Bark’s Suno.
  • While some users find the output more robotic compared to NotebookLM, it offers developers insight into open-source podcast tech.
  • Meta’s Llama models have seen massive global use, with India as a major market, and Llama 4 is expected to launch next year.

Quick Gist

  • Abu Dhabi’s G42 is pioneering AI in India’s film industry, enhancing processes like dubbing, script support, and plans for a Hindi language model to streamline creative workflows in Bollywood (Read More).
  • Google is preparing to launch its Gemini 2.0 AI model in December, though reports suggest it may offer limited advancements over the previous version (Read More).
  • Google DeepMind introduced the Habermas Machine, an AI designed to mediate and promote consensus in conflicting discussions by generating compromise-based statements (Read More).
  • Research highlights issues with OpenAI’s Whisper transcription tool, which frequently generates errors or “hallucinations” in sensitive contexts, such as healthcare, where accuracy is paramount (Read More).
  • Meta has partnered with Reuters to integrate news content into its Meta AI chatbot, enhancing responses to current events while addressing concerns over misinformation control (Read More).

That’s it for today, see you tomorrow! 👋

If you enjoyed this update and want to stay informed about the latest developments in AI, consider subscribing to Get The Gist on Medium for more insights and analyses.

Want to dive even deeper? Subscribe to our free daily email newsletter for quick, concise updates straight to your inbox so you never miss an important development. You can sign up by clicking here.

Join us as we explore the world of AI together — one gist at a time! 💡🤖

Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming a sponsor.

Published via Towards AI

Feedback ↓