NLP News Cypher | 03.15.20

Last Updated on July 27, 2023 by Editorial Team

Author(s): Ricky Costa

Originally published on Towards AI.

Photo by Andrew Coelho on Unsplash

NATURAL LANGUAGE PROCESSING (NLP) WEEKLY NEWSLETTER

NLP News Cypher | 03.15.20

Keep it Movin’

How was your week? 😷

Same here.

In light of recent events, we released a COVID tracker to stay up to date on the latest COVID news from sources across the 50 US states. We have linked local and national news sources in addition to the health departments and other authorities from each state. We are using 3 different APIs: one from Datawrapper’s server that connects to Johns Hopkins COVID data, Feedly’s news API, and Twitter’s streaming API. 😋

Check it out:

US COVID Tracker – Quantum Stat

Currently, the COVID-19 pandemic is spreading across the United States. Under these conditions, one must know how to…

covid.quantumstat.com

With regard to the Big Bad NLP Database, we also added research papers to ~90% of the database! Special thank you to our researcher Gabi Alexandru for doing an amazing job. 😎

The Big Bad NLP Database – Quantum Stat

Datasets for various tasks in Natural Language Processing – Quantum Stat

datasets.quantumstat.com

FYI, stay indoors!

This week, the newsletter will be shorter than usual given the slow news cycle, which I’m assuming is related to the current pandemic 😌.

This Week:

TensorFlow Quantum

Haste

Electra Feel

Hugging Papers

Dataset of the Week: Jeopardy Questions

TensorFlow Quantum

Google introduced an open-source library for the rapid prototyping of quantum ML models!

To understand quantum models, you need to familiarize yourself with two concepts: quantum data and hybrid quantum-classical models (the current approach).

Quantum data: this kind of data (which can be generated) can be used for the simulation of chemicals and quantum matter, quantum control, quantum communication networks, quantum metrology, and much more.

Hybrid quantum-classical models: OK, spoiler alert: these quantum models are mostly not YET running on quantum-powered hardware (still too noisy), so the quantum part is typically simulated on classical hardware such as GPUs and trained together with a classical model. That’s why they are “hybrid”.

The good thing about this library is: if we can get used to these models now, by the time the processors are ready for prime time, we will be able to crunch a HUMONGOUS amount of data using quantum principles. But first, we need to dip our feet into the quantum framework, and that is what Google is doing for us with this library.
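To make the hybrid idea a bit more concrete, here is a minimal sketch in plain Python. It is not TensorFlow Quantum's API. It assumes a toy one-qubit circuit (a single RY rotation, simulated on a classical machine) whose parameter is tuned by an ordinary classical gradient-descent loop using the parameter-shift rule. The function names, learning rate, and step count are illustrative assumptions.

```python
import math


def expectation_z(theta: float) -> float:
    """Simulate RY(theta)|0> classically and return the expectation of Z."""
    # RY(theta)|0> = cos(theta/2)|0> + sin(theta/2)|1>
    amp0 = math.cos(theta / 2.0)
    amp1 = math.sin(theta / 2.0)
    # <Z> = P(0) - P(1), which equals cos(theta)
    return amp0 ** 2 - amp1 ** 2


def parameter_shift_grad(theta: float) -> float:
    """Gradient of <Z> w.r.t. theta from two shifted circuit evaluations."""
    shift = math.pi / 2.0
    return 0.5 * (expectation_z(theta + shift) - expectation_z(theta - shift))


def train(theta: float = 0.1, lr: float = 0.4, steps: int = 100) -> float:
    """Classical optimizer loop: minimize <Z>, i.e. rotate the qubit towards |1>."""
    for _ in range(steps):
        theta -= lr * parameter_shift_grad(theta)
    return theta


if __name__ == "__main__":
    best = train()
    print(f"theta = {best:.4f}, <Z> = {expectation_z(best):.4f}")
```

The "quantum" part here is only the simulated circuit evaluation; everything else (the gradient step and the loop) is classical, which is the division of labor the hybrid approach relies on.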

Blog

TensorFlow Quantum

TensorFlow Quantum is a library for hybrid quantum-classical machine learning. TensorFlow Quantum (TFQ) is a quantum…

www.tensorflow.org

Paper:

LINK

Haste

Hey, remember RNNs? 🤣 Even though we all want to marry transformers for life, RNNs are still very useful. Why? Because many companies that use AI models are still using RNNs for sequential NLP data. (It takes a couple of years for them to catch up to transformers.)

Props to Mr. Nanavati for creating the RNN library called Haste:

lmnt-com/haste

Haste is a CUDA implementation of fused LSTM, Layer Normalized LSTM, and GRU layers with built-in DropConnect and…

github.com
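For readers who have not met DropConnect before, here is a rough sketch of the general idea in plain Python. It is not Haste's CUDA code or its API: it assumes a plain tanh RNN cell and simply zeroes individual recurrent weights at random during a training step. All names and the tiny weight matrices are made up for illustration.

```python
import math
import random


def dropconnect(weights, drop_prob, rng):
    """Zero each individual weight independently with probability drop_prob."""
    return [[0.0 if rng.random() < drop_prob else w for w in row]
            for row in weights]


def rnn_step(x, h, w_in, w_rec, drop_prob=0.0, rng=None):
    """One tanh RNN step; DropConnect is applied to the recurrent weights only."""
    if drop_prob > 0.0:
        w_rec = dropconnect(w_rec, drop_prob, rng or random.Random(0))
    new_h = []
    for i in range(len(h)):
        total = sum(w_in[i][j] * x[j] for j in range(len(x)))
        total += sum(w_rec[i][j] * h[j] for j in range(len(h)))
        new_h.append(math.tanh(total))
    return new_h


if __name__ == "__main__":
    w_in = [[0.5, -0.2], [0.1, 0.3]]    # hypothetical input weights
    w_rec = [[0.4, 0.0], [-0.3, 0.2]]   # hypothetical recurrent weights
    h = [0.0, 0.0]
    for x in ([1.0, 0.0], [0.0, 1.0]):
        h = rnn_step(x, h, w_in, w_rec, drop_prob=0.25, rng=random.Random(42))
    print(h)
```

Unlike regular dropout, which zeroes activations, this masks the connections themselves, so the hidden state is still fully populated after each step.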

Electra Feel

The ELECTRA transformer is pretty cool. Why? Because they changed the pre-training task. Instead of masking tokens during pre-training, they substitute some words with plausible fakes and make the model decide, for every word, whether it is the original or a replacement. It’s like GANs but for NLP!

This new pre-training technique allows ELECTRA to outperform current NLP transformers given the same compute during training! (SOTA on SQuAD v2)

“We compare ELECTRA against other state-of-the-art NLP models and found that it substantially improves over previous methods, given the same compute budget, performing comparably to RoBERTa and XLNet while using less than 25% of the compute.”

In terms of downstream tasks, ELECTRA supports text classification, question answering and sequence tagging.
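The replaced-token idea described above can be sketched in a few lines of plain Python. This is only an illustration of how the training examples are built, not the code from the ELECTRA repository: the real method samples replacements from a small generator network, whereas this sketch assumes a random pick from a toy vocabulary as a stand-in. Function and variable names are hypothetical.

```python
import random


def corrupt_tokens(tokens, vocab, replace_prob=0.15, seed=0):
    """Replace some tokens and label every position as original (0) or replaced (1)."""
    rng = random.Random(seed)
    corrupted, labels = [], []
    for tok in tokens:
        if rng.random() < replace_prob:
            # Stand-in for a sample from the generator network (assumption).
            new_tok = rng.choice(vocab)
        else:
            new_tok = tok
        corrupted.append(new_tok)
        # If the sampled token happens to equal the original, it counts as original.
        labels.append(int(new_tok != tok))
    return corrupted, labels


if __name__ == "__main__":
    sentence = ["the", "chef", "cooked", "the", "meal"]
    toy_vocab = ["the", "chef", "cooked", "meal", "ate", "dog"]
    fake, targets = corrupt_tokens(sentence, toy_vocab, replace_prob=0.3)
    # The discriminator would be trained to predict `targets` from `fake`.
    print(fake)
    print(targets)
```

Because every position gets a label, the model receives a training signal from all tokens rather than only from the masked subset.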

More Efficient NLP Model Pre-training with ELECTRA

Recent advances in language pre-training have led to substantial gains in the field of natural language processing…

ai.googleblog.com

GitHub:

google-research/electra

ELECTRA is a new method for self-supervised language representation learning. It can be used to pre-train transformer…

github.com

Hugging Papers

Hugging Face shared their favorite NLP research papers with their community. In the link below, you will find their fav research papers and also a schedule for future papers to be discussed. They are pulling us into the matrix! Take the blue pill!

huggingface/awesome-papers

Each week, the Hugging Face team has a science day where one team member presents an awesome NLP paper. We've decided…

github.com

Dataset of the Week: Jeopardy Questions

What is it?

A dataset containing 216,930 Jeopardy questions & answers used for, you guessed it, question answering.

Sample:

Here are the metadata descriptors:

  • ‘category’: the question category, e.g. “HISTORY”
  • ‘value’: $ value of the question as a string, e.g. “$200”
      • Note: This is “None” for Final Jeopardy! and Tiebreaker questions
  • ‘question’: text of the question
      • Note: This sometimes contains hyperlinks and other messy text, such as when there’s a picture or video question
  • ‘answer’: text of the answer
  • ‘round’: one of “Jeopardy!”, “Double Jeopardy!”, “Final Jeopardy!” or “Tiebreaker”
      • Note: Tiebreaker questions do happen but they’re very rare (like once every 20 years)
  • ‘show_number’: string of the show number, e.g. ‘4680’
  • ‘air_date’: the show air date in the format YYYY-MM-DD
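Since several of these fields arrive as strings, a small clean-up step is handy before using the data. Below is a minimal sketch that assumes each record is a Python dict with the fields listed above. The sample record is made up for illustration (it is not copied from the dataset), and the helper names are hypothetical.

```python
from datetime import date


def parse_value(raw):
    """Turn a '$200'-style string into an int; missing values become None."""
    if raw is None or raw == "None":
        return None  # Final Jeopardy! and Tiebreaker questions have no value
    return int(raw.replace("$", "").replace(",", ""))


def normalize(record):
    """Return a copy of the record with typed value, show_number, and air_date."""
    clean = dict(record)
    clean["value"] = parse_value(record["value"])
    clean["show_number"] = int(record["show_number"])
    clean["air_date"] = date.fromisoformat(record["air_date"])
    return clean


if __name__ == "__main__":
    # Made-up record for illustration only.
    sample = {
        "category": "HISTORY",
        "value": "$200",
        "question": "placeholder clue text",
        "answer": "placeholder answer",
        "round": "Jeopardy!",
        "show_number": "4680",
        "air_date": "2020-03-15",
    }
    print(normalize(sample))
```

The ‘question’ text is left untouched here; stripping hyperlinks or other markup from it would be a separate step.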

Where is it?

Every Sunday we do a weekly round-up of NLP news and code drops from researchers around the world.

If you enjoyed this article, help us out and share with friends!

For complete coverage, follow our Twitter: @Quantum_Stat

www.quantumstat.com

Published via Towards AI
