Optical Character Recognition (OCR) for Text Localization, Detection, and More!

Last Updated on November 10, 2021 by Editorial Team

Author(s): Towards AI Team

AI news, research and updates, an exciting natural language API, our first book on descriptive statistics, and our monthly editorial picks!

If you have trouble reading this email, see it on a web browser.

Happy Tuesday, Towards AI family! It has been a little while since we sent our last newsletter. In this edition, we are bringing you some exciting goodies we think you will love. To get started, this research paper on Liquid Time-constant Networks led by Ramin Hasani et al. from MIT showcases novel recurrent neural network models that can change their underlying equations to adapt to new data inputs to reduce complexity massively continuously.

Optical Character Recognition (OCR) for Text Localization, Detection, and More!

Have you tried out expert.ai’s natural language API demo (no signup needed to try it!). Simply, select a language, choose a document or use a sample text up to 10,000 characters, click analyze, and you will see the different types of natural language analysis expert.ai performs.

We recently launched our book on descriptive statistics with Python, if you haven’t checked it out. This article or this PDF provides a sample of the first 36 pages of the book. Please don’t forget that you can access this work, many more books, and other goodies by becoming a member.

This work on reinforcement learning led by MineRL is fascinating. They are leading state-of-the-art work in the advancement and development of breakthrough RL methods for machine learning research. Check them out, especially if you are interested in Minecraft and reinforcement learning.

Next, if you are interested in computer vision, check out this research from Carnegie Mellon led by Mihir Prabhudesai, Hsiao-Yu Fish Tung, et al., their model can recognize new objects and provide answers to complex visual questions from tiny labeled datasets.

At the beginning of each year, Gradient Flow gathers some groundwork of the year’s technology developments in areas concerning big data, analytics, machine learning, and AI and share their predictions on a trends report. If you haven’t checked it out, their 2021 trends report is very comprehensive.

Next in NLP, powerful language models (LM) such as GPT-3 and T5 have an impressive ability to answer queries in complex scenarios by continuing textual prompts. However, how confident are they? Zhengbao Jiang et al. discuss this LM problem in detail in this paper.

Now into the monthly picks! We pick these articles based on readers, fans, and views a specific piece gets. We hope you enjoy reading them as much as we did. Also, we started doing something new! We will pick our top-performing articles, and our editors will choose a couple of essays that didn’t have outstanding performance, but due to their quality — they made the cut for the month.

If you can, please share our subscription link with your friends, colleagues, and acquaintances. One email per month; unsubscribe anytime! If you have any feedback on how we can improve, please feel free to send us an email.

📚 Editor’s choice featured articles of the month ↓ 📚

Tesseract OCR for Text Localization and Detection by Sharon Lim

Optical character recognition (“OCR”) systems have been widely used to provide automated text entry into computerized systems. However, conventional OCR systems’ inability to read more than a handful of type fonts and page formats still remains unresolved. As a result, conventional OCR has never achieved more than a marginal impact on the total number of documents requiring conversion into its digital form.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Frequently Used, Contextual References

Resources

Publication

Optical Character Recognition (OCR) for Text Localization, Detection, and More!

Author(s): Towards AI Team

AI news, research and updates, an exciting natural language API, our first book on descriptive statistics, and our monthly editorial picks!

📚 Editor’s choice featured articles of the month ↓ 📚

Tesseract OCR for Text Localization and Detection by Sharon Lim

Descriptive Statistics for Data-driven Decision Making with Python by Pratik Shukla, Roberto Iriondo

How AI Will End the One-Size-Fits-All Approach in Human Assessment by Okan Bulut

Genetic Algorithm for Trading Strategy Optimization in Python by Louis Chan

Step-by-step implementation of GANs on custom image data in PyTorch: Part 2 by Varshita Sher

Creating AI Web Apps using TensorFlow, Google Cloud Platform, and Firebase by Jonathan Quijas

How to Predict Stock Prices with LSTM by George Pipis

Basics of Time Series with Python by Amit Chauhan

Thinking Fast and Slow and the Third Wave of AI by Louis (What’s AI) Bouchard

You Will Never Succeed If You Keep Applying for Jobs Online by Arunn Thevapalan

Deep Hashing for Similarity Search by Rutuja Shivraj Pawar

Methods, Challenges, and Hazards of Collecting Tweets by Stephen DeFerrari

Image De-noising Using Deep Learning by Chintan Dave

Setup Your Raspberry Pi Quickly by Nikolas Malamas

Tweet Topic Modeling Part 1: Using Twint to Scrape Tweets by John Bica

Roberto Iriondo

Related posts

Popular posts

Updates

Recent Posts

Comprehensive AI Engineering and AI for Work certifications

Company

CONTACT US

GDPR CCPA Statement