Getting Started with Form Recognizer — Labeling Tool

Last Updated on July 20, 2023 by Editorial Team

Author(s): Dayo Bamikole

Originally published on Towards AI.

Photo from Microsoft Docs on Form Recognizer

TLDR

Background — Story of needing Form Recognizer
Overview — About Form Recognizer
How does it work / What is the labeling tool
How to Deploy the Labeling Tool

Background

So, at some point in life, we all have had to fill out a form either physically or online. From the front-end as a customer or responder, that is the end of the process, but for the back-end as a business or reviewer, a lot more goes into the process. Typically, the responses from these forms need to be extracted and analyzed for making sound business decisions, or even personal choices if you are like me. (Fun fact: I take pictures of my receipts, so I can go back to them to assess my spending habits and be more financially-responsible). Forms are documents that come in different formats e.g., PDFs, JPGs, PNG, or TIFF. The text in these documents can become difficult to work with if they cannot be extracted, searched upon, or appropriately analyzed. To the rescue comes in Form Recognizer!

Overview

Form Recognizer is part of Microsoft Azure Cognitive Service. What is Cognitive Services? It is pre-built AI features that can be easily used via an API call. They can be split into 5 categories: Decision, Language, Speech, Vision, and Web Search. Check out the different features in each of these categories here. Form Recognizer falls under the vision category, and it’s’s goal is to

extract text, key-value pairs, and tables from documents

Here is an example of key-value pair extraction from a receipt

Photo from Microsoft Azure Cognitive Services Demos

You also get the form extraction in JSON format, which can be easily consumed by different databases (e.g., NoSQL) or tools that use JSON format.

Hopefully, by now, you’re excited about the possibilities of Form Recognizer. The next question is, how do I get this working.

Pre-requisite

The only pre-requisite is to have an Azure Subscription.

How does it work?

How does Form Recognizer work? According to Azure Docs, it uses machine learning technology to extract data from the documents. As a result, it needs to at least 5 forms to create a model that subsequent forms can use as a reference point for labeling purposes (for more details, please visit the Forms Recognizer Document here). The tool used to make this happen called the Form Recognizer Labeling Tool. This is what I will like to focus on for the rest of the blog. I will show you the most efficient way to set it up such that you can use it for all your document extraction.

The tool is available via Docker. You can install Docker on your laptop or a VM (Virtual Machine). This is done with any operating system: Linux, Windows, or macOS.

While this is an excellent method, I prefer to use a PaaS approach to install the labeling tool because it guarantees that my labeling tool will always be available whenever without having to rely on my laptop or VM to be on.

To deploy the labeling tool using a PaaS approach, we will be an Azure Web App.

Deploy the Labeling Tool

Head over to the Azure Portal

Select “Create a Resource”
Select “Web App”
On the Basics Tab. Fill in the following information:

Resource Group: DemoFormRecognizer
Name: “”type a unique name””
Publish: Docker Container
Region: Central US
SKU and size: “_choose Dev/Test for a cheaper or free size _.”

4. On the Docker Tab. Fill in the following information:

Image Source: Docker Hub
Image and Tag: mcr.microsoft.com/azure-cognitive-services/custom-form/labeltool
Startup Command: docker run -it -p 3000:80 mcr.microsoft.com/azure-cognitive-services/custom-form/labeltool eula=accept

5. Select Review + Create

6. Access the tool by going to the URL

Final Thoughts

I hope this helped deploy the Form Recognizer Labeling Tool. Now you can begin to label your forms for training and testing purposes. The next steps will be to follow the instructions in the Microsoft Docs below to Create a Project in the Labeling Tool. You will need to have provisioned a Form Recognizer resource in Azure as well as an Azure Storage account.

Quickstart: Label forms, train a model, and analyze a form using the sample labeling tool — Form

In this quickstart, you’ll use the Form Recognizer REST API with the sample labeling tool to train a custom model with…

docs.microsoft.com.

Join thousands of data leaders on the AI newsletter. Join over 80,000 subscribers and keep up to date with the latest developments in AI. From research to projects and ideas. If you are building an AI startup, an AI-related product, or a service, we invite you to consider becoming a sponsor.

Published via Towards AI

Frequently Used, Contextual References

Resources

Publication

Getting Started with Form Recognizer — Labeling Tool

Author(s): Dayo Bamikole

TLDR

Background

Overview

How does it work?

Deploy the Labeling Tool

Final Thoughts

Quickstart: Label forms, train a model, and analyze a form using the sample labeling tool — Form

In this quickstart, you’ll use the Form Recognizer REST API with the sample labeling tool to train a custom model with…

Feedback ↓ Cancel reply

Popular posts

Best Laptops for Deep Learning, Machine Learning (ML), and Data Science for 2023

Best Workstations for Deep Learning, Data Science, and Machine Learning (ML) for 2022

Descriptive Statistics for Data-driven Decision Making with Python

Best Machine Learning (ML) Books - Free and Paid - Editorial Recommendations for 2022

Best Data Science Books - Free and Paid - Editorial Recommendations for 2022

Updates

Recent Posts

Building Large Action Models: Insights from Microsoft

My 6 Secret Tips for Getting an ML Job in 2025

People often follow Probabilities, Deviations and Densities that play a key role in ML modeling.

AI Agents: The Missing Link in DeFi’s $100 Billion Liquidity Challenge

Boxes, Violins and Contours Conclude the Exploratory Data Analysis Process.

The World’s Leading AI and Technology Publication.

Company

CONTACT US

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.

Frequently Used, Contextual References

Resources

Publication

Getting Started with Form Recognizer — Labeling Tool

Author(s): Dayo Bamikole

TLDR

Background

Overview

How does it work?

Deploy the Labeling Tool

Final Thoughts

Quickstart: Label forms, train a model, and analyze a form using the sample labeling tool — Form

In this quickstart, you’ll use the Form Recognizer REST API with the sample labeling tool to train a custom model with…

Related posts

Feedback ↓ Cancel reply

Popular posts

Updates

Recent Posts

The World’s Leading AI and Technology Publication.

Company

CONTACT US

GDPR CCPA Statement