Azure Form Recognizer is a Vision API that extracts key-value pairs and table data from form documents.
Uses of the Form Recognizer service include:
- Process automation
- Knowledge mining
- Industry-specific applications
Form Recognizer uses Optical Character Recognition (OCR) capabilities and deep learning models to extract text, key-value pairs, selection marks, and tables from documents.
- OCR captures document structure by creating bounding boxes around detected objects in an image.
- The locations of the bounding boxes are recorded as coordinates in relation to the rest of the page.
- Form Recognizer services return bounding box data and other information in a structured form with the relationships from the original file.
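To make the idea of coordinates "in relation to the rest of the page" concrete, here is a minimal sketch. It assumes a bounding box arrives as a flat list of eight numbers (four x, y corner points), and the `word` dictionary, its field names, and the page size are hypothetical illustration values rather than actual service output.

```python
from typing import List, Tuple


def corners(bounding_box: List[float]) -> List[Tuple[float, float]]:
    """Group a flat [x1, y1, ..., x4, y4] list into four (x, y) corner points."""
    if len(bounding_box) != 8:
        raise ValueError("expected 8 numbers (four x, y pairs)")
    return [(bounding_box[i], bounding_box[i + 1]) for i in range(0, 8, 2)]


def relative_extent(bounding_box: List[float],
                    page_width: float,
                    page_height: float) -> Tuple[float, float, float, float]:
    """Return (left, top, right, bottom) as fractions of the page size."""
    points = corners(bounding_box)
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return (min(xs) / page_width, min(ys) / page_height,
            max(xs) / page_width, max(ys) / page_height)


# Hypothetical detected word on a page assumed to measure 8.5 x 11 units.
word = {"text": "Total", "boundingBox": [1.0, 2.0, 2.0, 2.0, 2.0, 2.5, 1.0, 2.5]}
extent = relative_extent(word["boundingBox"], page_width=8.5, page_height=11.0)
print(word["text"], extent)
```

Expressing the box as page fractions is one way to compare positions across pages of different sizes; the service itself reports coordinates in the page's own units.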
Form Recognizer is composed of the following services:
- Document analysis models: take JPEG, PNG, PDF, and TIFF files as input and return a JSON file with the location of text in bounding boxes, text content, tables, selection marks (also known as checkboxes or radio buttons), and document structure.
- Prebuilt models: detect and extract information from document images and return the extracted data as structured JSON output. Form Recognizer currently supports prebuilt models for several form types, including:
  - W-2 forms
  - Invoices
  - Receipts
  - ID documents
  - Business cards
- Custom models: extract data from forms specific to your business. Custom models can be trained by calling the Build model API or through Form Recognizer Studio.
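As a rough sketch of how a prebuilt model might be called over REST, the snippet below only builds the HTTP request and does not send it. It assumes the v3.0 route (`/formrecognizer/documentModels/{modelId}:analyze` with `api-version=2022-08-31`) and the `prebuilt-invoice` model ID; the API version used in the module may differ. The endpoint, key, and document URL are placeholders, and `build_analyze_request` is a hypothetical helper name.

```python
import json
import urllib.request


def build_analyze_request(endpoint: str, key: str, model_id: str,
                          document_url: str,
                          api_version: str = "2022-08-31") -> urllib.request.Request:
    """Build (but do not send) a POST request that asks a model to analyze a document."""
    url = (f"{endpoint.rstrip('/')}/formrecognizer/documentModels/"
           f"{model_id}:analyze?api-version={api_version}")
    body = json.dumps({"urlSource": document_url}).encode("utf-8")
    headers = {
        "Content-Type": "application/json",
        "Ocp-Apim-Subscription-Key": key,  # key of your Form Recognizer resource
    }
    return urllib.request.Request(url, data=body, headers=headers, method="POST")


# Placeholder values; replace them with your own resource details.
request = build_analyze_request(
    endpoint="https://example-resource.cognitiveservices.azure.com/",
    key="<your-key>",
    model_id="prebuilt-invoice",
    document_url="https://example.com/sample-invoice.pdf",
)
print(request.get_method(), request.full_url)
```

Under this assumption, sending the request with `urllib.request.urlopen(request)` starts an asynchronous analysis, and the JSON result is fetched afterwards from the URL returned in the `Operation-Location` response header.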
Understand Form Recognizer file input requirements
Form Recognizer works on input documents that meet these requirements:
- Format must be JPG, PNG, BMP, PDF (text or scanned), or TIFF.
- The file size must be less than 500 MB for the paid (S0) tier and less than 4 MB for the free (F0) tier.
- Image dimensions must be between 50 x 50 pixels and 10000 x 10000 pixels.
- The total size of the training data set must be 500 pages or less.
More input requirements can be found in the documentation for specific models.
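The format, size, and dimension requirements listed above can be checked locally before uploading. This is a minimal sketch, not part of the service: `check_input` is a hypothetical helper, the extension list is inferred from the formats named above, and 1 MB is assumed to mean 1024 x 1024 bytes. Image width and height are passed in by the caller because reading them requires an image library.

```python
import os

ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp", ".pdf", ".tif", ".tiff"}
# Assumption: 1 MB = 1024 * 1024 bytes.
MAX_BYTES = {"S0": 500 * 1024 * 1024, "F0": 4 * 1024 * 1024}
MIN_SIDE, MAX_SIDE = 50, 10_000  # allowed pixel range for each image side


def check_input(filename, size_bytes, tier="S0", width=None, height=None):
    """Return a list of problems; an empty list means the file passes these checks."""
    problems = []
    extension = os.path.splitext(filename)[1].lower()
    if extension not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported format: {extension or '(none)'}")
    if size_bytes >= MAX_BYTES[tier]:
        problems.append(f"file too large for the {tier} tier")
    # Dimensions only apply to images, so they are checked only when supplied.
    if width is not None and height is not None:
        if not (MIN_SIDE <= width <= MAX_SIDE and MIN_SIDE <= height <= MAX_SIDE):
            problems.append("image dimensions out of range")
    return problems


print(check_input("invoice.pdf", 1_000_000, tier="F0"))
print(check_input("scan.png", 5 * 1024 * 1024, tier="F0", width=40, height=800))
```

The first call reports no problems; the second reports both a size problem (over the free-tier limit) and a dimension problem (one side under 50 pixels).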
Create a Form Recognizer resource
Note down the resource group, subscription ID, and location, which we will use when running a custom Windows batch script.
At the beginning of the script, we've updated these details with our own values. The script automatically creates the storage account and uploads the forms.
Update the application config file with the specifics.
In the train-model folder, update the specifics, including the Model ID.
Below is the output. Note the confidence score of 1.
If you want to try out this module, here is the link:
https://docs.microsoft.com/en-us/learn/modules/work-form-recognizer/