Extract data from forms with Form Recognizer

Azure Form Recognizer is a Vision API that extracts key-value pairs and table data from form documents.

Uses of the Form Recognizer service include:

Process automation
Knowledge mining
Industry-specific applications

Form Recognizer uses Optical Character Recognition (OCR) capabilities and deep learning models to extract text, key-value pairs, selection marks, and tables from documents.

OCR captures document structure by creating bounding boxes around detected objects in an image.
The locations of the bounding boxes are recorded as coordinates in relation to the rest of the page.
Form Recognizer services return bounding box data and other information in a structured form with the relationships from the original file.

Form Recognizer is composed of the following services:

Document analysis models: which take an input of JPEG, PNG, PDF, and TIFF files and return a JSON file with the location of text in bounding boxes, text content, tables, selection marks (also known as checkboxes or radio buttons), and document structure.
Prebuilt models: which detect and extract information from document images and return the extracted data in a structured JSON output. Form Recognizer currently supports prebuilt models for several forms, including:
- W-2 forms
- Invoices
- Receipts
- ID documents
- Business cards
Custom models: custom models extract data from forms specific to your business. Custom models can be trained by calling the Build model API, or through Form Recognizer Studio.

Understand Form Recognizer file input requirements

Form Recognizer works on input documents that meet these requirements:

Format must be JPG, PNG, BMP, PDF (text or scanned), or TIFF.
The file size must be less than 500 MB for paid (S0) tier and 4 MB for free (F0) tier.
Image dimensions must be between 50 x 50 pixels and 10000 x 10000 pixels.
The total size of the training data set must be 500 pages or less.

More input requirements can be found in the documentation for specific models.

Create a Form Recognizer resource