Grooper 2025 - Azure Document Intelligence Config
Azure Document Intelligence is a Repository Option that connects Grooper to an existing Document Intelligence service in an Azure environment. This enables several features that leverage Azure Document Intelligence for exceptional document analysis, including machine print and handwritten text extraction (OCR), layout data collection, and document structure analysis.
Azure Document Intelligence is a cloud service from Microsoft that enables optical character recognition (OCR) and document analysis. Grooper's Azure Document Intelligence integration allows organizations to leverage Azure’s advanced machine learning models for raw text extraction, layout and structure analysis, and semantic understanding. A wide variety of document types are supported, including both machine-printed and handwritten content.
Grooper connects to an Azure Document Intelligence service by enabling and configuring the Azure Document Intelligence Repository Option. This option is configured on the Grooper database Root node and provides connectivity by entering an API key and resource name.
With the Azure Document Intelligence option added and configured, Grooper leverages the Document Intelligence service in two primary ways:
FYI: This analysis produces JSON data files that are used by the DI Layout quoting method when configuring AI-enabled features.
What if I want to use both DI Analyze and Azure DI OCR? Does that mean the document gets sent to Azure twice?
No. However, you need to run DI Analyze first.
Running DI Analyze generates a set of JSON files for each document and its child pages. When an OCR Profile uses Azure DI OCR, it first checks for the existence of these files. If they are found, Grooper retrieves the text and layout data from the JSON rather than making a duplicate call to the Azure Document Intelligence service.
Similarities
Azure DI OCR and DI Analyze have several things in common.
Differences
While both methods utilize Azure Document Intelligence, they differ in scope, output, and intended use:
You must add the Azure Document Intelligence option to the Grooper Root before using Azure DI OCR or DI Analyze. These features will not function without it.
There are two primary steps required to connect Grooper to Azure Document Intelligence.
Azure Document Intelligence is a cloud-based service and must be provisioned in the Azure portal before it can be used by Grooper. Instructions for creating a Document Intelligence resource are available in Microsoft’s Create a Document Intelligence resource article.
Once the Document Intelligence resource is available in Azure, Grooper can be connected to it using the Azure Document Intelligence repository option. Configuration is straightforward and requires the service’s API key and resource name, both of which can be obtained from the Azure portal.
To add and configure the Azure Document Intelligence option in Grooper: