Grooper 2025 - Azure Document Intelligence

Azure Document Intelligence is a Repository Option that connects Grooper to an existing Document Intelligence service in an Azure environment. It enables features that use Azure Document Intelligence for document analysis, including extraction of machine-printed and handwritten text (OCR), layout data collection, and document structure analysis.

Overview of Azure Document Intelligence in Grooper

Azure Document Intelligence is a cloud service from Microsoft that enables optical character recognition (OCR) and document analysis. Grooper's Azure Document Intelligence integration allows organizations to leverage Azure’s advanced machine learning models for raw text extraction, layout and structure analysis, and semantic understanding. A wide variety of document types are supported, including both machine-printed and handwritten content.

Grooper connects to an Azure Document Intelligence service through the Azure Document Intelligence Repository Option. This option is added on the Grooper database Root node and is configured by entering the service's API key and resource name.

How Grooper uses Azure Document Intelligence

With the Azure Document Intelligence option added and configured, Grooper leverages the Document Intelligence service in two primary ways: DI Analyze and Azure DI OCR.

FYI: DI Analyze produces JSON data files that are used by the DI Layout quoting method when configuring AI-enabled features.

FYI:

What if I want to use both DI Analyze and Azure DI OCR? Does that mean the document gets sent to Azure twice?

No. However, you need to run DI Analyze first.

Running DI Analyze generates a set of JSON files for each document and its child pages. When an OCR Profile uses Azure DI OCR, it first checks for the existence of these files. If they are found, Grooper retrieves the text and layout data from the JSON rather than making a duplicate call to the Azure Document Intelligence service.
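
The reuse behavior described above amounts to a check-the-cache-first pattern. The Python sketch below illustrates the idea only; it is not Grooper's implementation. The file naming scheme (`<page>.di.json`), the `content` key, and the `call_azure` callback are all hypothetical, since this article does not document how Grooper names or structures the JSON files.

```python
import json
from pathlib import Path
from typing import Callable


def get_page_text(page_id: str, cache_dir: Path,
                  call_azure: Callable[[str], dict]) -> str:
    """Return a page's text, preferring a saved analysis result.

    cache_dir  -- folder holding previously saved analysis JSON (hypothetical layout)
    call_azure -- fallback that sends the page to the service and returns its result
    """
    cached = cache_dir / f"{page_id}.di.json"  # hypothetical naming scheme
    if cached.exists():
        # A prior analysis exists: read it instead of calling the service again.
        result = json.loads(cached.read_text(encoding="utf-8"))
    else:
        # No saved analysis: this is the only path that contacts the service.
        result = call_azure(page_id)
    # Assumption: the recognized text is stored under a "content" key.
    return result.get("content", "")
```

The point of the pattern is that the service call sits only in the fallback branch, which is why the order matters: the saved analysis must already exist when OCR runs for the duplicate call to be avoided.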

Key similarities and differences between DI Analyze and Azure DI OCR

Similarities

Azure DI OCR and DI Analyze have several things in common.

Differences

While both methods utilize Azure Document Intelligence, they differ in scope, output, and intended use:

Azure DI OCR

DI Analyze

Adding and configuring the Azure Document Intelligence option

You must add the Azure Document Intelligence option to the Grooper Root before using Azure DI OCR or DI Analyze. These features will not function without it.

There are two primary steps required to connect Grooper to Azure Document Intelligence.

1. Create an Azure Document Intelligence resource

Azure Document Intelligence is a cloud-based service and must be provisioned in the Azure portal before it can be used by Grooper. Instructions for creating a Document Intelligence resource are available in Microsoft’s Create a Document Intelligence resource article.

2. Add and configure an Azure Document Intelligence repository option in Grooper

Once the Document Intelligence resource is available in Azure, Grooper can be connected to it using the Azure Document Intelligence repository option. Configuration is straightforward and requires the service’s API key and resource name, both of which can be obtained from the Azure portal.

To add and configure the Azure Document Intelligence option in Grooper:

  1. From the Design page, go to the database Root node.
  2. Open the "Options" editor (press the "..." button).
  3. Press the Add button and select "Azure Document Intelligence" from the dropdown.
  4. In the "API Key" property, enter your Document Intelligence service's API key (from your Azure portal).
  5. In the "Resource Name" property, enter the Document Intelligence service's resource name (from your Azure portal).
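
Before entering these values in Grooper, it can help to confirm that the API key and resource name work together. The Python sketch below is not part of Grooper. It assumes the resource uses the common endpoint pattern `https://<resource-name>.cognitiveservices.azure.com`; the `documentintelligence/info` path and the `api-version` value are also assumptions that should be checked against Microsoft's REST reference for your service version. `build_info_request` and `check_credentials` are hypothetical helper names.

```python
import json
import urllib.request

API_VERSION = "2024-11-30"  # assumption: adjust to the version your resource supports


def build_info_request(resource_name: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) a GET request for the service's info endpoint."""
    # Assumption: the endpoint host is derived directly from the resource name.
    endpoint = f"https://{resource_name.strip()}.cognitiveservices.azure.com"
    url = f"{endpoint}/documentintelligence/info?api-version={API_VERSION}"
    # Azure AI services accept the key in the Ocp-Apim-Subscription-Key header.
    return urllib.request.Request(
        url, headers={"Ocp-Apim-Subscription-Key": api_key.strip()}
    )


def check_credentials(resource_name: str, api_key: str) -> dict:
    """Send the request and return the parsed JSON response.

    Raises urllib.error.HTTPError (for example 401) if the key is rejected,
    or urllib.error.URLError if the host cannot be reached.
    """
    request = build_info_request(resource_name, api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

Calling `check_credentials("my-resource", "my-key")` with real values should return without raising if the pair is valid. If the endpoint shown in your Azure portal differs from the pattern assumed here, use the portal's value instead.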