GitHub - microsoft/content-processing-solution-accelerator: Programmatically extract data and apply schemas to unstructured documents across text-based and multi-modal content using Azure AI Foundry, Azure OpenAI, Azure AI Content Understanding, and Cosmos DB.

Warning

Important UpdateWe've made major updates to Agentic Content Processing that include breaking changes. If you need the previous version, you can still find it in the v1 branch.

Process multi-document claims by extracting data from each document, applying schemas with confidence scoring, and generating AI-powered summaries and gap analysis across the entire claim. Upload multiple files — invoices, forms, images, contracts — to a single claim, and the solution automatically processes each document through a multi-modal content extraction pipeline, then orchestrates cross-document summarization and gap identification using an Agent Framework Workflow Engine.

The core content processing engine supports text, images, tables and graphs with schema-based transformation and confidence scoring. These capabilities can be applied to numerous use cases including: insurance claims processing, contract review, invoice processing, ID verification, and logistics shipment record processing.

SOLUTION OVERVIEW | QUICK DEPLOY | BUSINESS SCENARIO | SUPPORTING DOCUMENTATION

Note: With any AI solutions you create using these templates, you are responsible for assessing all associated risks and for complying with all applicable laws and safety standards. Learn more in the transparency documents for Agent Service and Agent Framework.

Solution overview

This accelerator leverages Azure AI Foundry, Azure AI Content Understanding Service, Azure OpenAI Service GPT-5.1, Azure Blob Storage, Azure Cosmos DB, and Azure Container Apps to process multi-document claims through a two-level architecture:

Claim Processing Workflow — Upload multiple documents to a claim via the Web UI. The Content Process Workflow (built on the Agent Framework Workflow Engine) orchestrates document extraction, AI-powered summarization, and gap analysis across all documents in the claim.
Content Processing Pipeline — The core engine (carried forward from v1) that processes each individual document through a 4-stage pipeline: Extract → Map → Evaluate → Save, with confidence scoring for extraction accuracy and schema mapping.

Processing, extraction, schema transformation, summarization, and gap analysis steps are tracked with status and scored for accuracy to automate processing and identify as-needed human validation.

Solution architecture

Click to view detailed architecture diagram

graph TB subgraph UserInterface["🖥️ User Interface"] WEB["Claim Process Monitor Web
(React / TypeScript / NGINX)
ca-name-web"] end

subgraph API_Layer["🔗 Content Process API — Gateway"]
    API["<b>Content Process API</b><br/>(FastAPI / Python)<br/>ca-name-api"]
    API_WF["/claimprocessor/*<br/>Workflow Endpoints"]
    API_CP["/contentprocessor/*<br/>Content Processor Endpoints"]
    API_SV["/schemasetvault/*<br/>Schema Set Endpoints"]
    API_SC["/schemavault/*<br/>Schema Endpoints"]
    API --- API_WF
    API --- API_CP
    API --- API_SV
    API --- API_SC
end

subgraph Queues["📨 Azure Storage Queues"]
    Q_CLAIM["claim-process-queue"]
    Q_DLQ["claim-process-dead-letter-queue"]
    Q_EXTRACT["content-pipeline-extract-queue"]
end

subgraph Workflow["⚙️ Content Process Workflow — Agent Framework"]
    WF["<b>Content Process Workflow</b><br/>(Agent Framework Workflow Engine)<br/>ca-name-wkfl"]
    WF_S1["Stage 1: Document Processing<br/>(DocumentProcessExecutor)<br/>Invokes Content Processor per document"]
    WF_S2["Stage 2: Summarizing<br/>(SummarizeExecutor)<br/>AI summary across all docs"]
    WF_S3["Stage 3: Gap Analysis<br/>(GapExecutor)<br/>AI gap identification"]
    WF --> WF_S1 --> WF_S2 --> WF_S3
end

subgraph Processor["📄 Content Processor — 4-Stage Pipeline"]
    CP["<b>Content Processor</b><br/>(Python / Queue Worker)<br/>ca-name-app"]
    CP_E["1. Extract<br/>(Azure AI Content Understanding)"]
    CP_M["2. Map<br/>(GPT-5.1 Vision)"]
    CP_V["3. Evaluate<br/>(Merge & Score)"]
    CP_S["4. Save<br/>(Blob + Cosmos DB)"]
    CP --> CP_E --> CP_M --> CP_V --> CP_S
end

subgraph AzureAI["🧠 Azure AI Services"]
    AICU["<b>Azure AI Content<br/>Understanding</b>"]
    AOAI["<b>Azure OpenAI</b><br/>GPT-5.1"]
end

subgraph DataStores["💾 Data & Storage"]
    BLOB["<b>Azure Blob Storage</b><br/>Documents, Manifests, Results"]
    COSMOS["<b>Azure Cosmos DB</b><br/>Processes | Schemas | claimprocesses"]
end

subgraph Config["🔧 Configuration & Infrastructure"]
    APPCONFIG["<b>App Configuration</b>"]
    ACR["<b>Container Registry</b>"]
    CAE["<b>Container App Environment</b>"]
    LOG["<b>Log Analytics</b>"]
end

%% Main flow
WEB -->|"HTTP"| API
API_WF -->|"enqueue claim"| Q_CLAIM
Q_CLAIM -->|"dequeue"| WF
WF_S1 -->|"HTTP per document"| API_CP
API_CP -->|"enqueue document"| Q_EXTRACT
Q_EXTRACT -->|"dequeue"| CP
WF -->|"failed messages"| Q_DLQ

%% AI service connections
CP_E -->|"OCR & layout"| AICU
CP_M -->|"vision extraction"| AOAI
WF_S2 -->|"summarization"| AOAI
WF_S3 -->|"gap analysis"| AOAI

%% Data store connections
CP_S -->|"save results"| BLOB
CP_S -->|"save results"| COSMOS
WF -->|"claim status & results"| COSMOS
WF_S1 -->|"download manifest"| BLOB
API -->|"read/write"| COSMOS
API -->|"read/write"| BLOB

%% Config connections
API -.->|"settings"| APPCONFIG
WF -.->|"settings"| APPCONFIG
CP -.->|"settings"| APPCONFIG
ACR -.->|"images"| CAE
CAE -.-> LOG

Agentic architecture

The claim processing workflow is built on the Agent Framework's Workflow Engine — a DAG-based event-streaming execution model that orchestrates specialized AI agents across the claim lifecycle. Each stage is an autonomous Executor that receives context, performs its task, and passes results downstream.

Click to view detailed agentic architecture diagram

flowchart TB subgraph Trigger["📨 Trigger"] QUEUE["Azure Storage Queue
claim-process-queue"] end

subgraph WorkflowEngine["⚙️ Agent Framework — Workflow Engine"]
    direction TB
    WB["WorkflowBuilder<br/><i>DAG graph construction</i>"]

    subgraph Agent1["🤖 Agent 1 — DocumentProcessExecutor"]
        A1_DESC["For each document in the claim:<br/>• Submit to Content Processor API<br/>• Trigger 4-stage extraction pipeline<br/>(Extract → Map → Evaluate → Save)<br/>• Poll for completion & collect results"]
    end

    subgraph Agent2["🤖 Agent 2 — SummarizeExecutor"]
        A2_DESC["Consolidated AI summary:<br/>• Receives all extraction results<br/>• Sends to Azure OpenAI GPT-5.1<br/>• Generates cross-document summary<br/>(claimant, vehicle, incident, costs)"]
    end

    subgraph Agent3["🤖 Agent 3 — GapExecutor"]
        A3_DESC["Gap & discrepancy analysis:<br/>• Evaluates YAML DSL ruleset<br/>• Detects missing documents<br/>• Flags cross-document discrepancies<br/>• Sends to Azure OpenAI GPT-5.1"]
    end

    WB --> Agent1 --> Agent2 --> Agent3
end

subgraph AIServices["🧠 Azure AI Services"]
    AICU["Azure AI Content<br/>Understanding"]
    AOAI["Azure OpenAI<br/>GPT-5.1"]
end

subgraph Persistence["💾 Results"]
    COSMOS["Azure Cosmos DB"]
    BLOB["Azure Blob Storage"]
end

QUEUE --> WorkflowEngine
Agent1 -->|"OCR & layout"| AICU
Agent1 -->|"vision extraction"| AOAI
Agent2 -->|"summarization"| AOAI
Agent3 -->|"gap analysis"| AOAI
Agent3 -->|"persist results"| COSMOS
Agent3 -->|"persist results"| BLOB

Capability	Detail
Execution model	DAG-based workflow with event streaming, concurrent workers, and retry logic
Agent orchestration	WorkflowBuilder registers executors, defines edges, and builds a frozen Workflow graph
Executor pattern	Each agent is an Executor subclass with @handler-decorated async methods
Fault tolerance	Exponential backoff retries, dead-letter queue (claim-process-dead-letter-queue), graceful shutdown
Extensibility	Add new agents (executors) and edges to the DAG without modifying existing stages

Additional resources

For detailed technical information, see the component documentation:

Technical Architecture

Document Processing Pipeline (4-stage extraction)

Claim Processing Workflow (Agent Framework)

Golden Path Workflows (end-to-end walkthroughs)

If you'd like to customize the solution accelerator, here are some common areas to start:

Adding your own Schemas and Data

Modifying System Processing Prompts

Gap Analysis Ruleset Guide (YAML DSL — no-code rule authoring)

API Reference for Content Processing & Claim Management

Customizing the Claim Processing Workflow

Features

Key features

Click to learn more about the key features this solution enables

Multi-document claim processing
Upload multiple files to a single claim and process them as a batch. The claim workflow orchestrates content extraction for each document, then performs cross-document summarization and gap analysis.
Multi-modal content processing
Core extraction engine utilizes machine learning-based OCR for efficient text extraction and integrates GPT-5.1 Vision for processing various content formats including text, images, tables, and graphs.
AI-powered summarization & gap analysis
After all documents in a claim are processed, GPT-5.1 generates a consolidated summary and performs gap analysis — detecting missing documents and flagging cross-document discrepancies across the claim.
No-code gap analysis ruleset (YAML DSL)
Gap analysis rules are defined in a reusable YAML-based Domain-Specific Language — domain experts can add, modify, or replace rules without writing code. The same DSL format is portable across industries (insurance, logistics, legal, finance). See Gap Analysis Ruleset Guide.
Agent Framework Workflow Engine
Claim processing is orchestrated by a DAG-based workflow engine with event streaming, concurrent workers, retry logic, and dead-letter queue support for production reliability.
Schema-based data transformation
Maps extracted content to custom or industry-defined schemas and outputs as JSON for interoperability.
Confidence scoring
Calculation of entity extraction and schema mapping processes for accuracy, providing scores to drive manual human-in-the-loop review, if desired.
Review, validate, update
Transparency in reviewing processing steps, summaries, and gap analysis — allowing for review, comparison to source asset, ability to modify output results, and annotation for historical reference.
API driven processing pipelines
API endpoints are available for claim lifecycle management, content processing, schema management, and external source system integration.

Getting Started

Quick deploy

How to install or deploy

Follow the quick deploy steps on the deployment guide to deploy this solution to your own Azure subscription.

Note: This solution accelerator requires Azure Developer CLI (azd) version 1.18.0 or higher. Please ensure you have the latest version installed before proceeding with deployment. Download azd here.

Note: This solution accelerator also requires Bicep CLI version 0.33.0 or higher for compiling infrastructure templates. Install Bicep.

Click here to launch the deployment guide

Note: Some tenants may have additional security restrictions that run periodically and could impact the application (e.g., blocking public network access). If you experience issues or the application stops working, check if these restrictions are the cause. In such cases, consider deploying the WAF-supported version to ensure compliance. To configure, Click here.

⚠️ Important: Check Azure OpenAI Quota Availability
To ensure sufficient quota is available in your subscription, please follow quota check instructions guide before you deploy the solution.

🛠️ Need Help? Check our Troubleshooting Guide for solutions to common deployment issues.

Guidance

Prerequisites and costs

To deploy this solution accelerator, ensure you have access to an Azure subscription with the necessary permissions to create resource groups, resources, app registrations, and assign roles at the resource group level. This should include Contributor role at the subscription level and Role Based Access Control role on the subscription and/or resource group level. Follow the steps in Azure Account Set Up.

Here are the supported regions for deployment: Australia East, Central US, East Asia, East US 2, Japan East, North Europe, Southeast Asia, UK South.

Check the Azure Products by Region page and select a region where the following services are available.

Pricing varies per region and usage, so it isn't possible to predict exact costs for your usage. The majority of the Azure resources used in this infrastructure are on usage-based pricing tiers. However, Azure Container Registry has a fixed cost per registry per day.

Use the Azure pricing calculator to calculate the cost of this solution in your subscription.

Review a sample pricing sheet in the event you want to customize and scale usage.

Note: This is not meant to outline all costs as selected SKUs, scaled use, customizations, and integrations into your own tenant can affect the total consumption of this sample solution. The sample pricing sheet is meant to give you a starting point to customize the estimate for your specific needs.

⚠️ Important: To avoid unnecessary costs, remember to take down your app if it's no longer in use, either by deleting the resource group in the Portal or running azd down.

Resources

Product	Description	Tier / Expected Usage Notes	Cost
Azure AI Foundry	Build generative AI applications on an enterprise-grade platform	Free Tier	Pricing
Azure OpenAI Service	Provides REST API access to OpenAI's powerful language models including GPT-5.1 for content extraction, summarization, and gap analysis	S0 Tier; pricing depends on token volume and model used.	Pricing
Azure AI Content Understanding Service	Analyzes various media content—such as audio, video, text, and images—transforming it into structured, searchable data	S0 Tier; pricing based on pages and transactions processed.	Pricing
Azure Blob Storage	Microsoft's object storage solution for the cloud. Blob storage is optimized for storing massive amounts of unstructured data	Standard Tier; pricing based on storage and transactions.	Pricing
Azure Container Apps	Allows you to run containerized applications without worrying about orchestration or infrastructure.	Consumption plan; pricing based on vCPU and memory usage.	Pricing
Azure Container Registry	Build, store, and manage container images and artifacts in a private registry for all types of container deployments	Basic Tier; fixed daily cost per registry.	Pricing
Azure Cosmos DB	Fully managed, distributed NoSQL, relational, and vector database for modern app development	Serverless or provisioned throughput; pricing based on request units and storage.	Pricing
Azure Queue Storage	Store large numbers of messages and access messages from anywhere in the world via HTTP or HTTPS.	Standard Tier; pricing based on number of transactions.	Pricing
GPT Model Capacity	GPT-5.1 model with multimodal capabilities, accepting both text and images as input for extraction, summarization, and gap analysis	Pricing depends on token volume and model used.	Pricing

For detailed cost estimation and pricing information, see the Deployment Guide.

Business scenario

The included sample scenario demonstrates a First Notice of Loss (FNOL) workflow for an auto insurance company. A claims analyst receives incoming collision claims — each containing an auto insurance claim form, police report, repair estimate, and photos of vehicle damage — that need to be processed together.

The analyst creates a claim in the system and uploads all documents. The workflow engine automatically:

Extracts structured data from each document — policyholder info, vehicle details (VIN, plate), incident description, and repair line items — using AI Content Understanding and GPT-5.1 Vision.
Summarizes findings across the entire claim — producing a consolidated report covering claimant details, vehicle info, incident narrative, damage assessment, and estimated costs.
Runs gap & discrepancy analysis using YAML DSL rules — detecting missing documents (e.g., no police report for a theft claim, no repair estimate when the loss exceeds $2,000) and flagging cross-document discrepancies (e.g., mismatched claim numbers, VIN conflicts between the claim form and police report, or date-of-loss differences).

The analyst reviews the AI-generated summary for a quick overview, checks gap analysis results for flagged issues and discrepancies, and drills into individual document extractions when needed. What previously required hours of manual cross-referencing is now orchestrated in minutes.

Sample data included: The claim_date_of_loss/ folder provides a complete claim package (claim form + police report + repair estimate + damage photo). The claim_hail/ folder provides a claim with a subset of documents (claim form + repair estimate + damage photo) to demonstrate gap detection. See Golden Path Workflows for a step-by-step walkthrough.

⚠️ The sample data used in this repository is synthetic and generated using Azure OpenAI service. The data is intended for use as sample data only.

Business value

Click to learn more about what value this solution provides

Automated claim intake
Upload claim forms, police reports, repair estimates, and damage photos to a single claim. The workflow engine orchestrates extraction, summarization, and gap analysis automatically — reducing intake processing from hours to minutes.
Cross-document intelligence
AI-powered summarization consolidates findings across all documents in a claim, while gap analysis identifies missing documents and flags discrepancies — such as mismatched claim numbers, VIN conflicts, or date-of-loss differences between the claim form and police report.
No-code gap rules
Gap analysis rules are defined in a YAML DSL that domain experts can modify without writing code — add required document checks, adjust severity levels, or define new discrepancy rules across industries.
Confidence-driven human-in-the-loop
Extraction and schema mapping are scored for accuracy using dual confidence signals (OCR-level and model log-probability), so analysts can focus human review on low-confidence results while high-confidence data flows through automatically.
Verifiable review & audit trail
Review AI-generated extractions, summaries, and gap analysis results side-by-side with source documents. Annotate changes, add comments, and compare processing steps for transparency and audit readiness.
Responsible AI safety gate
A built-in RAI executor screens every document's extracted content against 10 safety categories — including self-harm, violence, prompt injection, and discriminatory content — before further processing, helping ensure only safe content reaches downstream workflows.
Schema-driven extensibility
Define custom Pydantic schemas to extract structured data from any document type. The same extraction pipeline generalizes across industries — insurance, logistics, legal, finance — without code changes.
Production-ready orchestration
The Agent Framework Workflow Engine provides DAG-based execution with event streaming, retry logic, dead-letter queues, and graceful shutdown — designed for reliable, scalable processing in production environments.
API-first integration
All capabilities — claim lifecycle, content processing, schema management — are exposed through REST APIs, enabling integration with existing line-of-business systems, RPA workflows, and custom applications.

Supporting documentation

Security guidelines

This template uses Azure App Configuration for centralized configuration management and Managed Identity for secure service-to-service authentication — no connection strings or secrets are stored in application code.

To maintain strong security practices, it is recommended that GitHub repositories built on this solution enable GitHub secret scanning to detect accidental secret exposure.

Additional security considerations include:

Enabling Microsoft Defender for Cloud to monitor and secure Azure resources.
Using Virtual Networks or firewall rules to protect Azure Container Apps from unauthorized access.
Implementing authentication and authorization for the frontend application using Microsoft Entra ID or other identity providers.

Cross references

Check out similar solution accelerators

Solution Accelerator	Description
Document knowledge mining	Process and extract summaries, entities, and metadata from unstructured, multi-modal documents and enable searching and chatting over this data.
Conversation knowledge mining	Derive insights from volumes of conversational data using generative AI. It offers key phrase extraction, topic modeling, and interactive chat experiences through an intuitive web interface.
Document generation	Identify relevant documents, summarize unstructured information, and generate document templates.

Provide feedback

Have questions, find a bug, or want to request a feature? Submit a new issue on this repo and we'll connect.

Responsible AI Transparency FAQ

Please refer to Transparency FAQ for responsible AI transparency details of this solution accelerator.

Disclaimers

To the extent that the Software includes components or code used in or derived from Microsoft products or services, including without limitation Microsoft Azure Services (collectively, "Microsoft Products and Services"), you must also comply with the Product Terms applicable to such Microsoft Products and Services. You acknowledge and agree that the license governing the Software does not grant you a license or other right to use Microsoft Products and Services. Nothing in the license or this ReadMe file will serve to supersede, amend, terminate or modify any terms in the Product Terms for any Microsoft Products and Services.

You must also comply with all domestic and international export laws and regulations that apply to the Software, which include restrictions on destinations, end users, and end use. For further information on export restrictions, visit https://aka.ms/exporting.

You acknowledge that the Software and Microsoft Products and Services (1) are not designed, intended or made available as a medical device(s), and (2) are not designed or intended to be a substitute for professional medical advice, diagnosis, treatment, or judgment and should not be used to replace or as a substitute for professional medical advice, diagnosis, treatment, or judgment. Customer is solely responsible for displaying and/or obtaining appropriate consents, warnings, disclaimers, and acknowledgements to end users of Customer's implementation of the Online Services.

You acknowledge the Software is not subject to SOC 1 and SOC 2 compliance audits. No Microsoft technology, nor any of its component technologies, including the Software, is intended or made available as a substitute for the professional advice, opinion, or judgement of a certified financial services professional. Do not use the Software to replace, substitute, or provide professional financial advice or judgment.

BY ACCESSING OR USING THE SOFTWARE, YOU ACKNOWLEDGE THAT THE SOFTWARE IS NOT DESIGNED OR INTENDED TO SUPPORT ANY USE IN WHICH A SERVICE INTERRUPTION, DEFECT, ERROR, OR OTHER FAILURE OF THE SOFTWARE COULD RESULT IN THE DEATH OR SERIOUS BODILY INJURY OF ANY PERSON OR IN PHYSICAL OR ENVIRONMENTAL DAMAGE (COLLECTIVELY, "HIGH-RISK USE"), AND THAT YOU WILL ENSURE THAT, IN THE EVENT OF ANY INTERRUPTION, DEFECT, ERROR, OR OTHER FAILURE OF THE SOFTWARE, THE SAFETY OF PEOPLE, PROPERTY, AND THE ENVIRONMENT ARE NOT REDUCED BELOW A LEVEL THAT IS REASONABLY, APPROPRIATE, AND LEGAL, WHETHER IN GENERAL OR IN A SPECIFIC INDUSTRY. BY ACCESSING THE SOFTWARE, YOU FURTHER ACKNOWLEDGE THAT YOUR HIGH-RISK USE OF THE SOFTWARE IS AT YOUR OWN RISK.

GitHub - microsoft/content-processing-solution-accelerator: Programmatically extract data and apply schemas to unstructured documents across text-based and multi-modal content using Azure AI Foundry, Azure OpenAI, Azure AI Content Understanding, and Cosmos DB. (original) (raw)