azure-document-intelligence

Here are 33 public repositories matching this topic...

kimtth / rag-multimodal-semantic-chunking

🖼️📄 E2E Multi-modal Document Preprocessing with Azure Document Intelligence

ocr workshop chunking image-understanding azure-document-intelligence rag-preparation

Updated Oct 22, 2025
Python

ks6088ts-labs / workshop-azure-openai

Workshop for Azure OpenAI Service

python opencv poetry azure-cosmos-db streamlit ultralytics azure-openai azure-ai-search azure-document-intelligence

Updated Nov 24, 2025
Python

renswickd / document-parser-collection

A collection of document parsers and hands-on examples for constructing structured data for your RAG applications.

amazon-textract document-parsing azure-document-intelligence llama-parse unstructured-io mistral-ocr

Updated Aug 17, 2025
Python

emnaot / ProfilFlow

AI-Powered Web Application for Talent Search and CV Management

reactjs webapp dotnet-core api-rest azure-active-directory graphapi azure-cosmosdb talent-management azure-entra-id azure-document-intelligence cv-management

Updated Oct 7, 2025
JavaScript

AlexandreSajus / AutoBudget

An application that automatically parses bank statements to visualize current income and spending compared to budgeting and savings targets.

personal-finance budgeting budget-app azure-document-intelligence

Updated Oct 14, 2025
Python

dignite-projects / dignite-paperbase

A channel layer that turns physical documents into trustworthy digital data — OCR + Markdown + metadata + optional field extraction, exposed via REST / EventBus / MCP / Webhook to downstream RAG platforms, business systems, and AI clients. Built on ABP.

markdown multi-tenant ocr csharp dotnet mcp abp document-processing rag abp-framework paddleocr document-digitization llm azure-document-intelligence mcp-server

Updated Jun 7, 2026
C#

setuc / pdf-annotation-with-azure-doc-intel

Azure Document Intelligence Result Processor: A toolset for annotating PDFs based on Azure Document Intelligence analysis results, featuring a React web application and a standalone Python script for processing and visualizing extracted data with confidence indicators.

react javascript python vite pdf-annotation pdf-processing confidence-scores form-recognizer azure-document-intelligence

Updated Mar 12, 2025
JavaScript

boorjanunezz / SmartInvoice-ETL

Intelligent solution for digitizing and managing invoices. Transforms unstructured PDF documents into processable SQL data using AI, optimizing the financial workflow.

python automation sql-server etl invoice-processing azure-document-intelligence

Updated Feb 17, 2026
Python

KuchikiRenji / pypdftotext

OCR-enabled PDF text extraction in Python with pypdf and Azure Document Intelligence.

python pdf ocr aws-s3 text-extraction pdf-parsing pypdf document-intelligence pdf-ocr azure-document-intelligence

Updated Jan 31, 2026
Python

TejaTalachiru / Multimodal_RAG

python serverless pillow openai azure-functions vector-graphics azure-blob-storage rag azure-devops vector-database ai-chatbot gpt-4 azure-openai azure-ai-search azure-document-intelligence

Updated Apr 29, 2025
Python

lucereal / CheckMates

Frontend and Backend Web App for Receipt Splitting with Friends

react mongodb azure asp signalr azure-document-intelligence real-time-co

Updated Oct 6, 2024
JavaScript

esolnguyen / multi-agent-extraction

Multi-agent parallel document extraction using Gemini LLM and Azure Document Intelligence OCR, running locally in Docker.

python docker multi-agent parallel-processing document-extraction pdf-extraction llm azure-document-intelligence

Updated May 17, 2026
Python

kimtth / azure-document-intelligence-vs-markitdown-vs-tika

PDF extraction samples comparing Azure Document Intelligence (layout model) 🏢 vs Markitdown ✍️ vs Apache Tika

pdf-parsing tika-python markitdown azure-document-intelligence

Updated Jul 4, 2025
Python

GothiProCoder / OCR-System

🚀 Intelligent document extraction system powered by Azure AI & Gemini 2.5. Transform any form into structured JSON with real-time editing and enterprise-grade validation.

ocr ocr-engine ocr-recognition document-extraction langgraph azure-document-intelligence azure-document-intelligence-ocr langgraph-agents

Updated Jan 27, 2026
Python

saishagoel27 / scribbly

Scribbly - Convert your boring notes into interactive flashcards using Azure Text Analytics and Azure Document Intelligence

css typescript ocr gemini azure-text-analytics azure-document-intelligence

Updated Oct 5, 2025
Python

divadsn / evelstar-invoices-app

A simple Python Tk app to automate the process of uploading invoices to the Evelstar courier portal.

python invoices tkinter-gui azure-ml azure-document-intelligence

Updated Mar 24, 2025
Python

BigDataIA-Spring2025-4 / Web-and-PDF-Data-Extraction-Tool

A Streamlit-based app with a FastAPI backend for extracting structured data (text, images, tables) from websites and PDFs. Processed data is stored in AWS S3 and rendered in a standardized Markdown format. The APIs are deployed on Google Cloud Run.

docker aws-s3 pdf-converter python3 scrapy diffbot webscraping web-data-extraction diffbot-api beautifulsoup4 pymupdf pdf-document-processor google-cloud-run streamlit azure-document-intelligence doclin

Updated Jan 31, 2025
Jupyter Notebook

DINAKAR-S / Azure-Document-Intelligence

Enterprise AI system to classify, split, and auto-route PDFs using Azure Document Intelligence and SharePoint.

automation ai azure python3 ocr-recognition microsoft-graph document-processing azure-ai azure-ai-services azure-document-intelligence azure-document-intelligence-ocr azure-ai-foundry document-processing-ocr

Updated Jan 25, 2026
Python

msaleh1888 / azure-serverless-invoice-extraction

Serverless invoice extraction API using Azure Document Intelligence and Azure Functions. Upload a PDF invoice and receive normalized JSON output including line items, totals, dates, and vendor details.

Updated Dec 3, 2025
Python

ssgrummons / RedactifAI

Uses OCR and PII detection models to mask PII in .tiff files. Configurable to use either Azure or AWS OCR and PII detection models.

celery textract ocr-recognition pii-detection comprehend-medical azure-document-intelligence

Updated Feb 10, 2026
Python
