Browse/File Systems/PaddleOCR

PaddleOCR

Name: PaddleOCR
Rating: 4.5 (7289 reviews)
Author: paddlepaddle

by paddlepaddle · Added 26d ago

72.9k

Install

git clone https://github.com/paddlepaddle/paddleocr/tree/HEAD/mcp_server

About

This MCP server provides AI assistants with PaddleOCR text extraction and document structure parsing capabilities, built by PaddlePaddle Authors using Python with FastMCP and supporting both local library execution and remote service integration. The implementation offers two core tools: OCR for extracting text from images and PDFs with confidence scores and bounding box coordinates, and PP-StructureV3 for parsing complex documents into structured markdown with embedded images, tables, and formulas. Built with flexible deployment options supporting local PaddleOCR libraries, AI Studio cloud services, and self-hosted servers, plus multiple input formats including file paths, URLs, and Base64 data, it serves document digitization workflows, content management systems requiring text extraction, and research applications needing structured document analysis with precise layout preservation.

PaddleOCR

Install

About

Tags