This project provides a lightweight [Model Context Protocol (MCP)](https://modelcontextprotocol.io/introduction) server designed to integrate PaddleOCR capabilities into various LLM applications.
- **PP-StructureV3**: Identifies and extracts text blocks, titles, paragraphs, images, tables, and other layout elements from images or PDF files, converting the input into Markdown documents.
- **PaddleOCR-VL**: Provides the same layout parsing and Markdown conversion capabilities as PP-StructureV3, but uses a VLM-based approach.
- **Local Python Library**: Runs PaddleOCR pipelines directly on the local machine (see the usage sketch after this list). This mode requires a suitable local environment and hardware, and is ideal for offline use or privacy-sensitive scenarios.
- **PaddleOCR Official Website Service**: Invokes services provided by the [PaddleOCR Official Website](https://aistudio.baidu.com/paddleocr?lang=en). This is suitable for quick testing, prototyping, or no-code scenarios.
- **Self-hosted Service**: Invokes a PaddleOCR service hosted by the user. This mode combines the benefits of service-based deployment with high flexibility. It is suitable for scenarios requiring customized service configurations, as well as those with strict data privacy requirements. **Currently, only the basic serving solution is supported.**
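For reference, the following minimal sketch shows the kind of PaddleOCR pipeline call that the local Python library mode wraps. It assumes PaddlePaddle and PaddleOCR are installed locally; `sample.pdf` and the output path are placeholders, and the result-saving methods should be checked against the PaddleOCR documentation for your installed version.

```python
# Sketch of the PP-StructureV3 pipeline call that the local Python library
# mode wraps (assumes PaddlePaddle and PaddleOCR 3.x are installed locally;
# "sample.pdf" is a placeholder input file).
from paddleocr import PPStructureV3

pipeline = PPStructureV3()
for res in pipeline.predict("sample.pdf"):
    # Save each page's parsed layout as a Markdown document.
    res.save_to_markdown(save_path="output")
```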
In Claude for Desktop, extract handwritten content from images and save it to the note-taking app Notion. The PaddleOCR MCP server extracts text, formulas, and other information from the images while preserving document structure.
In VSCode, convert handwritten ideas or pseudocode into runnable Python scripts that comply with project coding standards with a single click, and upload them to a GitHub repository. The PaddleOCR MCP server extracts the handwritten code from the images for subsequent processing.
- In addition to the PaddleOCR MCP server, this demo also uses the [filesystem MCP server](https://github.com/modelcontextprotocol/servers/tree/main/src/filesystem).
In Claude for Desktop, convert PDF documents or images containing complex tables, formulas, handwritten text and other content into locally editable files.
- In addition to the PaddleOCR MCP server, this demo also uses the [filesystem MCP server](https://github.com/modelcontextprotocol/servers/tree/main/src/filesystem).
This section explains how to install the `paddleocr-mcp` library via pip.
- For the local Python library mode, you need to install `paddleocr-mcp` as well as the PaddlePaddle framework and PaddleOCR, as described in the [PaddleOCR installation documentation](../installation.en.md).
- For the PaddleOCR official website service or the self-hosted service modes, if used within MCP hosts like Claude for Desktop, the server can also be run without installation via tools like `uvx`. See [2. Using with Claude for Desktop](#2-using-with-claude-for-desktop) for details.
This section explains how to use the PaddleOCR MCP server within Claude for Desktop. The steps are also applicable to other MCP hosts with minor adjustments.
Refer to [1. Installation](#1-installation). To avoid dependency conflicts, **it is strongly recommended to install in an isolated virtual environment**.
- `PADDLEOCR_MCP_PIPELINE_CONFIG` is optional; if not set, the default pipeline configuration will be used. If you need to adjust the configuration, such as changing the model, please refer to the [PaddleOCR documentation](../paddleocr_and_paddlex.md) to export the pipeline configuration file, and set `PADDLEOCR_MCP_PIPELINE_CONFIG` to the absolute path of this configuration file.
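For example, a minimal sketch of exporting a configuration file for the general OCR pipeline might look like the following. It assumes the pipeline object exposes the `export_paddlex_config_to_yaml` method described in the PaddleOCR documentation; the model name and output file name are just illustrative choices.

```python
# Sketch: export an OCR pipeline configuration that swaps in a lighter
# recognition model, then point PADDLEOCR_MCP_PIPELINE_CONFIG at the file.
# Verify argument and method names against the installed PaddleOCR version.
from paddleocr import PaddleOCR

pipeline = PaddleOCR(text_recognition_model_name="PP-OCRv5_mobile_rec")
pipeline.export_paddlex_config_to_yaml("ocr_config.yaml")
# Set PADDLEOCR_MCP_PIPELINE_CONFIG to the absolute path of ocr_config.yaml.
```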
If you encounter issues such as long inference time or insufficient memory during use, you may consider adjusting the pipeline configuration according to the following recommendations.
- **OCR Pipeline**: It is recommended to switch to the `mobile` series models. For example, you can modify the pipeline configuration file to use `PP-OCRv5_mobile_det` for detection and `PP-OCRv5_mobile_rec` for recognition.
- **PP-StructureV3 Pipeline**:
    - Disable unused features, e.g., set `use_formula_recognition` to `False` to disable formula recognition.
    - Use lightweight models, such as replacing the OCR model with the `mobile` version or switching to a lightweight formula recognition model like PP-FormulaNet-S.
The following sample code can be used to obtain the pipeline configuration file, in which most optional features of the PP-StructureV3 pipeline are disabled, and some key models are replaced with lightweight versions.
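A sketch of such a script is shown below. It assumes a local PaddleOCR 3.x installation; the exact constructor arguments and the `export_paddlex_config_to_yaml` method should be verified against the PaddleOCR documentation for your installed version, and the output file name is a placeholder.

```python
# Sketch: build a PP-StructureV3 pipeline with most optional features
# disabled and lightweight OCR models, then export its configuration file.
# Verify argument names against the installed PaddleOCR version.
from paddleocr import PPStructureV3

pipeline = PPStructureV3(
    use_doc_orientation_classify=False,  # disable document orientation classification
    use_doc_unwarping=False,             # disable document unwarping
    use_textline_orientation=False,      # disable text line orientation classification
    use_formula_recognition=False,       # disable formula recognition
    use_seal_recognition=False,          # disable seal recognition
    use_chart_recognition=False,         # disable chart recognition
    text_detection_model_name="PP-OCRv5_mobile_det",
    text_recognition_model_name="PP-OCRv5_mobile_rec",
)

pipeline.export_paddlex_config_to_yaml("PP-StructureV3.yaml")
# Set PADDLEOCR_MCP_PIPELINE_CONFIG to the absolute path of PP-StructureV3.yaml.
```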
You can configure the MCP server to run in different working modes according to your requirements. The setup steps differ between modes and are explained in detail below.
2. Obtain the service base URL and AI Studio Community access token.
On this page, click "API" in the upper-left corner. Copy the `API_URL` corresponding to "Text Recognition (PP-OCRv5)", and remove the trailing endpoint (`/ocr`) to get the base URL of the service (e.g., `https://xxxxxx.aistudio-app.com`). Also copy the `TOKEN`, which is your access token. You may need to register and log in to your PaddlePaddle AI Studio Community account.
1. In the environment where the PaddleOCR inference server will run, start the server as described in the [PaddleOCR serving documentation](./serving.en.md).
2. Install `paddleocr-mcp` where the MCP server will run.
3. Refer to the configuration example below to modify the contents of the `claude_desktop_config.json` file. Set `PADDLEOCR_MCP_SERVER_URL` (e.g., `"http://127.0.0.1:8000"`).
Currently, starting the MCP server via `uvx` is also supported for the PaddleOCR official website mode, the self-hosted service mode, and (for CPU inference) the local Python library mode. With this approach, manual installation of `paddleocr-mcp` is not required. The main steps are as follows:
Because a different startup method is used (`uvx` pulls and runs the package on-demand), only `command` and `args` differ from earlier examples; available environment variables and CLI arguments remain identical.
- In the local Python library mode, the current tools cannot process Base64-encoded PDF document inputs.
- In the local Python library mode, the current tools do not infer the file type based on the model's `file_type` prompt, and may fail to process some complex URLs.
- For the PP-StructureV3 and PaddleOCR-VL pipelines, if the input file contains images, the returned results may include image content, which can significantly increase token usage. If image content is not needed, you can explicitly exclude it through prompts to reduce resource consumption.