PaddleOCR/docs/version3.x/module_usage/textline_orientation_classification.en.md

---
comments: true
---

# Text Line Orientation Classification Module Tutorial

## 1. Overview
The text line orientation classification module primarily distinguishes the orientation of text lines and corrects them using post-processing. In processes such as document scanning and license/certificate photography, to capture clearer images, the capture device may be rotated, resulting in text lines in various orientations. Standard OCR pipelines cannot handle such data well. By utilizing image classification technology, the orientation of text lines can be predetermined and adjusted, thereby enhancing the accuracy of OCR processing.

## 2. Supported Model List

<table>
<thead>
<tr>
<th>Model</th><th>Model Download Link</th>
<th>Top-1 Accuracy (%)</th>
<th>GPU Inference Time (ms)<br/>[Normal Mode / High-Performance Mode]</th>
<th>CPU Inference Time (ms)</th>
<th>Model Storage Size (MB)</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>PP-LCNet_x0_25_textline_ori</td>
<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/PP-LCNet_x0_25_textline_ori_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/PP-LCNet_x0_25_textline_ori_pretrained.pdparams">Training Model</a></td>
<td>98.85</td>
<td>2.16 / 0.41</td>
<td>2.37 / 0.73</td>
<td>0.96</td>
<td>Text line classification model based on PP-LCNet_x0_25, with two classes: 0 degrees and 180 degrees</td>
</tr>
<tr>
<td>PP-LCNet_x1_0_textline_ori</td>
<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/PP-LCNet_x1_0_textline_ori_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/PP-LCNet_x1_0_textline_ori_pretrained.pdparams">Training Model</a></td>
<td>99.42</td>
<td>- / -</td>
<td>2.98 / 2.98</td>
<td>6.5</td>
<td>Text line classification model based on PP-LCNet_x1_0, with two classes: 0 degrees and 180 degrees</td>
</tr>
</tbody>
</table>

> ❗ **Note**: The text line orientation classification model was upgraded on May 26, 2025, and `PP-LCNet_x1_0_textline_ori` has been added. If you need to use the pre-upgrade model weights, please click the <a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/PP-LCNet_x0_25_textline_ori_infer.bak.tar">download link</a>.

<strong>Test Environment Description:</strong>

  <ul>
      <li><b>Performance Test Environment</b>
          <ul>
             <li><strong>Test Dataset：</strong> PaddleX Self-built Dataset, Covering Multiple Scenarios Such as Documents and Certificates, Containing 1000 Images.</li>
              <li><strong>Hardware Configuration:</strong>
                  <ul>
                      <li>GPU: NVIDIA Tesla T4</li>
                      <li>CPU: Intel Xeon Gold 6271C @ 2.60GHz</li>
                  </ul>
              </li>
              <li><strong>Software Environment:</strong>
                  <ul>
                      <li>Ubuntu 20.04 / CUDA 11.8 / cuDNN 8.9 / TensorRT 8.6.1.6</li>
                      <li>paddlepaddle 3.0.0 / paddleocr 3.0.3</li>
                  </ul>
              </li>
          </ul>
      </li>
      <li><b>Inference Mode Description</b></li>
  </ul>

<table border="1">
    <thead>
        <tr>
            <th>Mode</th>
            <th>GPU Configuration </th>
            <th>CPU Configuration </th>
            <th>Acceleration Technology Combination</th>
        </tr>
    </thead>
    <tbody>
        <tr>
            <td>Normal Mode</td>
            <td>FP32 Precision / No TRT Acceleration</td>
            <td>FP32 Precision / 8 Threads</td>
            <td>PaddleInference</td>
        </tr>
        <tr>
            <td>High-Performance Mode</td>
            <td>Optimal combination of pre-selected precision types and acceleration strategies</td>
            <td>FP32 Precision / 8 Threads</td>
            <td>Pre-selected optimal backend (Paddle/OpenVINO/TRT, etc.)</td>
        </tr>
    </tbody>
</table>

## 3. Quick Integration

> ❗ Before starting, please install the wheel package of PaddleOCR. For detailed instructions, refer to the [Installation Guide](../installation.en.md).

You can quickly experience the functionality with a single command:

```bash
paddleocr textline_orientation_classification -i https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/textline_rot180_demo.jpg
```  

<b>Note: </b>The official models would be download from HuggingFace by default. If can't access to HuggingFace, please set the environment variable `PADDLE_PDX_MODEL_SOURCE="BOS"` to change the model source to BOS. In the future, more model sources will be supported.

You can also integrate the text line orientation classification model into your project. Run the following code after downloading the [example image](https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/textline_rot180_demo.jpg) to your local machine. 

```bash
from paddleocr import TextLineOrientationClassification
model = TextLineOrientationClassification(model_name="PP-LCNet_x0_25_textline_ori")
output = model.predict("textline_rot180_demo.jpg",  batch_size=1)
for res in output:
    res.print(json_format=False)
    res.save_to_img("./output/demo.png")
    res.save_to_json("./output/res.json")
```

After running, the result obtained is:

```bash
{'res': {'input_path': 'textline_rot180_demo.jpg', 'page_index': None, 'class_ids': array([1], dtype=int32), 'scores': array([0.99864], dtype=float32), 'label_names': ['180_degree']}}
```

The meanings of the running results parameters are as follows:

- `input_path`：Indicates the path of the input image.
- `page_index`：If the input is a PDF file, it indicates the current page number of the PDF; otherwise, it is `None`.
- `class_ids`：Indicates the class ID of the prediction result.
- `scores`：Indicates the confidence score of the prediction result.
- `label_names`：Indicates the class name of the prediction result.
The visualization image is as follows:

<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/modules/textline_ori_classification/textline_rot180_demo_res.jpg">

The explanations for the methods, parameters, etc., are as follows:

* `TextLineOrientationClassification` instantiates a textline classification model (here, `PP-LCNet_x0_25_textline_ori` is used as an example), and the specific explanations are as follows:

<table>
<thead>
<tr>
<th>Parameter</th>
<th>Description</th>
<th>Type</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td><code>model_name</code></td>
<td>Model name. If set to <code>None</code>, <code>PP-LCNet_x0_25_textline_ori</code> will be used.</td>
<td><code>str|None</code></td>
<td><code>None</code></td>
</tr>
<tr>
<td><code>model_dir</code></td>
<td>Model storage path.</td>
<td><code>str|None</code></td>
<td><code>None</code></td>
</tr>
<tr>
<td><code>device</code></td>
<td>Device for inference.<br/>
<b>For example:</b> <code>"cpu"</code>, <code>"gpu"</code>, <code>"npu"</code>, <code>"gpu:0"</code>, <code>"gpu:0,1"</code>.<br/>
If multiple devices are specified, parallel inference will be performed.<br/>
By default, GPU 0 is used if available; otherwise, CPU is used.
</td>
<td><code>str|None</code></td>
<td><code>None</code></td>
</tr>
<tr>
<td><code>enable_hpi</code></td>
<td>Whether to enable high-performance inference.</td>
<td><code>bool</code></td>
<td><code>False</code></td>
</tr>
<tr>
<td><code>use_tensorrt</code></td>
<td>Whether to use the Paddle Inference TensorRT subgraph engine. If the model does not support acceleration through TensorRT, setting this flag will not enable acceleration.<br/>
For Paddle with CUDA version 11.8, the compatible TensorRT version is 8.x (x>=6), and it is recommended to install TensorRT 8.6.1.6.<br/>
For Paddle with CUDA version 12.6, the compatible TensorRT version is 10.x (x>=5), and it is recommended to install TensorRT 10.5.0.18.
</td>
<td><code>bool</code></td>
<td><code>False</code></td>
</tr>
<tr>
<td><code>precision</code></td>
<td>Computation precision when using the TensorRT subgraph engine in Paddle Inference.<br/><b>Options:</b> <code>"fp32"</code>, <code>"fp16"</code>.</td>
<td><code>str</code></td>
<td><code>"fp32"</code></td>
</tr>
<tr>
<td><code>enable_mkldnn</code></td>
<td>
Whether to enable MKL-DNN acceleration for inference. If MKL-DNN is unavailable or the model does not support it, acceleration will not be used even if this flag is set.
</td>
<td><code>bool</code></td>
<td><code>True</code></td>
</tr>
<tr>
<td><code>mkldnn_cache_capacity</code></td>
<td>
MKL-DNN cache capacity.
</td>
<td><code>int</code></td>
<td><code>10</code></td>
</tr>
<tr>
<td><code>cpu_threads</code></td>
<td>Number of threads to use for inference on CPUs.</td>
<td><code>int</code></td>
<td><code>10</code></td>
</tr>
</tbody>
</table>

* Use the `predict()` method of the text line direction classification model to perform inference. This method returns a list of results. In addition, this module also provides the `predict_iter()` method. Both methods accept the same parameters and return the same result format. The difference is that `predict_iter()` returns a `generator`, which processes and retrieves prediction results step by step. It is suitable for handling large datasets or memory-efficient scenarios. You can choose either method based on your actual needs. The `predict()` method accepts the parameters `input` and `batch_size`, which are described in detail below:
<table>
<thead>
<tr>
<th>Parameter</th>
<th>Description</th>
<th>Type</th>
<th>Default</th>
</tr>
</thead>
<tr>
<td><code>input</code></td>
<td>Input data to be predicted. Required. Supports multiple input types:<ul>
<li><b>Python Var</b>: e.g., <code>numpy.ndarray</code> representing image data</li>
<li><b>str</b>: 
  - Local image or PDF file path: <code>/root/data/img.jpg</code>;
  - <b>URL</b> of image or PDF file: e.g., <a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_doc_preprocessor_002.png">example</a>;
  - <b>Local directory</b>: directory containing images for prediction, e.g., <code>/root/data/</code> (Note: directories containing PDF files are not supported; PDFs must be specified by exact file path)</li>
<li><b>list</b>: Elements must be of the above types, e.g., <code>[numpy.ndarray, numpy.ndarray]</code>, <code>["/root/data/img1.jpg", "/root/data/img2.jpg"]</code>, <code>["/root/data1", "/root/data2"]</code></li>
</ul>
</td>
<td><code>Python Var|str|list</code></td>
<td></td>
</tr>
<tr>
<td><code>batch_size</code></td>
<td>Batch size,  positive integer.</td>
<td><code>int</code></td>
<td>1</td>
</tr>
</table>

* Call the `predict()` method of the text line orientation classification model for inference. This method will return a list of results. In addition, this module also provides a `predict_iter()` method. Both methods accept the same parameters and return the same results, but `predict_iter()` returns a `generator`, which is more suitable for processing large datasets or when you want to save memory. You can choose either method according to your needs. The parameters of the `predict()` method are `input` and `batch_size`, as described below:


<table>
<thead>
<tr>
<th>Parameter</th>
<th>Parameter Description</th>
<th>Parameter Type</th>
<th>Options</th>
<th>Default Value</th>
</tr>
</thead>
<tr>
<td><code>input</code></td>
<td>Data to be predicted, supporting multiple input types</td>
<td><code>Python Var|str|list</code></td>
<td>
<ul>
  <li><b>Python variable</b>, such as image data represented by <code>numpy.ndarray</code></li>
  <li><b>File path</b>, such as the local path of an image file: <code>/root/data/img.jpg</code></li>
  <li><b>URL link</b>, such as the network URL of an image file: <a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/textline_rot180_demo.jpg">Example</a></li>
  <li><b>Local directory</b>, the directory should contain data files to be predicted, such as the local path: <code>/root/data/</code></li>
  <li><b>list</b>, the elements of the list should be of the above-mentioned data types, such as <code>[numpy.ndarray, numpy.ndarray]</code>, <code>[\"/root/data/img1.jpg\", \"/root/data/img2.jpg\"]</code>, <code>[\"/root/data1\", \"/root/data2\"]</code></li>
</ul>
</td>
<td>None</td>
</tr>
<tr>
<td><code>batch_size</code></td>
<td>Batch size</td>
<td><code>int</code></td>
<td>Any integer</td>
<td>1</td>
</tr>
</table>

* The prediction results are processed, and the prediction result for each sample is of type `dict`. It supports operations such as printing, saving as an image, and saving as a `json` file:

<table>
<thead>
<tr>
<th>Method</th>
<th>Method Description</th>
<th>Parameter</th>
<th>Parameter Type</th>
<th>Parameter Description</th>
<th>Default Value</th>
</tr>
</thead>
<tr>
<td rowspan="3"><code>print()</code></td>
<td rowspan="3">Print the results to the terminal</td>
<td><code>format_json</code></td>
<td><code>bool</code></td>
<td>Whether to format the output content using <code>JSON</code> indentation</td>
<td><code>True</code></td>
</tr>
<tr>
<td><code>indent</code></td>
<td><code>int</code></td>
<td>Specify the indentation level to beautify the output <code>JSON</code> data, making it more readable, only effective when <code>format_json</code> is <code>True</code></td>
<td>4</td>
</tr>
<tr>
<td><code>ensure_ascii</code></td>
<td><code>bool</code></td>
<td>Control whether to escape non-<code>ASCII</code> characters to <code>Unicode</code>. If set to <code>True</code>, all non-<code>ASCII</code> characters will be escaped; <code>False</code> retains the original characters, only effective when <code>format_json</code> is <code>True</code></td>
<td><code>False</code></td>
</tr>
<tr>
<td rowspan="3"><code>save_to_json()</code></td>
<td rowspan="3">Save the results as a JSON file</td>
<td><code>save_path</code></td>
<td><code>str</code></td>
<td>The path to save the file. If it is a directory, the saved file name will be consistent with the input file name</td>
<td>None</td>
</tr>
<tr>
<td><code>indent</code></td>
<td><code>int</code></td>
<td>Specify the indentation level to beautify the output <code>JSON</code> data, making it more readable, only effective when <code>format_json</code> is <code>True</code></td>
<td>4</td>
</tr>
<tr>
<td><code>ensure_ascii</code></td>
<td><code>bool</code></td>
<td>Control whether to escape non-<code>ASCII</code> characters to <code>Unicode</code>. If set to <code>True</code>, all non-<code>ASCII</code> characters will be escaped; <code>False</code> retains the original characters, only effective when <code>format_json</code> is <code>True</code></td>
<td><code>False</code></td>
</tr>
<tr>
<td><code>save_to_img()</code></td>
<td>Save the results as an image file</td>
<td><code>save_path</code></td>
<td><code>str</code></td>
<td>The path to save the file. If it is a directory, the saved file name will be consistent with the input file name</td>
<td>None</td>
</tr>
</table>

* Additionally, it supports obtaining the visualization image with results and the prediction results through attributes, as follows:

<table>
<thead>
<tr>
<th>Attribute</th>
<th>Attribute Description</th>
</tr>
</thead>
<tr>
<td rowspan="1"><code>json</code></td>
<td rowspan="1">Get the prediction result in <code>json</code> format</td>
</tr>
<tr>
<td rowspan="1"><code>img</code></td>
<td rowspan="1">Get the visualization image in <code>dict</code> format</td>
</tr>
</table>

## 4. Custom Development  

Since PaddleOCR does not natively support training for text line orientation classification, refer to [PaddleX's Custom Development Guide](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/ocr_modules/textline_orientation_classification.html#iv-custom-development) for training. Trained models can seamlessly integrate into PaddleOCR's API for inference.
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								---
 								comments: true
 								---
 								# Text Line Orientation Classification Module Tutorial
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								## 1. Overview
 								The text line orientation classification module primarily distinguishes the orientation of text lines and corrects them using post-processing. In processes such as document scanning and license/certificate photography, to capture clearer images, the capture device may be rotated, resulting in text lines in various orientations. Standard OCR pipelines cannot handle such data well. By utilizing image classification technology, the orientation of text lines can be predetermined and adjusted, thereby enhancing the accuracy of OCR processing.
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								## 2. Supported Model List
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
 								<table>
 								<thead>
 								<tr>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<th>Model</th><th>Model Download Link</th>
 								<th>Top-1 Accuracy (%)</th>
 								<th>GPU Inference Time (ms)<br/>[Normal Mode / High-Performance Mode]</th>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								<th>CPU Inference Time (ms)</th>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<th>Model Storage Size (MB)</th>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								<th>Description</th>
 								</tr>
 								</thead>
 								<tbody>
 								<tr>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td>PP-LCNet_x0_25_textline_ori</td>
 								<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/PP-LCNet_x0_25_textline_ori_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/PP-LCNet_x0_25_textline_ori_pretrained.pdparams">Training Model</a></td>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<td>98.85</td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td>2.16 / 0.41</td>
 								<td>2.37 / 0.73</td>
-												fix textline docs (#15508)

Co-authored-by: bukejiyu <395822456@qq.com>
											
										
										
											2025-05-30 16:15:55 +08:00
+								<td>0.96</td>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<td>Text line classification model based on PP-LCNet_x0_25, with two classes: 0 degrees and 180 degrees</td>
 								</tr>
 								<tr>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td>PP-LCNet_x1_0_textline_ori</td>
 								<td><a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/PP-LCNet_x1_0_textline_ori_infer.tar">Inference Model</a>/<a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_pretrained_model/PP-LCNet_x1_0_textline_ori_pretrained.pdparams">Training Model</a></td>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<td>99.42</td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td>- / -</td>
 								<td>2.98 / 2.98</td>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<td>6.5</td>
 								<td>Text line classification model based on PP-LCNet_x1_0, with two classes: 0 degrees and 180 degrees</td>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								</tr>
 								</tbody>
 								</table>
-												fix textline docs (#15508)

Co-authored-by: bukejiyu <395822456@qq.com>
											
										
										
											2025-05-30 16:15:55 +08:00
+								> ❗ **Note**: The text line orientation classification model was upgraded on May 26, 2025, and `PP-LCNet_x1_0_textline_ori` has been added. If you need to use the pre-upgrade model weights, please click the <a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/official_inference_model/paddle3.0.0/PP-LCNet_x0_25_textline_ori_infer.bak.tar">download link</a>.
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
 								<strong>Test Environment Description:</strong>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
 								  <ul>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								      <li><b>Performance Test Environment</b>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								          <ul>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								             <li><strong>Test Dataset：</strong> PaddleX Self-built Dataset, Covering Multiple Scenarios Such as Documents and Certificates, Containing 1000 Images.</li>
-												[docs] Modify test environment (#15914)

* modify test environment

* modify test environment

* modify test environment
											
										
										
											2025-06-30 20:26:05 +08:00
+								              <li><strong>Hardware Configuration:</strong>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								                  <ul>
 								                      <li>GPU: NVIDIA Tesla T4</li>
 								                      <li>CPU: Intel Xeon Gold 6271C @ 2.60GHz</li>
-												[docs] Modify test environment (#15914)

* modify test environment

* modify test environment

* modify test environment
											
										
										
											2025-06-30 20:26:05 +08:00
+								                  </ul>
 								              </li>
 								              <li><strong>Software Environment:</strong>
 								                  <ul>
 								                      <li>Ubuntu 20.04 / CUDA 11.8 / cuDNN 8.9 / TensorRT 8.6.1.6</li>
 								                      <li>paddlepaddle 3.0.0 / paddleocr 3.0.3</li>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								                  </ul>
 								              </li>
 								          </ul>
 								      </li>
 								      <li><b>Inference Mode Description</b></li>
 								  </ul>
 								<table border="1">
 								    <thead>
 								        <tr>
 								            <th>Mode</th>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								            <th>GPU Configuration </th>
 								            <th>CPU Configuration </th>
 								            <th>Acceleration Technology Combination</th>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								        </tr>
 								    </thead>
 								    <tbody>
 								        <tr>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								            <td>Normal Mode</td>
 								            <td>FP32 Precision / No TRT Acceleration</td>
 								            <td>FP32 Precision / 8 Threads</td>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								            <td>PaddleInference</td>
 								        </tr>
 								        <tr>
 								            <td>High-Performance Mode</td>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								            <td>Optimal combination of pre-selected precision types and acceleration strategies</td>
 								            <td>FP32 Precision / 8 Threads</td>
 								            <td>Pre-selected optimal backend (Paddle/OpenVINO/TRT, etc.)</td>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								        </tr>
 								    </tbody>
 								</table>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								## 3. Quick Integration
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								> ❗ Before starting, please install the wheel package of PaddleOCR. For detailed instructions, refer to the [Installation Guide](../installation.en.md).
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								You can quickly experience the functionality with a single command:
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
 								```bash
-												[Feat] Support `textline_orientation` for chatocr and unify naming of text line orientation (#15337)

* Support textline_orientation for chatocr and unify naming of textline orientation

* Unify description

* Update documentation

* Fix serving doc
											
										
										
											2025-05-30 17:31:30 +08:00
+								paddleocr textline_orientation_classification -i https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/textline_rot180_demo.jpg
 								```
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												tip the change of model source (#15726)


											
										
										
											2025-06-14 19:36:55 +08:00
+								<b>Note: </b>The official models would be download from HuggingFace by default. If can't access to HuggingFace, please set the environment variable `PADDLE_PDX_MODEL_SOURCE="BOS"` to change the model source to BOS. In the future, more model sources will be supported.
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								You can also integrate the text line orientation classification model into your project. Run the following code after downloading the [example image](https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/textline_rot180_demo.jpg) to your local machine.
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								```bash
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								from paddleocr import TextLineOrientationClassification
 								model = TextLineOrientationClassification(model_name="PP-LCNet_x0_25_textline_ori")
 								output = model.predict("textline_rot180_demo.jpg",  batch_size=1)
 								for res in output:
 								    res.print(json_format=False)
 								    res.save_to_img("./output/demo.png")
 								    res.save_to_json("./output/res.json")
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								```
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								After running, the result obtained is:
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
 								```bash
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								{'res': {'input_path': 'textline_rot180_demo.jpg', 'page_index': None, 'class_ids': array([1], dtype=int32), 'scores': array([0.99864], dtype=float32), 'label_names': ['180_degree']}}
 								```
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								The meanings of the running results parameters are as follows:
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								- `input_path`：Indicates the path of the input image.
 								- `page_index`：If the input is a PDF file, it indicates the current page number of the PDF; otherwise, it is `None`.
 								- `class_ids`：Indicates the class ID of the prediction result.
 								- `scores`：Indicates the confidence score of the prediction result.
 								- `label_names`：Indicates the class name of the prediction result.
 								The visualization image is as follows:
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<img src="https://raw.githubusercontent.com/cuicheng01/PaddleX_doc_images/refs/heads/main/images/modules/textline_ori_classification/textline_rot180_demo_res.jpg">
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								The explanations for the methods, parameters, etc., are as follows:
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								* `TextLineOrientationClassification` instantiates a textline classification model (here, `PP-LCNet_x0_25_textline_ori` is used as an example), and the specific explanations are as follows:
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
 								<table>
 								<thead>
 								<tr>
 								<th>Parameter</th>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								<th>Description</th>
 								<th>Type</th>
 								<th>Default</th>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								</tr>
 								</thead>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								<tbody>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								<tr>
 								<td><code>model_name</code></td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td>Model name. If set to <code>None</code>, <code>PP-LCNet_x0_25_textline_ori</code> will be used.</td>
 								<td><code>str|None</code></td>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								<td><code>None</code></td>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								</tr>
 								<tr>
 								<td><code>model_dir</code></td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td>Model storage path.</td>
 								<td><code>str|None</code></td>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								<td><code>None</code></td>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								</tr>
 								<tr>
 								<td><code>device</code></td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td>Device for inference.<br/>
 								<b>For example:</b> <code>"cpu"</code>, <code>"gpu"</code>, <code>"npu"</code>, <code>"gpu:0"</code>, <code>"gpu:0,1"</code>.<br/>
 								If multiple devices are specified, parallel inference will be performed.<br/>
 								By default, GPU 0 is used if available; otherwise, CPU is used.
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								</td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td><code>str|None</code></td>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								<td><code>None</code></td>
 								</tr>
 								<tr>
 								<td><code>enable_hpi</code></td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td>Whether to enable high-performance inference.</td>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								<td><code>bool</code></td>
 								<td><code>False</code></td>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								</tr>
 								<tr>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								<td><code>use_tensorrt</code></td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td>Whether to use the Paddle Inference TensorRT subgraph engine. If the model does not support acceleration through TensorRT, setting this flag will not enable acceleration.<br/>
 								For Paddle with CUDA version 11.8, the compatible TensorRT version is 8.x (x>=6), and it is recommended to install TensorRT 8.6.1.6.<br/>
-												update descriptions for use_hpip and use_tensorrt settings (#15616)

* update descriptions for use_hpip and use_tensorrt settings

* update

* update
											
										
										
											2025-06-09 14:59:41 +08:00
+								For Paddle with CUDA version 12.6, the compatible TensorRT version is 10.x (x>=5), and it is recommended to install TensorRT 10.5.0.18.
 								</td>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								<td><code>bool</code></td>
 								<td><code>False</code></td>
 								</tr>
 								<tr>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								<td><code>precision</code></td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td>Computation precision when using the TensorRT subgraph engine in Paddle Inference.<br/><b>Options:</b> <code>"fp32"</code>, <code>"fp16"</code>.</td>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								<td><code>str</code></td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td><code>"fp32"</code></td>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								</tr>
 								<tr>
 								<td><code>enable_mkldnn</code></td>
 								<td>
-												Polish  description (#15610)


											
										
										
											2025-06-09 10:53:44 +08:00
+								Whether to enable MKL-DNN acceleration for inference. If MKL-DNN is unavailable or the model does not support it, acceleration will not be used even if this flag is set.
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								</td>
 								<td><code>bool</code></td>
 								<td><code>True</code></td>
 								</tr>
 								<tr>
-												[Feat] Support setting mkldnn_cache_capacity (#15660)

* Support setting mkldnn_cache_capacity

* Update package description to sync with repo

* Remove min_subgraph_size
											
										
										
											2025-06-12 21:00:47 +08:00
+								<td><code>mkldnn_cache_capacity</code></td>
 								<td>
 								MKL-DNN cache capacity.
 								</td>
 								<td><code>int</code></td>
 								<td><code>10</code></td>
 								</tr>
 								<tr>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								<td><code>cpu_threads</code></td>
 								<td>Number of threads to use for inference on CPUs.</td>
 								<td><code>int</code></td>
 								<td><code>10</code></td>
 								</tr>
 								</tbody>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								</table>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								* Use the `predict()` method of the text line direction classification model to perform inference. This method returns a list of results. In addition, this module also provides the `predict_iter()` method. Both methods accept the same parameters and return the same result format. The difference is that `predict_iter()` returns a `generator`, which processes and retrieves prediction results step by step. It is suitable for handling large datasets or memory-efficient scenarios. You can choose either method based on your actual needs. The `predict()` method accepts the parameters `input` and `batch_size`, which are described in detail below:
 								<table>
 								<thead>
 								<tr>
 								<th>Parameter</th>
 								<th>Description</th>
 								<th>Type</th>
 								<th>Default</th>
 								</tr>
 								</thead>
 								<tr>
 								<td><code>input</code></td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td>Input data to be predicted. Required. Supports multiple input types:<ul>
 								<li><b>Python Var</b>: e.g., <code>numpy.ndarray</code> representing image data</li>
 								<li><b>str</b>:
 								  - Local image or PDF file path: <code>/root/data/img.jpg</code>;
 								  - <b>URL</b> of image or PDF file: e.g., <a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/general_doc_preprocessor_002.png">example</a>;
 								  - <b>Local directory</b>: directory containing images for prediction, e.g., <code>/root/data/</code> (Note: directories containing PDF files are not supported; PDFs must be specified by exact file path)</li>
 								<li><b>list</b>: Elements must be of the above types, e.g., <code>[numpy.ndarray, numpy.ndarray]</code>, <code>["/root/data/img1.jpg", "/root/data/img2.jpg"]</code>, <code>["/root/data1", "/root/data2"]</code></li>
-												update api and doc (#15378)

* fix email for all commit

* update mkldnn default value and description

* fix docs en description
											
										
										
											2025-05-30 13:50:57 +08:00
+								</ul>
 								</td>
 								<td><code>Python Var|str|list</code></td>
 								<td></td>
 								</tr>
 								<tr>
 								<td><code>batch_size</code></td>
 								<td>Batch size,  positive integer.</td>
 								<td><code>int</code></td>
 								<td>1</td>
 								</tr>
 								</table>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
 								* Call the `predict()` method of the text line orientation classification model for inference. This method will return a list of results. In addition, this module also provides a `predict_iter()` method. Both methods accept the same parameters and return the same results, but `predict_iter()` returns a `generator`, which is more suitable for processing large datasets or when you want to save memory. You can choose either method according to your needs. The parameters of the `predict()` method are `input` and `batch_size`, as described below:
 								<table>
 								<thead>
 								<tr>
 								<th>Parameter</th>
 								<th>Parameter Description</th>
 								<th>Parameter Type</th>
 								<th>Options</th>
 								<th>Default Value</th>
 								</tr>
 								</thead>
 								<tr>
 								<td><code>input</code></td>
 								<td>Data to be predicted, supporting multiple input types</td>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								<td><code>Python Var|str|list</code></td>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<td>
 								<ul>
 								  <li><b>Python variable</b>, such as image data represented by <code>numpy.ndarray</code></li>
 								  <li><b>File path</b>, such as the local path of an image file: <code>/root/data/img.jpg</code></li>
 								  <li><b>URL link</b>, such as the network URL of an image file: <a href="https://paddle-model-ecology.bj.bcebos.com/paddlex/imgs/demo_image/textline_rot180_demo.jpg">Example</a></li>
 								  <li><b>Local directory</b>, the directory should contain data files to be predicted, such as the local path: <code>/root/data/</code></li>
-												[docs] Update Performance Statistics in Docs (#15819)

* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
											
										
										
											2025-06-26 20:32:46 +08:00
+								  <li><b>list</b>, the elements of the list should be of the above-mentioned data types, such as <code>[numpy.ndarray, numpy.ndarray]</code>, <code>[\"/root/data/img1.jpg\", \"/root/data/img2.jpg\"]</code>, <code>[\"/root/data1\", \"/root/data2\"]</code></li>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								</ul>
 								</td>
 								<td>None</td>
 								</tr>
 								<tr>
 								<td><code>batch_size</code></td>
 								<td>Batch size</td>
 								<td><code>int</code></td>
 								<td>Any integer</td>
 								<td>1</td>
 								</tr>
 								</table>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								* The prediction results are processed, and the prediction result for each sample is of type `dict`. It supports operations such as printing, saving as an image, and saving as a `json` file:
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
 								<table>
 								<thead>
 								<tr>
 								<th>Method</th>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<th>Method Description</th>
 								<th>Parameter</th>
 								<th>Parameter Type</th>
 								<th>Parameter Description</th>
 								<th>Default Value</th>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								</tr>
 								</thead>
 								<tr>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<td rowspan="3"><code>print()</code></td>
 								<td rowspan="3">Print the results to the terminal</td>
 								<td><code>format_json</code></td>
 								<td><code>bool</code></td>
 								<td>Whether to format the output content using <code>JSON</code> indentation</td>
 								<td><code>True</code></td>
 								</tr>
 								<tr>
 								<td><code>indent</code></td>
 								<td><code>int</code></td>
 								<td>Specify the indentation level to beautify the output <code>JSON</code> data, making it more readable, only effective when <code>format_json</code> is <code>True</code></td>
 								<td>4</td>
 								</tr>
 								<tr>
 								<td><code>ensure_ascii</code></td>
 								<td><code>bool</code></td>
 								<td>Control whether to escape non-<code>ASCII</code> characters to <code>Unicode</code>. If set to <code>True</code>, all non-<code>ASCII</code> characters will be escaped; <code>False</code> retains the original characters, only effective when <code>format_json</code> is <code>True</code></td>
 								<td><code>False</code></td>
 								</tr>
 								<tr>
 								<td rowspan="3"><code>save_to_json()</code></td>
 								<td rowspan="3">Save the results as a JSON file</td>
 								<td><code>save_path</code></td>
 								<td><code>str</code></td>
 								<td>The path to save the file. If it is a directory, the saved file name will be consistent with the input file name</td>
 								<td>None</td>
 								</tr>
 								<tr>
 								<td><code>indent</code></td>
 								<td><code>int</code></td>
 								<td>Specify the indentation level to beautify the output <code>JSON</code> data, making it more readable, only effective when <code>format_json</code> is <code>True</code></td>
 								<td>4</td>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								</tr>
 								<tr>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<td><code>ensure_ascii</code></td>
 								<td><code>bool</code></td>
 								<td>Control whether to escape non-<code>ASCII</code> characters to <code>Unicode</code>. If set to <code>True</code>, all non-<code>ASCII</code> characters will be escaped; <code>False</code> retains the original characters, only effective when <code>format_json</code> is <code>True</code></td>
 								<td><code>False</code></td>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								</tr>
 								<tr>
 								<td><code>save_to_img()</code></td>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<td>Save the results as an image file</td>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								<td><code>save_path</code></td>
 								<td><code>str</code></td>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								<td>The path to save the file. If it is a directory, the saved file name will be consistent with the input file name</td>
 								<td>None</td>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
+								</tr>
 								</table>
-												add textline model for docs (#15409)

Co-authored-by: zhangyubo0722 <zangyubo0722@163.com>
											
										
										
											2025-05-27 22:42:09 +08:00
+								* Additionally, it supports obtaining the visualization image with results and the prediction results through attributes, as follows:
 								<table>
 								<thead>
 								<tr>
 								<th>Attribute</th>
 								<th>Attribute Description</th>
 								</tr>
 								</thead>
 								<tr>
 								<td rowspan="1"><code>json</code></td>
 								<td rowspan="1">Get the prediction result in <code>json</code> format</td>
 								</tr>
 								<tr>
 								<td rowspan="1"><code>img</code></td>
 								<td rowspan="1">Get the visualization image in <code>dict</code> format</td>
 								</tr>
 								</table>
-												add ocr v5 metric (#15196)


											
										
										
											2025-05-20 02:37:26 +08:00
 								## 4. Custom Development
-												fix_docs_add_links (#15214)

* fix_docs_add_links

* fix_docs_add_links
											
										
										
											2025-05-20 15:56:36 +08:00
+								Since PaddleOCR does not natively support training for text line orientation classification, refer to [PaddleX's Custom Development Guide](https://paddlepaddle.github.io/PaddleX/latest/en/module_usage/tutorials/ocr_modules/textline_orientation_classification.html#iv-custom-development) for training. Trained models can seamlessly integrate into PaddleOCR's API for inference.