27 Commits

Author SHA1 Message Date
guoshengjian
c0bb8c4faa
[docs] Update Performance Statistics in Docs (#15819)
* Fix docs

* update performance

* update performance

* Fix docs

* update performance

* update performance

* Update docs

* Update tensorrt docs

* update performance

* update performance

* update performance

* update performance

* update performance

* modify chart recognition

* modify seal recognition

* Add resource notice

* Fix mcp docs

* Fix doc

* Fix name

* update performance

* Fix docs

* Fix docs

* Refactor MCP server docs

---------

Co-authored-by: Bobholamovic <mhlin425@whu.edu.cn>
Co-authored-by: Bobholamovic <bob1998425@hotmail.com>
2025-06-26 20:32:46 +08:00
Lin Manhui
86076a3746
Explicitly setting MKL-DNN (#15790) 2025-06-24 22:55:56 +08:00
Lin Manhui
bbfa08b25b
[Feat] Accommodate PaddleX 3.0.2 changes (#15745)
* Fix default run_mode

* Bump paddlex version

* Fix typo
2025-06-17 21:29:18 +08:00
Lin Manhui
766c4ad2d3
[Fix] Consider nested cases when converting AttrDict to built-in dicts (#15663)
* Fix dict conversion

* By default enable mkldnn

* By default do not save to file
2025-06-14 12:29:17 +08:00
Lin Manhui
966c11e20b
[Feat] Support setting mkldnn_cache_capacity (#15660)
* Support setting mkldnn_cache_capacity

* Update package description to sync with repo

* Remove min_subgraph_size
2025-06-12 21:00:47 +08:00
Tingquan Gao
a016e5ec99
bugfix: some params ware missed when constructing pdx config (#15598) 2025-06-12 14:36:15 +08:00
Lin Manhui
803638d173
Fix PPStructureV3 CLI params (#15666) 2025-06-10 18:48:03 +08:00
timminator
ae21c3d3a9
Enhance OCR model selection logic (#15643)
If there is no model specified select the best available model
for the specified language

Fix #15642
2025-06-09 11:21:39 +08:00
Lin Manhui
b0e622937a
Polish description (#15610) 2025-06-09 10:53:44 +08:00
Lin Manhui
fa621efe0e
Update PP-OCRv4 supported languages and default models (#15561)
Co-authored-by: Sam-gsj <Sam-gsj@users.noreply.github.com>
2025-06-04 13:56:15 +08:00
Lin Manhui
4a3f01e129
[WIP][Feat] Accommodate PaddleX MKL-DNN new behavior (#15471)
* Accommodate PaddleX MKL-DNN new behavior

* Bump paddlex version

* Use stderr
2025-06-01 18:27:52 +08:00
Lin Manhui
fd5b4e1049
[Feat] Support textline_orientation for chatocr and unify naming of text line orientation (#15337)
* Support textline_orientation for chatocr and unify naming of textline orientation

* Unify description

* Update documentation

* Fix serving doc
2025-05-30 17:31:30 +08:00
Lin Manhui
dab58b0377
Fix bugs and polish docs (#15304)
* Add missing API

* Polish documentation
2025-05-30 14:43:31 +08:00
Lin Manhui
6cbf707536
[Fix] Set subcommand as required (#15281)
* Set subcommand as required

* Update
2025-05-30 14:43:24 +08:00
Lin Manhui
878a076e98
Correct unexpected behavior when specifying both lang and model name (#15485) 2025-05-30 14:35:58 +08:00
guoshengjian
8601c4b457
Fix docs (#15303)
* Revise the English document

* Remove the device from the predict()

* replace general_formula_recognition_001.png

* modify docs and paddleocr/pp_structurev3.py

* modify docs

* modify docs load and use

* Modify GPU device

* modify docs

* modify docs_2

* modify docs-4

* modify docs-5
2025-05-28 10:24:37 +08:00
Lin Manhui
ff90dc7eea
Update default models (#15329) 2025-05-23 11:52:04 +08:00
Lin Manhui
eac5578fe2
[Docs] Add upgrade notes and fix docs (#15198)
* Unify refs

* Fix extra newline

* Update mkldnn_blocklists

* Add upgrade notes

* Fix docs

* Add English upgrade notes

* Bump paddlex to 3.0.0

* Update upgrade notes
2025-05-20 15:13:40 +08:00
liuhongen1234567
63823169eb
fix doc in formula (#15191)
* fix doc in formula

* fix doc2
2025-05-20 00:28:19 +08:00
Zhang Zelun
1788b8633f
add ocr docs (#15173)
* add ocr docs

* fix sth
2025-05-19 23:02:27 +08:00
cuicheng01
23e68a0d2b
update docs for spliting 2.x & 3.x (#15167)
* update docs

* update

* update

* update

* update for pre-commit
2025-05-19 15:27:13 +08:00
Lin Manhui
b25dcaae0e
Add deployment docs and enhance CLI (#15117)
* Add serving and hpi docs

* Optimize CLI logging info

* Update interface

* Add on-device deployment and onnx model conversion docs

* Enhance CLI

* _gen->_iter

* Fix CLI help message

* Update table_recognition_v2 and PP-StructureV3 interfaces

* Update installation doc

* Update interface

* Update interface

* Add logging doc

* Update default values

---------

Co-authored-by: cuicheng01 <45199522+cuicheng01@users.noreply.github.com>
2025-05-19 03:01:27 +08:00
zhang-prog
3d594881e4
use mixin for code reuse in text_detection and seal_text_detection modules (#15109)
* use mixin for code reuse in text_detection and seal_text_detection modules

* fix input param of doc_vlm CLI

* fix
2025-05-18 21:19:31 +08:00
Lin Manhui
849226cf7e
Change CLI subparser name and argument order (#15113)
* PP-StructureV3->pp_structurev3

* Change CLI args order

* Polish

* Change order:
2025-05-18 21:13:43 +08:00
Lin Manhui
a4fdd4dbdb
Add dep installation CLI command (#15103) 2025-05-18 21:12:31 +08:00
zhang-prog
2357c63a9a
add new docs (pipelines, modules, legacy) (#15096)
* add ocr doc

* add docs

* fix

* add pipeline docs

* add module docs

* update the descriptions of parameters

* update

* update the description of  predict_iter

* update

* delete 2.2python脚本

* add char_recognition and region_detection

* modify in predict

* remove redundant 2.2 Python scripts

* modify use_wired_table_cells_trans_to_html

* add use_chart_recognition and use_region_detection

* add information

* add use_orc_model

* add legacy docs

* update

---------

Co-authored-by: guoshengjian <guoshengjian@baidu.com>
2025-05-18 21:09:53 +08:00
Lin Manhui
3d03ca5500
[Breaking][Feat] New PaddleOCR inference package (#15046)
* Init new paddleocr

* Remove unused dependency

* Fix typos

* Fix

* Add doc understanding modules

* Fix package finding

* Normalize name

* Fix setting bugs

* Fix setting bug

* Support single model inference

* Add PP-ChatOCRv4-doc

* Add pp_chatocrv4_doc tests

* Enable MKL-DNN when available

* add seal_text_detection modules

* add layout_detection and table_cells_detection modules

* add testing scripts

* Fix desc

* add text_image_unwarping and table_structure_recognition modules

* add formula_recognition and doc_vlm modules

* update formula_recognition default_model_name

* add MKLDNN_BLOCKLIST

* update MKLDNN log

* add seal rec pipeline

* fix sth

* fix sth

* add doc preprocessor pipeline

* fix sth

* add doc understanding

* add table_rec_v2, ppstructurev3, formula_rec pipelines

* move test files

* forward kwargs to pipeline.predict

* clean test files

* Add missing kwargs

* Fix typo

* Fix typo

* rerun CI

* update mkldnn BLOCKLIST

* update

* update warning message

* fix cli args

* update PIPELINE_MKLDNN_BLOCKLIST

* update  of  workflow

* skip resource_intensive tests

* update config

* skip ppdocbee test_predict_params

---------

Co-authored-by: zhangyue66 <zhangyue66@baidu.com>
Co-authored-by: zhangzelun <zhangzelun@baidu.com>
2025-05-04 15:59:02 +08:00