19 Commits

Author SHA1 Message Date
Lin Manhui
de17179186
[Docs] Fix errors (#16005)
* Fix docs

* Update MCP docs

* Update English doc and fix docs
2025-07-14 11:44:37 +08:00
zhang-prog
38320f475c
sup func for pp_chatocrv4 (#15948) 2025-07-03 16:26:26 +08:00
zhang-prog
aeef330434
sup functions for pp_chatocrv4_doc (#15932)
* sup functions for pp_chatocrv4_doc

* sup parameters for pp_doctranslation
2025-07-02 15:39:53 +08:00
Lin Manhui
de0ecd466f
[Feat] Add PP-DocTranslate and update docs (#15890)
* Add PP-DocTranslate and update docs

* Fix docs

* Add English doc

* Fix ut
2025-06-28 23:13:32 +08:00
学卿
0a8a6354f1
support ppocrv5 minor languages (#15893)
* support ppocrv5 minor languages

* fixed bugs
2025-06-28 18:58:13 +08:00
Lin Manhui
766c4ad2d3
[Fix] Consider nested cases when converting AttrDict to built-in dicts (#15663)
* Fix dict conversion

* By default enable mkldnn

* By default do not save to file
2025-06-14 12:29:17 +08:00
Tingquan Gao
a016e5ec99
bugfix: some params ware missed when constructing pdx config (#15598) 2025-06-12 14:36:15 +08:00
Lin Manhui
803638d173
Fix PPStructureV3 CLI params (#15666) 2025-06-10 18:48:03 +08:00
timminator
ae21c3d3a9
Enhance OCR model selection logic (#15643)
If there is no model specified select the best available model
for the specified language

Fix #15642
2025-06-09 11:21:39 +08:00
Lin Manhui
fa621efe0e
Update PP-OCRv4 supported languages and default models (#15561)
Co-authored-by: Sam-gsj <Sam-gsj@users.noreply.github.com>
2025-06-04 13:56:15 +08:00
Lin Manhui
4a3f01e129
[WIP][Feat] Accommodate PaddleX MKL-DNN new behavior (#15471)
* Accommodate PaddleX MKL-DNN new behavior

* Bump paddlex version

* Use stderr
2025-06-01 18:27:52 +08:00
Lin Manhui
fd5b4e1049
[Feat] Support textline_orientation for chatocr and unify naming of text line orientation (#15337)
* Support textline_orientation for chatocr and unify naming of textline orientation

* Unify description

* Update documentation

* Fix serving doc
2025-05-30 17:31:30 +08:00
Lin Manhui
dab58b0377
Fix bugs and polish docs (#15304)
* Add missing API

* Polish documentation
2025-05-30 14:43:31 +08:00
Lin Manhui
878a076e98
Correct unexpected behavior when specifying both lang and model name (#15485) 2025-05-30 14:35:58 +08:00
guoshengjian
8601c4b457
Fix docs (#15303)
* Revise the English document

* Remove the device from the predict()

* replace general_formula_recognition_001.png

* modify docs and paddleocr/pp_structurev3.py

* modify docs

* modify docs load and use

* Modify GPU device

* modify docs

* modify docs_2

* modify docs-4

* modify docs-5
2025-05-28 10:24:37 +08:00
Lin Manhui
b25dcaae0e
Add deployment docs and enhance CLI (#15117)
* Add serving and hpi docs

* Optimize CLI logging info

* Update interface

* Add on-device deployment and onnx model conversion docs

* Enhance CLI

* _gen->_iter

* Fix CLI help message

* Update table_recognition_v2 and PP-StructureV3 interfaces

* Update installation doc

* Update interface

* Update interface

* Add logging doc

* Update default values

---------

Co-authored-by: cuicheng01 <45199522+cuicheng01@users.noreply.github.com>
2025-05-19 03:01:27 +08:00
Lin Manhui
849226cf7e
Change CLI subparser name and argument order (#15113)
* PP-StructureV3->pp_structurev3

* Change CLI args order

* Polish

* Change order:
2025-05-18 21:13:43 +08:00
zhang-prog
2357c63a9a
add new docs (pipelines, modules, legacy) (#15096)
* add ocr doc

* add docs

* fix

* add pipeline docs

* add module docs

* update the descriptions of parameters

* update

* update the description of  predict_iter

* update

* delete 2.2python脚本

* add char_recognition and region_detection

* modify in predict

* remove redundant 2.2 Python scripts

* modify use_wired_table_cells_trans_to_html

* add use_chart_recognition and use_region_detection

* add information

* add use_orc_model

* add legacy docs

* update

---------

Co-authored-by: guoshengjian <guoshengjian@baidu.com>
2025-05-18 21:09:53 +08:00
Lin Manhui
3d03ca5500
[Breaking][Feat] New PaddleOCR inference package (#15046)
* Init new paddleocr

* Remove unused dependency

* Fix typos

* Fix

* Add doc understanding modules

* Fix package finding

* Normalize name

* Fix setting bugs

* Fix setting bug

* Support single model inference

* Add PP-ChatOCRv4-doc

* Add pp_chatocrv4_doc tests

* Enable MKL-DNN when available

* add seal_text_detection modules

* add layout_detection and table_cells_detection modules

* add testing scripts

* Fix desc

* add text_image_unwarping and table_structure_recognition modules

* add formula_recognition and doc_vlm modules

* update formula_recognition default_model_name

* add MKLDNN_BLOCKLIST

* update MKLDNN log

* add seal rec pipeline

* fix sth

* fix sth

* add doc preprocessor pipeline

* fix sth

* add doc understanding

* add table_rec_v2, ppstructurev3, formula_rec pipelines

* move test files

* forward kwargs to pipeline.predict

* clean test files

* Add missing kwargs

* Fix typo

* Fix typo

* rerun CI

* update mkldnn BLOCKLIST

* update

* update warning message

* fix cli args

* update PIPELINE_MKLDNN_BLOCKLIST

* update  of  workflow

* skip resource_intensive tests

* update config

* skip ppdocbee test_predict_params

---------

Co-authored-by: zhangyue66 <zhangyue66@baidu.com>
Co-authored-by: zhangzelun <zhangzelun@baidu.com>
2025-05-04 15:59:02 +08:00