mv layout and doc vqa dataset to docs/dataset

2025-12-28 15:38:18 +00:00 · 2022-04-27 19:36:23 +08:00 · 2022-04-27 19:36:23 +08:00 · 0d6a48625e
commit 0d6a48625e
parent e4348f6ae5
1 changed files with 1 additions and 1 deletions
--- a/doc/doc_en/dataset/ocr_datasets_en.md
+++ b/doc/doc_en/dataset/ocr_datasets_en.md
@ -73,7 +73,7 @@ After decompressing the data set and downloading the annotation file, PaddleOCR/

 The text recognition algorithm in PaddleOCR supports two data formats:
 - `lmdb` is used to train data sets stored in lmdb format, use [lmdb_dataset.py](../../../ppocr/data/lmdb_dataset.py) to load;
- - `通用数据` is used to train data sets stored in text files, use [simple_dataset.py](../../../ppocr/data/simple_dataset.py) to load.
+ - `common dataset` is used to train data sets stored in text files, use [simple_dataset.py](../../../ppocr/data/simple_dataset.py) to load.


 If you want to use your own data for training, please refer to the following to organize your data.