mirror of
https://github.com/PaddlePaddle/PaddleOCR.git
synced 2025-12-28 15:38:18 +00:00
mv layout and doc vqa dataset to docs/dataset
This commit is contained in:
parent
e4348f6ae5
commit
0d6a48625e
@ -73,7 +73,7 @@ After decompressing the data set and downloading the annotation file, PaddleOCR/
|
||||
|
||||
The text recognition algorithm in PaddleOCR supports two data formats:
|
||||
- `lmdb` is used to train data sets stored in lmdb format, use [lmdb_dataset.py](../../../ppocr/data/lmdb_dataset.py) to load;
|
||||
- `通用数据` is used to train data sets stored in text files, use [simple_dataset.py](../../../ppocr/data/simple_dataset.py) to load.
|
||||
- `common dataset` is used to train data sets stored in text files, use [simple_dataset.py](../../../ppocr/data/simple_dataset.py) to load.
|
||||
|
||||
|
||||
If you want to use your own data for training, please refer to the following to organize your data.
|
||||
|
||||
Loading…
x
Reference in New Issue
Block a user