mirror of
https://github.com/PaddlePaddle/PaddleOCR.git
synced 2025-06-26 21:24:27 +00:00
Update PP-OCRv4_introduction.md
This commit is contained in:
parent
82c6434d53
commit
e44c2af762
@ -155,8 +155,25 @@ GTC(Guided Training of CTC),是在PP-OCRv3中使用过的策略,融合
|
||||
经过以上优化,最终PP-OCRv4在速度可比情况下,中文场景端到端Hmean指标相比于PP-OCRv3提升4.5%,效果大幅提升。具体指标如下表所示:
|
||||
|
||||
| Model | Hmean | Model Size (M) | Time Cost (CPU, ms) |
|
||||
|-----|-----|--------|----| --- |
|
||||
|-----|-----|--------|----|
|
||||
| PP-OCRv3 | 57.99% | 15.6 | 78 |
|
||||
| PP-OCRv4 | 62.24% | 15.8 | 76 |
|
||||
|
||||
测试环境:CPU型号为Intel Gold 6148,CPU预测时使用openvino。
|
||||
|
||||
除了更新中文模型,本次升级也优化了英文数字模型,在自有评估集上文本识别准确率提升6%,如下表所示:
|
||||
|
||||
| Model | ACC |
|
||||
|-----|-----|
|
||||
| PP-OCR_en | 54.38% |
|
||||
| PP-OCRv3_en | 64.04% |
|
||||
| PP-OCRv4_en | 70.1% |
|
||||
|
||||
同时,也对已支持的80余种语言识别模型进行了升级更新,在有评估集的四种语系识别准确率平均提升5%以上,如下表所示:
|
||||
|
||||
| Model | 拉丁语系 | 阿拉伯语系 | 日语 | 韩语 |
|
||||
|-----|-----|--------|----| --- |
|
||||
| PP-OCR_mul | 69.60% | 40.50% | 38.50% | 55.40% |
|
||||
| PP-OCRv3_mul | 75.20%| 45.37% | 45.80% | 60.10% |
|
||||
| PP-OCRv4_mul | 80.00%| 75.48% | 56.50% | 83.25% |
|
||||
|
||||
|
Loading…
x
Reference in New Issue
Block a user