Update PP-OCRv4_introduction.md

This commit is contained in:
xiaoting 2023-06-27 16:04:08 +08:00 committed by GitHub
parent 82c6434d53
commit e44c2af762
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23

View File

@ -155,8 +155,25 @@ GTCGuided Training of CTC是在PP-OCRv3中使用过的策略融合
经过以上优化最终PP-OCRv4在速度可比情况下中文场景端到端Hmean指标相比于PP-OCRv3提升4.5%,效果大幅提升。具体指标如下表所示:
| Model | Hmean | Model Size (M) | Time Cost (CPU, ms) |
|-----|-----|--------|----| --- |
|-----|-----|--------|----|
| PP-OCRv3 | 57.99% | 15.6 | 78 |
| PP-OCRv4 | 62.24% | 15.8 | 76 |
测试环境CPU型号为Intel Gold 6148CPU预测时使用openvino。
除了更新中文模型本次升级也优化了英文数字模型在自有评估集上文本识别准确率提升6%,如下表所示:
| Model | ACC |
|-----|-----|
| PP-OCR_en | 54.38% |
| PP-OCRv3_en | 64.04% |
| PP-OCRv4_en | 70.1% |
同时也对已支持的80余种语言识别模型进行了升级更新在有评估集的四种语系识别准确率平均提升5%以上,如下表所示:
| Model | 拉丁语系 | 阿拉伯语系 | 日语 | 韩语 |
|-----|-----|--------|----| --- |
| PP-OCR_mul | 69.60% | 40.50% | 38.50% | 55.40% |
| PP-OCRv3_mul | 75.20%| 45.37% | 45.80% | 60.10% |
| PP-OCRv4_mul | 80.00%| 75.48% | 56.50% | 83.25% |