Readme updates

Jake Poznanski 2025-08-13 22:03:03 +00:00
parent 36ca700669
commit 9f8df232b6
2 changed files with 29 additions and 4 deletions


@@ -35,6 +35,7 @@ Features:
 - (Based on a 7B parameter VLM, so it requires a GPU)
 ### News
+- August 13, 2025 - v0.3.0 - [New model release](https://huggingface.co/allenai/olmOCR-7B-0825-FP8), fixes auto-rotation detection, and hallucinations on blank documents.
 - July 24, 2025 - v0.2.1 - [New model release](https://huggingface.co/allenai/olmOCR-7B-0725-FP8), scores 3 points higher on [olmOCR-Bench](https://github.com/allenai/olmocr/tree/main/olmocr/bench), also runs significantly faster because it's default FP8, and needs much fewer retries per document.
 - July 23, 2025 - v0.2.0 - New cleaned up [trainer code](https://github.com/allenai/olmocr/tree/main/olmocr/train), makes it much simpler to train olmOCR models yourself.
 - June 17, 2025 - v0.1.75 - Switch from sglang to vllm based inference pipeline, updated docker image to CUDA 12.8.
@@ -97,7 +98,7 @@ We also ship a comprehensive benchmark suite covering over 7,000 test cases acro
 <td align="center">93.6</td>
 <td align="center">71.3</td>
 <td align="center">77.1</td>
-<td align="center"><strong>99.4</strong></td>
+<td align="center">99.4</td>
 <td align="center">72.0 ± 1.1</td>
 </tr>
 <tr>
@@ -115,7 +116,7 @@ We also ship a comprehensive benchmark suite covering over 7,000 test cases acro
 <tr>
 <td align="left">olmOCR v0.2.0</td>
 <td align="center"><strong>78.8</strong></td>
-<td align="center"><strong>77.5</strong></td>
+<td align="center">77.5</td>
 <td align="center">71.9</td>
 <td align="center"><strong>45.4</strong></td>
 <td align="center">94.2</td>
@@ -124,6 +125,18 @@ We also ship a comprehensive benchmark suite covering over 7,000 test cases acro
 <td align="center"><strong>99.8</strong></td>
 <td align="center"><strong>78.5 ± 1.1</strong></td>
 </tr>
+<tr>
+<td align="left">olmOCR v0.3.0</td>
+<td align="center">78.6</td>
+<td align="center"><strong>79.9</strong></td>
+<td align="center">72.9</td>
+<td align="center">43.9</td>
+<td align="center">95.1</td>
+<td align="center">77.3</td>
+<td align="center">81.2</td>
+<td align="center">98.9</td>
+<td align="center">78.5 ± 1.1</td>
+</tr>
 </tbody>
 </table>


@@ -81,7 +81,7 @@ to run it against your own OCR tools. Your tool just needs to support Markdown o
 <td align="center">93.6</td>
 <td align="center">71.3</td>
 <td align="center">77.1</td>
-<td align="center"><strong>99.4</strong></td>
+<td align="center">99.4</td>
 <td align="center">72.0 ± 1.1</td>
 </tr>
 <tr>
@@ -195,7 +195,7 @@ to run it against your own OCR tools. Your tool just needs to support Markdown o
 <tr>
 <td align="left">olmOCR v0.2.0</td>
 <td align="center"><strong>78.8</strong></td>
-<td align="center"><strong>77.5</strong></td>
+<td align="center">77.5</td>
 <td align="center">71.9</td>
 <td align="center"><strong>45.4</strong></td>
 <td align="center">94.2</td>
@@ -204,6 +204,18 @@ to run it against your own OCR tools. Your tool just needs to support Markdown o
 <td align="center"><strong>99.8</strong></td>
 <td align="center"><strong>78.5 ± 1.1</strong></td>
 </tr>
+<tr>
+<td align="left">olmOCR v0.3.0</td>
+<td align="center">78.6</td>
+<td align="center"><strong>79.9</strong></td>
+<td align="center">72.9</td>
+<td align="center">43.9</td>
+<td align="center">95.1</td>
+<td align="center">77.3</td>
+<td align="center">81.2</td>
+<td align="center">98.9</td>
+<td align="center">78.5 ± 1.1</td>
+</tr>
 </tbody>
 </table>