You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+35-30Lines changed: 35 additions & 30 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -35,6 +35,7 @@ Features:
35
35
- (Based on a 7B parameter VLM, so it requires a GPU)
36
36
37
37
### News
38
+
- June 17, 2025 - v0.1.75 - Switch from sglang to vllm based inference pipeline, updated docker image to CUDA 12.8.
38
39
- May 23, 2025 - v0.1.70 - Official docker support and images are now available! [See Docker usage](#using-docker)
39
40
- May 19, 2025 - v0.1.68 - [olmOCR-Bench](https://github.com/allenai/olmocr/tree/main/olmocr/bench) launch, scoring 77.4. Launch includes 2 point performance boost in olmOCR pipeline due to bug fixes with prompts.
40
41
- Mar 17, 2025 - v0.1.60 - Performance improvements due to better temperature selection in sampling.
@@ -49,29 +50,29 @@ We also ship a comprehensive benchmark suite covering over 7,000 test cases acro
49
50
<thead>
50
51
<tr>
51
52
<th align="left"><strong>Model</strong></th>
52
-
<th align="center">AR</th>
53
-
<th align="center">OSM</th>
54
-
<th align="center">TA</th>
55
-
<th align="center">OS</th>
56
-
<th align="center">HF</th>
57
-
<th align="center">MC</th>
58
-
<th align="center">LTT</th>
53
+
<th align="center">ArXiv</th>
54
+
<th align="center">Old Scans Math</th>
55
+
<th align="center">Tables</th>
56
+
<th align="center">Old Scans</th>
57
+
<th align="center">Headers and Footers</th>
58
+
<th align="center">Multi column</th>
59
+
<th align="center">Long tiny text</th>
59
60
<th align="center">Base</th>
60
-
<th align="center">Overall Score</th>
61
+
<th align="center">Overall</th>
61
62
</tr>
62
63
</thead>
63
64
<tbody>
64
65
<tr>
65
-
<td align="left">Marker v1.6.2</td>
66
-
<td align="center">24.3</td>
67
-
<td align="center">22.1</td>
68
-
<td align="center">69.8</td>
69
-
<td align="center">24.3</td>
70
-
<td align="center">87.1</td>
71
-
<td align="center">71.0</td>
72
-
<td align="center">76.9</td>
73
-
<td align="center"><strong>99.5</strong></td>
74
-
<td align="center">59.4 ± 1.1</td>
66
+
<td align="left">Marker v1.7.5 (base)</td>
67
+
<td align="center">76.0</td>
68
+
<td align="center">57.9</td>
69
+
<td align="center">57.6</td>
70
+
<td align="center">27.8</td>
71
+
<td align="center">84.9</td>
72
+
<td align="center">72.9</td>
73
+
<td align="center">84.6</td>
74
+
<td align="center">99.1</td>
75
+
<td align="center">70.1 ± 1.1</td>
75
76
</tr>
76
77
<tr>
77
78
<td align="left">MinerU v1.3.10</td>
@@ -94,24 +95,25 @@ We also ship a comprehensive benchmark suite covering over 7,000 test cases acro
<sup><sub>There was a small drop in scores from olmOCR v0.1.68 (77.4), which is due to two factors. One, is that we have adjusted our benchmark code to not include
200
+
any "fallback" mechanism when measuring benchmark scores (though it still exists when you run olmocr.pipeline). Second, there is a small drop in scores as we have updated
201
+
from sglang 0.4.2 to vllm 0.9.1. In net, we think the upgrade to vllm is the right choice, given that sglang 0.4.6 had even lower scores by one point, and vllm comes with a
202
+
small performance boost, and great support for quantization.
203
+
</sub></sup>
204
+
184
205
## Sourcing Documents and Tests
185
206
186
207
We define 7 distinct document types that we found olmOCR (or its earlier iterations) often struggled to process and defined custom acquisition strategies for each (described below). We removed documents that both contained PII and were not meant for public dissemination. We also decontaminate against documents that appear in olmOCR-Mix via URL level deduplication. To scale creation of test cases over these documents, we combined manual design and review with prompting GPT-4o.
@@ -288,6 +309,3 @@ We have an internal data annotation tool that can be used to review the question
0 commit comments