aman-17
|
a036133fdd
|
resolved all the mypy, black and isort issues and updated readme
|
2025-02-07 16:05:00 -08:00 |
|
Jake Poznanski
|
9bf3d35cdb
|
Comment fix
|
2025-01-30 16:02:08 -08:00 |
|
Jake Poznanski
|
2ab7cb280c
|
Removing pymupdf
|
2025-01-30 15:51:54 -08:00 |
|
Jake Poznanski
|
72f4b9a590
|
Project setup
|
2025-01-30 15:33:04 -08:00 |
|
Jake Poznanski
|
cdd830235f
|
Shortened some sample docs
|
2025-01-30 15:28:31 -08:00 |
|
Jake Poznanski
|
10094ffc19
|
Even newer mypy crashes still
|
2025-01-30 14:32:08 -08:00 |
|
Jake Poznanski
|
fb402297ce
|
Isort and black update
|
2025-01-29 15:42:34 -08:00 |
|
Jake Poznanski
|
dcaca8aa90
|
Black formatting
|
2025-01-29 15:30:39 -08:00 |
|
Jake Poznanski
|
4a1762d455
|
isort
|
2025-01-29 15:25:10 -08:00 |
|
Jake Poznanski
|
0628d3161f
|
Some unit test cleanup
|
2025-01-29 15:15:10 -08:00 |
|
Jake Poznanski
|
b28aad61bb
|
More test docs
|
2025-01-27 21:11:23 +00:00 |
|
Jake Poznanski
|
96ae2dd49b
|
Refactoring
|
2025-01-27 20:45:28 +00:00 |
|
Jake Poznanski
|
c6062677aa
|
Cleaning up some unused code
|
2025-01-27 18:48:15 +00:00 |
|
Jake Poznanski
|
b2894d0280
|
Massive refactor from pdelfin to olmocr
|
2025-01-27 18:30:41 +00:00 |
|
Jake Poznanski
|
01469af463
|
Doing some debugging
|
2025-01-23 10:58:43 -08:00 |
|
Jake Poznanski
|
72d2fa2fd4
|
Reviewing molmo training
|
2025-01-22 15:23:08 -08:00 |
|
Jake Poznanski
|
0d1fc08081
|
Small fixes
|
2025-01-10 19:38:42 +00:00 |
|
Jake Poznanski
|
5692a76350
|
Ok, direct easy test for diffs now
|
2024-12-04 13:27:51 -08:00 |
|
Jake Poznanski
|
48f3ab82bd
|
Working on some random tests
|
2024-12-04 13:20:10 -08:00 |
|
Jake Poznanski
|
917cdeccba
|
Some more tests
|
2024-12-03 15:32:53 -08:00 |
|
Jake Poznanski
|
9b9d04c8e9
|
aaa
|
2024-11-26 08:38:25 -08:00 |
|
Jake Poznanski
|
386374bd72
|
More prints
|
2024-11-25 16:08:24 -08:00 |
|
Jake Poznanski
|
04d6123037
|
Doing some experiments
|
2024-11-25 15:36:04 -08:00 |
|
Jake Poznanski
|
51614efc83
|
More log probs investigation
|
2024-11-25 11:24:21 -08:00 |
|
Jake Poznanski
|
28d52602e9
|
More test code
|
2024-11-25 11:00:03 -08:00 |
|
Jake Poznanski
|
606e81bfea
|
Not happy here with this test
|
2024-11-25 10:32:18 -08:00 |
|
Jake Poznanski
|
d7838372e8
|
Full test
|
2024-11-25 10:25:55 -08:00 |
|
Jake Poznanski
|
2e4f7d7827
|
Working on HF test for comparison
|
2024-11-25 10:12:29 -08:00 |
|
Jake Poznanski
|
5e3080db28
|
Sglang based unit test
|
2024-11-25 09:48:05 -08:00 |
|
Jake Poznanski
|
60f24ad2d6
|
tests
|
2024-11-25 09:39:55 -08:00 |
|
Jake Poznanski
|
5289092076
|
Startingon sglang test
|
2024-11-25 09:34:59 -08:00 |
|
Jake Poznanski
|
ba8eba245b
|
Unit tests fixes
|
2024-11-25 09:13:13 -08:00 |
|
Jake Poznanski
|
c9e1a4c540
|
More tests
|
2024-11-20 19:37:00 +00:00 |
|
Jake Poznanski
|
8793fc7d99
|
Adding more retries, and it was able to process more complicated books
|
2024-11-18 14:25:32 -08:00 |
|
Jake Poznanski
|
e499413089
|
Better work queue
|
2024-11-18 11:04:51 -08:00 |
|
Jake Poznanski
|
04429b2862
|
Basic work queue from claude
|
2024-11-18 10:07:03 -08:00 |
|
Jake Poznanski
|
fcabb8e55a
|
Handling more error cases
|
2024-11-18 09:12:04 -08:00 |
|
Jake Poznanski
|
96984fcd77
|
Fix a reliability issue
|
2024-11-18 09:03:24 -08:00 |
|
Jake Poznanski
|
6a4a55f9e0
|
Hopefully working molmo HF trainer config
|
2024-10-30 14:00:27 -07:00 |
|
Jake Poznanski
|
bede854cd5
|
Startng to write molmo formatters
|
2024-10-30 13:24:11 -07:00 |
|
Jake Poznanski
|
85e0e2a61b
|
Fixing issues with pdf parsing
|
2024-10-30 16:26:02 +00:00 |
|
Jake Poznanski
|
08d51b7183
|
Adding some rotation retry contrl
|
2024-10-28 20:16:06 +00:00 |
|
Jake Poznanski
|
ffe470bf0e
|
Fix
|
2024-10-23 22:55:50 +00:00 |
|
Jake Poznanski
|
180dde03c5
|
dataprep sampling tests
|
2024-10-23 22:53:05 +00:00 |
|
Jake Poznanski
|
999f64dd46
|
Adding empty anchor support
|
2024-10-23 22:17:20 +00:00 |
|
Jake Poznanski
|
a1a4798ce7
|
Some crazy idea I had to simplify futures and memory limits
|
2024-10-23 21:51:37 +00:00 |
|
Jake Poznanski
|
302eee3da5
|
Yay matches between birr and hf
|
2024-10-21 16:58:30 +00:00 |
|
Jake Poznanski
|
9d35d3ca8f
|
Birr tokenization test
|
2024-10-18 23:02:37 +00:00 |
|
Jake Poznanski
|
7dbcbc154b
|
Birr tests that don't do anything but help me understand the universe
|
2024-10-18 22:39:17 +00:00 |
|
Jake Poznanski
|
dd4f9670b5
|
Filter refactor
|
2024-10-17 22:36:38 +00:00 |
|