Jake Poznanski
|
84e68f313e
|
Basic forward generation pass with openai dataset and qwen2vl
|
2024-09-19 22:16:59 +00:00 |
|
Jake Poznanski
|
7d2c447dd3
|
Importing core training config stuff from dolma refine
|
2024-09-19 21:55:07 +00:00 |
|
Jake Poznanski
|
bab32aa9b3
|
Formatting
|
2024-09-18 22:52:42 +00:00 |
|
Jake Poznanski
|
f4d18cb287
|
Dataloader capabable of loading 38k rows reasonably fast
|
2024-09-18 22:48:38 +00:00 |
|
Jake Poznanski
|
d22b311340
|
Starting to write dataloader for visual lm data
|
2024-09-18 21:42:09 +00:00 |
|
Jake Poznanski
|
fb4fc4229e
|
Fixing close file warning
|
2024-09-17 20:31:32 +00:00 |
|
Jake Poznanski
|
af2126df99
|
450tok/sec/core with smollm that appears to work well
|
2024-09-17 19:59:02 +00:00 |
|
Jake Poznanski
|
2f71cb9232
|
Using SmolLM, seems a lot better and is able to pass some tests
|
2024-09-17 18:47:27 +00:00 |
|
Jake Poznanski
|
57e80aacd2
|
Testing coherence with distilgpt2, but it doesn't work great
|
2024-09-17 16:58:45 +00:00 |
|
Jake Poznanski
|
cb9b6efb3c
|
Trying distilgpt2 instead of kenlm
|
2024-09-17 16:50:01 +00:00 |
|
Jake Poznanski
|
01bc0b2f10
|
Moving a whole bunch of code over, still broken
|
2024-09-17 16:26:55 +00:00 |
|
Jake Poznanski
|
a534a0180d
|
Moving pdf filter code over with tests
|
2024-09-17 15:16:58 +00:00 |
|
Jake Poznanski
|
9662718bfd
|
Running personalize script on template
|
2024-09-17 15:06:59 +00:00 |
|
Jake Poznanski
|
7d71e2d643
|
Update README.md
|
2024-09-17 07:58:39 -07:00 |
|
Jake Poznanski
|
68b2c0e8d6
|
Initial commit
|
2024-09-17 07:53:43 -07:00 |
|