Jake Poznanski
|
c2909f314e
|
run pipeline
|
2024-10-09 19:55:45 +00:00 |
|
Jake Poznanski
|
954b19a5d4
|
Stuff
|
2024-10-09 19:55:04 +00:00 |
|
Jake Poznanski
|
991b213cf5
|
Refactoring, startng to write run_pipeline
|
2024-10-09 18:48:31 +00:00 |
|
Jake Poznanski
|
4bf6e7a430
|
Refactoring
|
2024-10-09 18:11:18 +00:00 |
|
Jake Poznanski
|
0c56dec704
|
Adding diff to tinyhost
|
2024-10-09 17:53:26 +00:00 |
|
Jake Poznanski
|
400e92180b
|
Unifying some of the pdf rendering stuff
|
2024-10-09 16:57:13 +00:00 |
|
Jake Poznanski
|
dc6440d068
|
Cleaning up anchor text to deal with abnormally long lines
|
2024-10-09 16:29:20 +00:00 |
|
Jake Poznanski
|
b6b74b7832
|
Rewriting prompts to eval with new model
|
2024-10-09 16:04:39 +00:00 |
|
Jake Poznanski
|
7c19a9a856
|
fix
|
2024-10-08 23:54:17 +00:00 |
|
Jake Poznanski
|
ad10add6c1
|
try lower lr
|
2024-10-08 23:52:56 +00:00 |
|
Jake Poznanski
|
230c8a9f9a
|
Trying new run that will rewrite the prompts as it goes
|
2024-10-08 22:10:18 +00:00 |
|
Jake Poznanski
|
97291b3f6a
|
Anchor is fixed to sample text elements better
|
2024-10-08 21:51:43 +00:00 |
|
Jake Poznanski
|
c8a4d14c57
|
Adding image merging to pdf report/hint/anchor
|
2024-10-08 21:23:21 +00:00 |
|
Jake Poznanski
|
57d9a21eeb
|
Adding prompt length histogram to a script
|
2024-10-08 18:22:56 +00:00 |
|
Jake Poznanski
|
adc702c918
|
FIxing wandb key
|
2024-10-08 18:16:39 +00:00 |
|
Jake Poznanski
|
085937859f
|
Lower lr
|
2024-10-08 17:52:00 +00:00 |
|
Jake Poznanski
|
4b30dd867b
|
Fixing eval script, working FSDP config
|
2024-10-08 16:56:07 +00:00 |
|
Jake Poznanski
|
f5fd9ff53a
|
Trying grad checkpoint
|
2024-10-08 16:11:31 +00:00 |
|
Jake Poznanski
|
4fb7e9b184
|
Updated eval script
|
2024-10-08 16:09:25 +00:00 |
|
Jake Poznanski
|
fb4e585e9f
|
Trying out non-lora training
|
2024-10-08 15:20:37 +00:00 |
|
Jake Poznanski
|
ec09408ca9
|
Filtering based on cpu count
|
2024-10-07 15:40:29 -07:00 |
|
Jake Poznanski
|
a90eb94951
|
Fix dataloader bug
|
2024-10-07 15:25:48 -07:00 |
|
Jake Poznanski
|
3d36545fa5
|
loading fix for parquets again...
|
2024-10-07 14:48:53 -07:00 |
|
Jake Poznanski
|
fdcd77eadd
|
typo
|
2024-10-07 14:32:47 -07:00 |
|
Jake Poznanski
|
7416b42023
|
Adding support for parquet datasets which are precached
|
2024-10-07 21:14:33 +00:00 |
|
Jake Poznanski
|
dc26541da2
|
Starting code to build parquets...
|
2024-10-07 20:59:43 +00:00 |
|
Jake Poznanski
|
4557a5b296
|
Typo
|
2024-10-07 13:03:31 -07:00 |
|
Jake Poznanski
|
e973de7ba9
|
Typo
|
2024-10-07 13:01:43 -07:00 |
|
Jake Poznanski
|
ebd40f9084
|
Hopefully fixing dataloader for now
|
2024-10-07 12:59:27 -07:00 |
|
Jake Poznanski
|
5d35461dd2
|
Fix for unicode errors in big datasets for the future
|
2024-10-07 17:01:59 +00:00 |
|
Jake Poznanski
|
44bcdc771b
|
Hopefully can use weka for the train datasets now
|
2024-10-07 16:14:28 +00:00 |
|
Jake Poznanski
|
d8e459c9f3
|
Weird issue with surrogate pairs in json
|
2024-10-07 09:04:13 -07:00 |
|
Jake Poznanski
|
98020cabbb
|
Allow loading files locally
|
2024-10-07 07:49:16 -07:00 |
|
Jake Poznanski
|
13123ddea4
|
Pinning datasets to work around weird issue
|
2024-10-06 03:56:27 +00:00 |
|
Jake Poznanski
|
568dd48509
|
Prepping for qwen2vl full training run
|
2024-10-05 04:04:45 +00:00 |
|
Jake Poznanski
|
6065da268b
|
Hopefully working better
|
2024-10-04 18:06:04 +00:00 |
|
Jake Poznanski
|
a2ff849a78
|
checkpoint on new runner for openai batches
|
2024-10-04 17:32:35 +00:00 |
|
Jake Poznanski
|
2da901d433
|
new better runopenaibatch script
|
2024-10-04 16:58:38 +00:00 |
|
Jake Poznanski
|
35ec67c427
|
Hopefully finishing touches
|
2024-10-04 16:10:19 +00:00 |
|
Jake Poznanski
|
db36608b42
|
Fix
|
2024-10-04 16:05:08 +00:00 |
|
Jake Poznanski
|
f25cb6c261
|
Fixes
|
2024-10-04 15:54:00 +00:00 |
|
Jake Poznanski
|
4630f7b1cb
|
Bugfixes
|
2024-10-04 15:35:52 +00:00 |
|
Jake Poznanski
|
e87729a653
|
New send silver script for testing
|
2024-10-04 15:27:43 +00:00 |
|
Jake Poznanski
|
6e1094ee8a
|
Support for more evals and output formats
|
2024-10-03 20:19:52 +00:00 |
|
Jake Poznanski
|
974ddd3773
|
I'm pretty sure we only need to save on rank0
|
2024-10-03 11:30:44 -07:00 |
|
Jake Poznanski
|
8f1fa4f796
|
Running a mini config again with metric
|
2024-10-03 11:12:30 -07:00 |
|
Jake Poznanski
|
046d4a4534
|
Adding eval on start and seed params
|
2024-10-03 10:54:25 -07:00 |
|
Jake Poznanski
|
2227605bfb
|
Mini train config
|
2024-10-03 10:32:15 -07:00 |
|
Jake Poznanski
|
4505a49420
|
Pinning to normal transformers version now
|
2024-10-03 09:00:53 -07:00 |
|
Jake Poznanski
|
78e3a94173
|
Adding pluto ib
|
2024-10-03 15:33:17 +00:00 |
|