12 Commits

Author SHA1 Message Date
Jake Poznanski
c00e40d1c4 More fixes 2024-09-26 23:10:07 +00:00
Jake Poznanski
84e9da637c Removing lambda due to pickling errors 2024-09-26 21:39:08 +00:00
Jake Poznanski
9cbc128553 Sampling some sequence lengths 2024-09-25 09:05:11 -07:00
Jake Poznanski
ea0226c499 More flexibility in dataloader dims 2024-09-24 19:47:13 -07:00
Jake Poznanski
ea731055d7 More realistic configuration 2024-09-24 14:50:23 -07:00
Jake Poznanski
5a0bcb7b1d batch inference slowness 2024-09-24 09:13:47 -07:00
Jake Poznanski
28bcf72e11 Hoping to get a quick batch inference pipeline rolling 2024-09-24 08:56:36 -07:00
Jake Poznanski
3ed14a9ea5 Prepping new training stuff 2024-09-23 08:53:56 -07:00
Jake Poznanski
55035b02c9 Tries to run a forward pass but oOMS 2024-09-20 15:05:23 -07:00
Jake Poznanski
4eddb1b45f Okay, reasonably happy with the dataprep pipeline 2024-09-20 13:04:47 -07:00
Jake Poznanski
a47afe5c8d Adding test to make sure the traning and inference time tokenization stays identical, currenlty failing 2024-09-20 12:01:05 -07:00
Jake Poznanski
fcb67ebd61 Prepping data to be in a trainable format 2024-09-20 09:25:54 -07:00