olmocr/pdelfin/train/train.py
Jake Poznanski bab32aa9b3 Formatting
2024-09-18 22:52:42 +00:00

# Step 1. Load the data
# Probably just a folder of OpenAI batch input JSONL files, plus the matching batch output JSONL files
# TODO: Figure out hyperparameters for image sizing
# Step 2. Run those prompts through the model and do a forward pass to calculate the loss
# Step 3. Add Hugging Face Accelerate for training
# Step 4. Add checkpointing code, both saving and reloading, so an interrupted run can restart
# Step 5. Move from the interactive session to a gantry launch script
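
# A minimal sketch of Step 1, not the final loader. It assumes the files follow
# the OpenAI Batch API layout for chat completions: each input line carries
# `custom_id` and `body`, and each output line carries `custom_id`, `error`,
# and `response` (with `status_code` and `body.choices`). The helper names
# `read_jsonl` and `pair_batch_records` are hypothetical, as is the shape of
# the returned dicts. Image sizing (the TODO above) is not handled here.

```python
import json
from pathlib import Path
from typing import Dict, Iterable, Iterator, List


def read_jsonl(path: Path) -> Iterator[dict]:
    """Yield one parsed JSON object per non-empty line of a JSONL file."""
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:
                yield json.loads(line)


def pair_batch_records(
    input_paths: Iterable[Path], output_paths: Iterable[Path]
) -> List[dict]:
    """Join batch requests to their responses on `custom_id`."""
    # Index every request body by its custom_id.
    requests: Dict[str, dict] = {}
    for path in input_paths:
        for record in read_jsonl(path):
            requests[record["custom_id"]] = record["body"]

    pairs: List[dict] = []
    for path in output_paths:
        for record in read_jsonl(path):
            custom_id = record["custom_id"]
            response = record.get("response") or {}
            # Skip failed requests and responses with no matching input.
            if record.get("error") or response.get("status_code") != 200:
                continue
            if custom_id not in requests:
                continue
            choices = response["body"]["choices"]
            pairs.append(
                {
                    "custom_id": custom_id,
                    "request": requests[custom_id],
                    "response_text": choices[0]["message"]["content"],
                }
            )
    return pairs
```

# Each returned pair holds the original prompt (`request`) and the target text
# (`response_text`), which is the input Step 2 would need for a forward pass.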