3 Commits

Author SHA1 Message Date
Jake Poznanski
858b49656f Getting ready to train molmo 4096 context 2025-01-23 15:32:04 -08:00
Jake Poznanski
f42bb02fce Manually adding gradient checkpointing 2025-01-23 15:18:22 -08:00
Jake Poznanski
18569a4c63 Adding molmo code locally 2025-01-23 15:18:00 -08:00