LLMs-from-scratch/ch05/02_alternative_weight_loading
Sebastian Raschka e55e3e88e1 Alt weight loading code via PyTorch (#585)
* Alt weight loading code via PyTorch

* commit additional files
2025-03-27 20:10:23 -05:00
..

Alternative Approaches to Loading Pretrained Weights

This folder contains alternative weight loading strategies in case the weights become unavailable from OpenAI.

  • weight-loading-pytorch.ipynb: (Recommended) contains code to load the weights from PyTorch state dicts that I created by converting the original TensorFlow weights

  • weight-loading-hf-transformers.ipynb: contains code to load the weights from the Hugging Face Model Hub via the transformers library

  • weight-loading-hf-safetensors.ipynb: contains code to load the weights from the Hugging Face Model Hub via the safetensors library directly (skipping the instantiation of a Hugging Face transformer model)