LLMs-from-scratch/ch05/02_alternative_weight_loading
Sebastian Raschka 7bd263144e
Switch from urllib to requests to improve reliability (#867)
* Switch from urllib to requests to improve reliability

* Keep ruff linter-specific

* update

* update

* update
2025-10-07 15:22:59 -05:00
..

Alternative Approaches to Loading Pretrained Weights

This folder contains alternative weight loading strategies in case the weights become unavailable from OpenAI.

  • weight-loading-pytorch.ipynb: (Recommended) contains code to load the weights from PyTorch state dicts that I created by converting the original TensorFlow weights

  • weight-loading-hf-transformers.ipynb: contains code to load the weights from the Hugging Face Model Hub via the transformers library

  • weight-loading-hf-safetensors.ipynb: contains code to load the weights from the Hugging Face Model Hub via the safetensors library directly (skipping the instantiation of a Hugging Face transformer model)