mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2025-09-24 23:55:31 +00:00

* updated .gitignore * removed unused GELU import * fixed model_configs, fixed all tensors on same device * removed unused tiktoken * update * update hparam search * remove redundant tokenizer argument --------- Co-authored-by: rasbt <mail@sebastianraschka.com>
Alternative Approaches to Loading Pretrained Weights
This folder contains alternative weight loading strategies in case the weights become unavailable from OpenAI.
- weight-loading-hf-transformers.ipynb: contains code to load the weights from the Hugging Face Model Hub via the
transformers
library