Sebastian Raschka | 534a704364 | RoPE increase (#407) | 2024-10-21 19:58:38 -05:00
Sebastian Raschka | ec18b6a8a3 | Add Llama 3.2 RoPE to CI (#391) | 2024-10-08 08:28:34 -05:00
  * add Llama 3.2 RoPE to CI
  * update
Sebastian Raschka | 1eb0b3810a | Introduce buffers to improve Llama 3.2 efficiency (#389) | 2024-10-06 12:49:04 -05:00
  * Introduce buffers to improve Llama 3.2 efficiency
  * update
  * update
Daniel Kleine | a0c0c765a8 | fixed Llama 2 to 3.2 NBs (#388) | 2024-10-06 09:56:55 -05:00
  * updated requirements
  * fixes llama2 to llama3
  * fixed llama 3.2 standalone
  * fixed typo
  * fixed rope formula
  * Update requirements-extra.txt
  * Update ch05/07_gpt_to_llama/converting-llama2-to-llama3.ipynb
  * Update ch05/07_gpt_to_llama/converting-llama2-to-llama3.ipynb
  * Update ch05/07_gpt_to_llama/standalone-llama32.ipynb
  ---------
  Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
Sebastian Raschka | 0972ded530 | Add a note about weight tying in Llama 3.2 (#386) | 2024-10-05 09:20:54 -05:00
Sebastian Raschka | 8553644440 | Llama 3.2 requirements file | 2024-10-05 07:32:43 -05:00
Sebastian Raschka | b44096acef | Implement Llama 3.2 (#383) | 2024-10-05 07:30:47 -05:00
Sebastian Raschka | a5405c255d | Cos-sin fix in Llama 2 bonus notebook (#381) | 2024-10-03 20:45:40 -05:00
Sebastian Raschka | b993c2b25b | Improve rope settings for llama3 (#380) | 2024-10-03 08:29:54 -05:00
rasbt | 278a50a348 | add section numbers | 2024-09-30 08:42:22 -05:00
Sebastian Raschka | b56d0b2942 | Add llama2 unit tests (#372) | 2024-09-25 19:40:36 -05:00
  * add llama2 unit tests
  * update
  * updates
  * updates
  * update file path
  * update requirements file
  * rmsnorm test
  * update
rasbt | a6d8e93da3 | improve formatting | 2024-09-24 18:49:17 -05:00
Daniel Kleine | ff31b345b0 | ch05/07 gpt_to_llama text improvements (#369) | 2024-09-24 18:45:49 -05:00
  * fixed typo
  * fixed RMSnorm formula
  * fixed SwiGLU formula
  * temperature=0 for untrained model for reproducibility
  * added extra info hf token
rasbt | d144bd5b7a | add json import | 2024-09-23 09:12:35 -05:00
rasbt | 6bc3de165c | move access token to config.json | 2024-09-23 08:56:16 -05:00
rasbt | 58df945ed4 | add llama3 comparison | 2024-09-23 08:17:10 -05:00
Sebastian Raschka | 0467c8289b | GPT to Llama (#368) | 2024-09-23 07:34:06 -05:00
  * GPT to Llama
  * fix urls