Sebastian Raschka
|
6f86c78763
|
Implement Llama 3.2 (#383)
|
2024-10-05 07:30:47 -05:00 |
|
Sebastian Raschka
|
d313f61c86
|
Cos-sin fix in Llama 2 bonus notebook (#381)
|
2024-10-03 20:45:40 -05:00 |
|
Sebastian Raschka
|
feb0647c79
|
Improve rope settings for llama3 (#380)
|
2024-10-03 08:29:54 -05:00 |
|
rasbt
|
2ae4ad15ba
|
add section numbers
|
2024-09-30 08:42:22 -05:00 |
|
Sebastian Raschka
|
b8497c1bf5
|
Add llama2 unit tests (#372)
* add llama2 unit tests
* update
* updates
* updates
* update file path
* update requirements file
* rmsnorm test
* update
|
2024-09-25 19:40:36 -05:00 |
|
rasbt
|
a23fca84d5
|
improve formatting
|
2024-09-24 18:49:17 -05:00 |
|
Daniel Kleine
|
4541177063
|
ch05/07 gpt_to_llama text improvements (#369)
* fixed typo
* fixed RMSnorm formula
* fixed SwiGLU formula
* temperature=0 for untrained model for reproducibility
* added extra info hf token
|
2024-09-24 18:45:49 -05:00 |
|
rasbt
|
941629d2c7
|
add json import
|
2024-09-23 09:12:35 -05:00 |
|
rasbt
|
835832a0f9
|
move access token to config.json
|
2024-09-23 08:56:16 -05:00 |
|
rasbt
|
5e6c7230ac
|
add llama3 comparison
|
2024-09-23 08:17:10 -05:00 |
|
Sebastian Raschka
|
c38b003aa9
|
GPT to Llama (#368)
* GPT to Llama
* fix urls
|
2024-09-23 07:34:06 -05:00 |
|