Sebastian Raschka
|
f1434652f2
|
reformat nbs (#602)
|
2025-04-05 16:18:27 -05:00 |
|
Sebastian Raschka
|
c21bfe4a23
|
Add PyPI package (#576)
* Add PyPI package
* fixes
* fixes
|
2025-03-23 19:28:49 -05:00 |
|
casinca
|
bb31de8999
|
[minor] typo & comments (#441)
* typo & comment
- safe -> save
- commenting code: batch_size, seq_len = in_idx.shape
* comment
- adding # NEW for assert num_heads % num_kv_groups == 0
* update memory wording
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-11-18 19:52:42 +09:00 |
|
Daniel Kleine
|
81eed9afe2
|
updated RoPE statement (#423)
* updated RoPE statement
* updated .gitignore
* Update ch05/07_gpt_to_llama/converting-gpt-to-llama2.ipynb
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-10-30 08:00:08 -05:00 |
|
ROHAN WINSOR
|
cd24a27161
|
Fix argument name in LlamaTokenizer constructor (#421)
This PR addresses an oversight in the LlamaTokenizer class by changing the constructor argument from filepath to tokenizer_file.
|
2024-10-29 18:01:36 -05:00 |
|
Daniel Kleine
|
d38083c401
|
Updated Llama 2 to 3 paths (#413)
* llama 2 and 3 path fixes
* updated llama 3, 3.1 and 3.2 paths
* updated .gitignore
* Typo fix
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-10-24 07:40:08 -05:00 |
|
Sebastian Raschka
|
7cd6a670ed
|
RoPE updates (#412)
* RoPE updates
* Apply suggestions from code review
* updates
* updates
* updates
|
2024-10-23 18:07:49 -05:00 |
|
Sebastian Raschka
|
b44096acef
|
Implement Llama 3.2 (#383)
|
2024-10-05 07:30:47 -05:00 |
|
Sebastian Raschka
|
a5405c255d
|
Cos-sin fix in Llama 2 bonus notebook (#381)
|
2024-10-03 20:45:40 -05:00 |
|
Sebastian Raschka
|
b993c2b25b
|
Improve rope settings for llama3 (#380)
|
2024-10-03 08:29:54 -05:00 |
|
rasbt
|
278a50a348
|
add section numbers
|
2024-09-30 08:42:22 -05:00 |
|
Sebastian Raschka
|
b56d0b2942
|
Add llama2 unit tests (#372)
* add llama2 unit tests
* update
* updates
* updates
* update file path
* update requirements file
* rmsnorm test
* update
|
2024-09-25 19:40:36 -05:00 |
|
rasbt
|
a6d8e93da3
|
improve formatting
|
2024-09-24 18:49:17 -05:00 |
|
Daniel Kleine
|
ff31b345b0
|
ch05/07 gpt_to_llama text improvements (#369)
* fixed typo
* fixed RMSnorm formula
* fixed SwiGLU formula
* temperature=0 for untrained model for reproducibility
* added extra info hf token
|
2024-09-24 18:45:49 -05:00 |
|
rasbt
|
d144bd5b7a
|
add json import
|
2024-09-23 09:12:35 -05:00 |
|
rasbt
|
6bc3de165c
|
move access token to config.json
|
2024-09-23 08:56:16 -05:00 |
|
rasbt
|
58df945ed4
|
add llama3 comparison
|
2024-09-23 08:17:10 -05:00 |
|
Sebastian Raschka
|
0467c8289b
|
GPT to Llama (#368)
* GPT to Llama
* fix urls
|
2024-09-23 07:34:06 -05:00 |
|