Sebastian Raschka
|
19c065b342
|
Interleaved Q and K for RoPE in Llama 2 (#750)
|
2025-07-23 08:02:02 -05:00 |
|
Sebastian Raschka
|
a5ea296259
|
Use more recent sentencepiece tokenizer API (#696)
|
2025-06-22 13:52:30 -05:00 |
|
Sebastian Raschka
|
c43d7ef663
|
reformat nbs (#602)
|
2025-04-05 16:18:27 -05:00 |
|
Sebastian Raschka
|
7114ccd10d
|
Add PyPI package (#576)
* Add PyPI package
* fixes
* fixes
|
2025-03-23 19:28:49 -05:00 |
|
casinca
|
57fdd94358
|
[minor] typo & comments (#441)
* typo & comment
- safe -> save
- commenting code: batch_size, seq_len = in_idx.shape
* comment
- adding # NEW for assert num_heads % num_kv_groups == 0
* update memory wording
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-11-18 19:52:42 +09:00 |
|
Daniel Kleine
|
7e6f8ce020
|
updated RoPE statement (#423)
* updated RoPE statement
* updated .gitignore
* Update ch05/07_gpt_to_llama/converting-gpt-to-llama2.ipynb
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-10-30 08:00:08 -05:00 |
|
ROHAN WINSOR
|
e85d154522
|
Fix argument name in LlamaTokenizer constructor (#421)
This PR addresses an oversight in the LlamaTokenizer class by changing the constructor argument from filepath to tokenizer_file.
|
2024-10-29 18:01:36 -05:00 |
|
Daniel Kleine
|
8b60460319
|
Updated Llama 2 to 3 paths (#413)
* llama 2 and 3 path fixes
* updated llama 3, 3.1 and 3.2 paths
* updated .gitignore
* Typo fix
---------
Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
|
2024-10-24 07:40:08 -05:00 |
|
Sebastian Raschka
|
f8bdfe12e1
|
RoPE updates (#412)
* RoPE updates
* Apply suggestions from code review
* updates
* updates
* updates
|
2024-10-23 18:07:49 -05:00 |
|
Sebastian Raschka
|
6f86c78763
|
Implement Llama 3.2 (#383)
|
2024-10-05 07:30:47 -05:00 |
|
Sebastian Raschka
|
d313f61c86
|
Cos-sin fix in Llama 2 bonus notebook (#381)
|
2024-10-03 20:45:40 -05:00 |
|
Sebastian Raschka
|
feb0647c79
|
Improve rope settings for llama3 (#380)
|
2024-10-03 08:29:54 -05:00 |
|
rasbt
|
2ae4ad15ba
|
add section numbers
|
2024-09-30 08:42:22 -05:00 |
|
Sebastian Raschka
|
b8497c1bf5
|
Add llama2 unit tests (#372)
* add llama2 unit tests
* update
* updates
* updates
* update file path
* update requirements file
* rmsnorm test
* update
|
2024-09-25 19:40:36 -05:00 |
|
rasbt
|
a23fca84d5
|
improve formatting
|
2024-09-24 18:49:17 -05:00 |
|
Daniel Kleine
|
4541177063
|
ch05/07 gpt_to_llama text improvements (#369)
* fixed typo
* fixed RMSnorm formula
* fixed SwiGLU formula
* temperature=0 for untrained model for reproducibility
* added extra info hf token
|
2024-09-24 18:45:49 -05:00 |
|
rasbt
|
941629d2c7
|
add json import
|
2024-09-23 09:12:35 -05:00 |
|
rasbt
|
835832a0f9
|
move access token to config.json
|
2024-09-23 08:56:16 -05:00 |
|
rasbt
|
5e6c7230ac
|
add llama3 comparison
|
2024-09-23 08:17:10 -05:00 |
|
Sebastian Raschka
|
c38b003aa9
|
GPT to Llama (#368)
* GPT to Llama
* fix urls
|
2024-09-23 07:34:06 -05:00 |
|