Sebastian Raschka
|
e719bd86ad
|
Qwen3 From Scratch (#678)
* Qwen3 From Scratch
* rev other file
* upd
* upd
* upd
* url fixes
|
2025-06-19 18:44:38 -05:00 |
|
Sebastian Raschka
|
feb1e9a83d
|
Add readme (#577)
|
2025-03-23 19:35:12 -05:00 |
|
Sebastian Raschka
|
908dd2f71e
|
PyTorch tips for better training performance (#525)
* PyTorch tips for better training performance
* formatting
* pep 8
|
2025-02-12 16:10:34 -06:00 |
|
Sebastian Raschka
|
05b04f2a5a
|
Memory efficient weight loading (#401)
* memory efficient weight loading
* remove unused code
|
2024-10-14 10:30:25 -05:00 |
|
Sebastian Raschka
|
b6c4b2f9f1
|
Update bonus section formatting (#400)
|
2024-10-12 10:26:08 -05:00 |
|
Sebastian Raschka
|
8a448a4410
|
Llama 3 (#384)
* Implement Llama 3.2
* Add Llama 3.2 files
* exclude IMDB link because stanford website seems down
|
2024-10-05 07:52:15 -05:00 |
|
Sebastian Raschka
|
0467c8289b
|
GPT to Llama (#368)
* GPT to Llama
* fix urls
|
2024-09-23 07:34:06 -05:00 |
|
Sebastian Raschka
|
ea9b4e83a4
|
Add chatpgpt-like user interface (#360)
* Add chatpgpt-like user interface
* fixes
|
2024-09-17 08:26:44 -05:00 |
|
rasbt
|
e24fd98cdf
|
distinguish better between main chapter code and bonus materials
|
2024-06-11 21:07:42 -05:00 |
|
rasbt
|
cd7ea15e8d
|
add readme
|
2024-05-13 08:50:55 -05:00 |
|
Sebastian Raschka
|
e757091301
|
Organized setup instructions (#115)
* Organized setup instructions
* update tets
* link checker action
* raise error upon broken link
* fix links
* fix links
* delete duplicated paragraph
|
2024-04-10 22:09:46 -04:00 |
|
Sebastian Raschka
|
cf39abac04
|
Add and link bonus material (#84)
|
2024-03-23 07:27:43 -05:00 |
|
Sebastian Raschka
|
4582995ced
|
Add alternative weight loading strategy as backup (#82)
|
2024-03-20 08:43:18 -05:00 |
|
rasbt
|
87a743076d
|
hparam tuning script
|
2024-02-27 08:51:03 -06:00 |
|