14 Commits

Author SHA1 Message Date
Sebastian Raschka
e719bd86ad
Qwen3 From Scratch (#678)
* Qwen3 From Scratch

* rev other file

* upd

* upd

* upd

* url fixes
2025-06-19 18:44:38 -05:00
Sebastian Raschka
feb1e9a83d
Add readme (#577) 2025-03-23 19:35:12 -05:00
Sebastian Raschka
908dd2f71e
PyTorch tips for better training performance (#525)
* PyTorch tips for better training performance

* formatting

* pep 8
2025-02-12 16:10:34 -06:00
Sebastian Raschka
05b04f2a5a
Memory efficient weight loading (#401)
* memory efficient weight loading

* remove unused code
2024-10-14 10:30:25 -05:00
Sebastian Raschka
b6c4b2f9f1
Update bonus section formatting (#400) 2024-10-12 10:26:08 -05:00
Sebastian Raschka
8a448a4410
Llama 3 (#384)
* Implement Llama 3.2

* Add Llama 3.2 files

* exclude IMDB link because stanford website seems down
2024-10-05 07:52:15 -05:00
Sebastian Raschka
0467c8289b
GPT to Llama (#368)
* GPT to Llama

* fix urls
2024-09-23 07:34:06 -05:00
Sebastian Raschka
ea9b4e83a4
Add chatpgpt-like user interface (#360)
* Add chatpgpt-like user interface

* fixes
2024-09-17 08:26:44 -05:00
rasbt
e24fd98cdf
distinguish better between main chapter code and bonus materials 2024-06-11 21:07:42 -05:00
rasbt
cd7ea15e8d
add readme 2024-05-13 08:50:55 -05:00
Sebastian Raschka
e757091301
Organized setup instructions (#115)
* Organized setup instructions

* update tets

* link checker action

* raise error upon broken link

* fix links

* fix links

* delete duplicated paragraph
2024-04-10 22:09:46 -04:00
Sebastian Raschka
cf39abac04 Add and link bonus material (#84) 2024-03-23 07:27:43 -05:00
Sebastian Raschka
4582995ced Add alternative weight loading strategy as backup (#82) 2024-03-20 08:43:18 -05:00
rasbt
87a743076d hparam tuning script 2024-02-27 08:51:03 -06:00