7 Commits

Author SHA1 Message Date
Sebastian Raschka
e9c1c1da38
Fix qk_norm comment (#769) 2025-08-15 08:38:48 -05:00
Sebastian Raschka
f92b40e4ab
Qwen3 Coder Flash & MoE from Scratch (#760)
* Qwen3 Coder Flash & MoE from Scratch

* update

* refinements

* updates

* update

* update

* update
2025-08-01 19:13:17 -05:00
Sebastian Raschka
21c41721cc
Add more sophisticated Qwen3 tokenizer (#729) 2025-07-09 13:16:26 -05:00
Sebastian Raschka
0405b0c8e7
Handle other Qwen3 tokenizer settings (#716) 2025-06-30 17:49:51 -05:00
Sebastian Raschka
c4ec55edac
Support different Qwen3 sizes in pkg (#714) 2025-06-28 08:00:23 -05:00
Sebastian Raschka
0a2e8c39c4
Qwen3 KV cache (#688) 2025-06-21 17:34:39 -05:00
Sebastian Raschka
e719bd86ad
Qwen3 From Scratch (#678)
* Qwen3 From Scratch

* rev other file

* upd

* upd

* upd

* url fixes
2025-06-19 18:44:38 -05:00