Sebastian Raschka | 27fa95d24b | Fix qk_norm comment (#769) | 2025-08-15 08:38:48 -05:00 |
Sebastian Raschka | 71ef67be46 | Qwen3 Coder Flash & MoE from Scratch (#760) | 2025-08-01 19:13:17 -05:00 |
    * Qwen3 Coder Flash & MoE from Scratch
    * update
    * refinements
    * updates
    * update
    * update
    * update
Sebastian Raschka | d23b1f07b8 | Add more sophisticated Qwen3 tokenizer (#729) | 2025-07-09 13:16:26 -05:00 |
Sebastian Raschka | 30645a6d64 | Handle other Qwen3 tokenizer settings (#716) | 2025-06-30 17:49:51 -05:00 |
Sebastian Raschka | dc2f8e95d4 | Support different Qwen3 sizes in pkg (#714) | 2025-06-28 08:00:23 -05:00 |
Sebastian Raschka | 0b15a00574 | Qwen3 KV cache (#688) | 2025-06-21 17:34:39 -05:00 |
Sebastian Raschka | 3d4bce6d57 | Qwen3 From Scratch (#678) | 2025-06-19 18:44:38 -05:00 |
    * Qwen3 From Scratch
    * rev other file
    * upd
    * upd
    * upd
    * url fixes