Sebastian Raschka
|
701090815e
|
Add backup URL for gpt2 weights (#469)
* Add backup URL for gpt2 weights
* newline
|
2025-01-05 11:28:09 -06:00 |
|
rasbt
|
9de277421e
|
consistent header for appendix E
|
2024-08-30 08:08:01 +02:00 |
|
Sebastian Raschka
|
01cb137bfd
|
Note about MPS devices (#329)
|
2024-08-19 20:58:45 -05:00 |
|
Sebastian Raschka
|
8d02cb1cee
|
Add download help message (#274)
|
2024-07-19 08:29:29 -05:00 |
|
Daniel Kleine
|
dcbdc1d2e5
|
fixes for code (#206)
* updated .gitignore
* removed unused GELU import
* fixed model_configs, fixed all tensors on same device
* removed unused tiktoken
* update
* update hparam search
* remove redundant tokenizer argument
---------
Co-authored-by: rasbt <mail@sebastianraschka.com>
|
2024-06-11 20:59:48 -05:00 |
|
rasbt
|
1b1fd21d64
|
fix typo in comment
|
2024-06-09 06:14:02 -05:00 |
|
Sebastian Raschka
|
72a073bbbf
|
Remove leftover instances of self.tokenizer (#201)
* Remove leftover instances of self.tokenizer
* add endoftext token
|
2024-06-08 14:57:34 -05:00 |
|
rasbt
|
f5c4e0778f
|
remove redundant import
|
2024-06-04 07:11:12 -05:00 |
|
rasbt
|
f3a2e93160
|
100x -> 50x
|
2024-05-19 20:26:53 -05:00 |
|
rasbt
|
0d48725b5c
|
use macbook version
|
2024-05-19 20:19:02 -05:00 |
|
rasbt
|
c2028871e4
|
update lora init
|
2024-05-19 20:11:56 -05:00 |
|
Daniel Kleine
|
7e3638649e
|
fixed var name
|
2024-05-17 13:58:07 +00:00 |
|
Daniel Kleine
|
9664e2a252
|
added missing word
|
2024-05-15 06:33:25 +00:00 |
|
rasbt
|
209f63943b
|
10 -> 5
|
2024-05-12 10:42:25 -05:00 |
|
rasbt
|
2e47a6e61c
|
update dataset naming
|
2024-05-12 09:22:42 -05:00 |
|
rasbt
|
55c3a91838
|
rename download_and_unzip to make it more specific
|
2024-05-12 08:36:24 -05:00 |
|
rasbt
|
24e9110fa8
|
make spam spelling consistent
|
2024-05-08 06:48:28 -05:00 |
|
rasbt
|
68c33a64e6
|
fix sentence
|
2024-05-07 07:05:47 -05:00 |
|
rasbt
|
d18f92fa34
|
add additional lora figure
|
2024-05-07 07:04:35 -05:00 |
|
rasbt
|
c93c90eb1e
|
spelling fix
|
2024-05-07 06:46:33 -05:00 |
|
rasbt
|
e99c511721
|
spelling and consistency improvements
|
2024-05-06 21:02:13 -05:00 |
|
rasbt
|
6f486460bc
|
ouput -> output
|
2024-05-05 12:21:10 -05:00 |
|
Sebastian Raschka
|
978ef48ccc
|
Appendix E: Parameter-efficient Finetuning with LoRA (#142)
|
2024-05-05 12:05:17 -05:00 |
|