704 Commits

Author SHA1 Message Date
TITC
6cbe6520a2
47,678-->48,725 (#281) 2024-07-22 21:24:57 -05:00
rasbt
46fcde15d8
use utf-8 encoding in json 2024-07-22 07:01:54 -05:00
Sebastian Raschka
fa56c80402
Simplify embedding vs linear layer code (#278) 2024-07-21 12:21:10 -05:00
Sebastian Raschka
bbe09e9799
Update README.md 2024-07-21 10:42:28 -05:00
Sebastian Raschka
0d28079a8a
Update README.md 2024-07-21 10:41:53 -05:00
Sebastian Raschka
8d02cb1cee
Add download help message (#274) 2024-07-19 08:29:29 -05:00
Sebastian Raschka
eb6ca4311f
Updated test badges 2024-07-18 20:26:51 -05:00
rasbt
be14f06442
explain extra padding token 2024-07-17 07:38:25 -05:00
Thanh Tran
070a69fc8b
fix typos & inconsistent texts (#269)
Co-authored-by: TRAN <you@example.com>
2024-07-17 07:34:51 -05:00
rasbt
a33e89c12c
fixes bold font #267 2024-07-16 17:51:15 -05:00
Sebastian Raschka
45d0318ae6
Gh workflow for spelling errors (#265)
* Gh workflow for spelling errors

* add files to check
2024-07-16 08:17:20 -05:00
Jeroen Van Goey
48bd72c890
fix typos, add codespell pre-commit hook (#264)
* fix typos, add codespell pre-commit hook

* Update .pre-commit-config.yaml

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-07-16 07:07:04 -05:00
rasbt
6ffd628bb6
add missing "be" to figure 2024-07-15 08:06:05 -05:00
rasbt
7e632f90eb
clarify libraries 2024-07-14 08:53:16 -05:00
Sebastian Raschka
e0148f6ecc
break long lines in ch07 (#263) 2024-07-13 07:26:23 -05:00
Sebastian Raschka
17ff4c6328
Add download instructions 2024-07-13 06:31:14 -05:00
Sebastian Raschka
4f0a107692
show how to use the finetuned model 2024-07-09 06:43:26 -07:00
Sebastian Raschka
e2918af036
Update README.md 2024-07-06 08:26:50 -05:00
Daniel Kleine
88186bf64a
minor: removed redundant imports (#260)
* removed duplicated imports

* removed empty cell
2024-07-05 15:33:19 -05:00
rasbt
b92dea8bc6
update decode method 2024-07-05 08:34:27 -05:00
Suman Debnath
2cdcf68598
fixing the regular expression used in the SimpleTokenizer (#259)
* fixing the regular expression used in the SimpleTokenizer class and a typo in the 2.7 Creating token embedding introduction section

* rerun

---------

Co-authored-by: rasbt <mail@sebastianraschka.com>
2024-07-04 12:27:27 -05:00
Sebastian Raschka
b02ccb5f6f
Update README.md 2024-07-03 20:28:04 -05:00
Sebastian Raschka
e7ab84d98c
Update README.md 2024-07-03 20:27:54 -05:00
Daniel Kleine
90b25ece3d
fixed spelling typos (#258) 2024-07-03 07:47:33 -05:00
Sebastian Raschka
78b783f6fd
Update README.md 2024-07-03 05:56:46 -05:00
rasbt
921e91a05f
use correct chapter reference 2024-07-02 17:29:57 -05:00
rasbt
0988996eb8
update figures 2024-07-02 17:12:42 -05:00
rasbt
f5a4f9dee3
add spearman and kendall-tau analysis 2024-07-02 07:55:32 -05:00
Sebastian Raschka
f6bcdd37bd
Fix links in summary sections (#254) 2024-06-29 07:51:31 -05:00
rasbt
31806828d0
add links to summary sections 2024-06-29 07:33:26 -05:00
rasbt
219f45f808
refresh cross entropy figure 2024-06-29 07:22:23 -05:00
rasbt
c7f892550e
add clarification about :num_tokens 2024-06-29 07:16:42 -05:00
rasbt
796f0e2a30
add clarifying note about GELU 2024-06-29 07:14:36 -05:00
rasbt
ab23ca5b1b
force refresh figure 2024-06-29 07:01:37 -05:00
rasbt
6a8acf5135
remove redundant plus sign 2024-06-29 06:59:36 -05:00
Daniel Kleine
1e69c8e0b5
fixed minor issues (#252)
* fixed typo

* fixed var name in md text
2024-06-29 06:38:25 -05:00
Sebastian Raschka
e296e8f6be
Use deterministic ollama settings (#250)
* deterministic ollama settings

* add missing file
2024-06-27 07:16:48 -05:00
Daniel Kleine
1db1999951
minor fixes (#248)
* removed duplicated numpy req

* labels to array

* autopep8 NBs
2024-06-26 08:52:51 -05:00
Sebastian Raschka
7e78b52a30
remove redundant code lines (#247) 2024-06-25 21:44:19 -05:00
rasbt
7095e84fab
update with latest versions 2024-06-25 21:09:27 -05:00
Daniel Kleine
81c843bdc0
minor fixes (#246)
* removed duplicated white spaces

* Update ch07/01_main-chapter-code/ch07.ipynb

* Update ch07/05_dataset-generation/llama3-ollama.ipynb

* removed duplicated white spaces

* fixed title again

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-06-25 17:30:30 -05:00
Sebastian Raschka
9a9b3530c9
Update title 2024-06-25 10:01:12 -05:00
Sebastian Raschka
dbcdc7593b
Generating a synthetic dataset for instruction finetuning (#245)
* Generating a synthetic dataset for instruction finetuning

* fix link
2024-06-23 21:48:25 -05:00
rasbt
0734570b28
refresh figures 2024-06-23 10:41:29 -05:00
Sebastian Raschka
4c69df6070
removes redundent figsize code (#244) 2024-06-23 09:24:07 -05:00
Sebastian Raschka
f78ad1f95b
Update README.md 2024-06-23 08:25:01 -05:00
Sebastian Raschka
cf0df54d7d
Show epochs as integers on x-axis (#241)
* Show epochs as integers on x-axis

* Update ch07/01_main-chapter-code/previous_chapters.py

* remove extra s

* modify exercise plots

* update chapter 7 plot

* resave ch07 for better file diff
2024-06-23 07:41:25 -05:00
Daniel Kleine
51ca343e83
fixed typo (#243)
* fixed copy-paste typo

* Update ch07/01_main-chapter-code/ch07.ipynb

* Update ch07/01_main-chapter-code/ch07.ipynb

* Update ch07/01_main-chapter-code/ch07.ipynb

* Update ch07/01_main-chapter-code/ch07.ipynb

---------

Co-authored-by: Sebastian Raschka <mail@sebastianraschka.com>
2024-06-23 07:11:04 -05:00
Daniel Kleine
3ac363d005
updated .gitignore for ch07/01 artefacts (#242)
* fixed markdown

* removed redundant imports

* updated .gitignore for ch07/01 artefacts
2024-06-22 18:12:01 -05:00
rasbt
523a3b9bb4
spelling fix 2024-06-22 12:22:21 -05:00