From a31d571625d48a5262dc4dacca263d3b7f00f51c Mon Sep 17 00:00:00 2001 From: rasbt Date: Wed, 8 May 2024 08:14:03 -0500 Subject: [PATCH] text -> dataset --- ch06/01_main-chapter-code/ch06.ipynb | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/ch06/01_main-chapter-code/ch06.ipynb b/ch06/01_main-chapter-code/ch06.ipynb index 3d0ddb3..e1e0a76 100644 --- a/ch06/01_main-chapter-code/ch06.ipynb +++ b/ch06/01_main-chapter-code/ch06.ipynb @@ -519,7 +519,7 @@ " - 1. truncate all messages to the length of the shortest message in the dataset or batch\n", " - 2. pad all messages to the length of the longest message in the dataset or batch\n", "\n", - "- We choose option 2 and pad all messages to the longest message in the text\n", + "- We choose option 2 and pad all messages to the longest message in the dataset\n", "- For that, we use `<|endoftext|>` as a padding token, as discussed in chapter 2" ] },