{ "cells": [ { "cell_type": "markdown", "id": "c024bfa4-1a7a-4751-b5a1-827225a3478b", "metadata": { "id": "c024bfa4-1a7a-4751-b5a1-827225a3478b" }, "source": [ "
\n",
"\n",
"Supplementary code for the Build a Large Language Model From Scratch book by Sebastian Raschka \n", " Code repository: https://github.com/rasbt/LLMs-from-scratch\n", "\n", " | \n",
"\n",
"![]() | \n",
"
\n", " | Label | \n", "Text | \n", "
---|---|---|
0 | \n", "ham | \n", "Go until jurong point, crazy.. Available only ... | \n", "
1 | \n", "ham | \n", "Ok lar... Joking wif u oni... | \n", "
2 | \n", "spam | \n", "Free entry in 2 a wkly comp to win FA Cup fina... | \n", "
3 | \n", "ham | \n", "U dun say so early hor... U c already then say... | \n", "
4 | \n", "ham | \n", "Nah I don't think he goes to usf, he lives aro... | \n", "
... | \n", "... | \n", "... | \n", "
5567 | \n", "spam | \n", "This is the 2nd time we have tried 2 contact u... | \n", "
5568 | \n", "ham | \n", "Will ü b going to esplanade fr home? | \n", "
5569 | \n", "ham | \n", "Pity, * was in mood for that. So...any other s... | \n", "
5570 | \n", "ham | \n", "The guy did some bitching but I acted like i'd... | \n", "
5571 | \n", "ham | \n", "Rofl. Its true to its name | \n", "
5572 rows × 2 columns
\n", "\n", " | Label | \n", "Text | \n", "
---|---|---|
4307 | \n", "0 | \n", "Awww dat is sweet! We can think of something t... | \n", "
4138 | \n", "0 | \n", "Just got to <#> | \n", "
4831 | \n", "0 | \n", "The word \"Checkmate\" in chess comes from the P... | \n", "
4461 | \n", "0 | \n", "This is wishing you a great day. Moji told me ... | \n", "
5440 | \n", "0 | \n", "Thank you. do you generally date the brothas? | \n", "
... | \n", "... | \n", "... | \n", "
5537 | \n", "1 | \n", "Want explicit SEX in 30 secs? Ring 02073162414... | \n", "
5540 | \n", "1 | \n", "ASKED 3MOBILE IF 0870 CHATLINES INCLU IN FREE ... | \n", "
5547 | \n", "1 | \n", "Had your contract mobile 11 Mnths? Latest Moto... | \n", "
5566 | \n", "1 | \n", "REMINDER FROM O2: To get 2.50 pounds free call... | \n", "
5567 | \n", "1 | \n", "This is the 2nd time we have tried 2 contact u... | \n", "
1494 rows × 2 columns
\n", "