mirror of https://github.com/rasbt/LLMs-from-scratch.git (synced 2025-11-14 17:13:39 +00:00)
add clarifying note about GELU
commit 1ffb7500e4
parent 1e61943bf2
@@ -667,7 +667,7 @@
    "metadata": {},
    "source": [
     "- As we can see, ReLU is a piecewise linear function that outputs the input directly if it is positive; otherwise, it outputs zero\n",
-    "- GELU is a smooth, non-linear function that approximates ReLU but with a non-zero gradient for negative values\n",
+    "- GELU is a smooth, non-linear function that approximates ReLU but with a non-zero gradient for negative values (except at approximately -0.75)\n",
     "\n",
     "- Next, let's implement the small neural network module, `FeedForward`, that we will be using in the LLM's transformer block later:"
    ]
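The commit's one substantive change is the parenthetical note that GELU's gradient is non-zero for negative inputs except near -0.75. A minimal sketch (not part of the commit; it assumes PyTorch's torch.nn.functional.gelu, which the notebook uses throughout) can verify this numerically: the derivative of the exact GELU crosses zero at the function's local minimum, around x = -0.7518.

import torch

# Evaluate the exact (erf-based) GELU on a grid of negative inputs
x = torch.linspace(-2.0, 0.0, 2001, requires_grad=True)
y = torch.nn.functional.gelu(x)

# Backpropagate to obtain d(GELU)/dx at every grid point
y.sum().backward()

# The gradient is closest to zero at GELU's local minimum
i = x.grad.abs().argmin()
print(f"gradient ~ 0 at x = {x[i].item():.4f}")  # prints a value near -0.75

This is why the clarification matters: unlike ReLU, whose gradient is exactly zero for all negative inputs, GELU's gradient vanishes at only this single point, so negative activations still receive gradient signal almost everywhere.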