mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2025-10-24 06:20:19 +00:00
clarify overfitting
This commit is contained in:
parent
ad200a4f3f
commit
dadd0f7ea3
@ -2043,7 +2043,7 @@
|
|||||||
"metadata": {},
|
"metadata": {},
|
||||||
"source": [
|
"source": [
|
||||||
"- We can see that the training and test set performances are practically identical\n",
|
"- We can see that the training and test set performances are practically identical\n",
|
||||||
"- However, based on the slightly lower test set performance, we can see that the model overfits the training data to a very small degree\n",
|
"- However, based on the slightly lower test set performance, we can see that the model overfits the training data to a very small degree, as well as the validation data that has been used for tweaking some of the hyperparameters, such as the learning rate\n",
|
||||||
"- This is normal, however, and this gap could potentially be further reduced by increasing the model's dropout rate (`drop_rate`) or the `weight_decay` in the optimizer setting"
|
"- This is normal, however, and this gap could potentially be further reduced by increasing the model's dropout rate (`drop_rate`) or the `weight_decay` in the optimizer setting"
|
||||||
]
|
]
|
||||||
},
|
},
|
||||||
@ -2265,7 +2265,7 @@
|
|||||||
"name": "python",
|
"name": "python",
|
||||||
"nbconvert_exporter": "python",
|
"nbconvert_exporter": "python",
|
||||||
"pygments_lexer": "ipython3",
|
"pygments_lexer": "ipython3",
|
||||||
"version": "3.10.12"
|
"version": "3.10.6"
|
||||||
}
|
}
|
||||||
},
|
},
|
||||||
"nbformat": 4,
|
"nbformat": 4,
|
||||||
|
Loading…
x
Reference in New Issue
Block a user