mirror of
https://github.com/rasbt/LLMs-from-scratch.git
synced 2025-10-05 04:58:00 +00:00
Mention small discrepancy due to Dropout non-reproducibility in PyTorch (#519)
* Mention small discrepancy due to Dropout non-reproducibility in PyTorch * bump pytorch version
This commit is contained in:
parent
bd8f7522cb
commit
68e2efe1c9
@ -1,4 +1,4 @@
|
|||||||
name: Test PyTorch 2.0 and 2.5
|
name: Test PyTorch 2.0 and 2.6
|
||||||
|
|
||||||
on:
|
on:
|
||||||
push:
|
push:
|
||||||
@ -23,7 +23,7 @@ jobs:
|
|||||||
runs-on: ubuntu-latest
|
runs-on: ubuntu-latest
|
||||||
strategy:
|
strategy:
|
||||||
matrix:
|
matrix:
|
||||||
pytorch-version: [ 2.0.1, 2.5.0 ]
|
pytorch-version: [ 2.0.1, 2.6.0 ]
|
||||||
|
|
||||||
steps:
|
steps:
|
||||||
- uses: actions/checkout@v4
|
- uses: actions/checkout@v4
|
||||||
|
@ -317,6 +317,7 @@
|
|||||||
"```\n",
|
"```\n",
|
||||||
"\n",
|
"\n",
|
||||||
"- Since these are just random numbers, this is not a reason for concern, and you can proceed with the remainder of the chapter without issues\n",
|
"- Since these are just random numbers, this is not a reason for concern, and you can proceed with the remainder of the chapter without issues\n",
|
||||||
|
"- One possible reason for this discrepancy is the differing behavior of `nn.Dropout` across operating systems, depending on how PyTorch was compiled, as discussed [here on the PyTorch issue tracker](https://github.com/pytorch/pytorch/issues/121595)\n",
|
||||||
"\n",
|
"\n",
|
||||||
"---"
|
"---"
|
||||||
]
|
]
|
||||||
|
@ -1348,6 +1348,16 @@
|
|||||||
"# print(f\"Training completed in {execution_time_minutes:.2f} minutes.\")"
|
"# print(f\"Training completed in {execution_time_minutes:.2f} minutes.\")"
|
||||||
]
|
]
|
||||||
},
|
},
|
||||||
|
{
|
||||||
|
"cell_type": "markdown",
|
||||||
|
"id": "2e8b86f0-b07d-40d7-b9d3-a9218917f204",
|
||||||
|
"metadata": {},
|
||||||
|
"source": [
|
||||||
|
"- Note that you might get slightly different loss values on your computer, which is not a reason for concern if they are roughly similar (a training loss below 1 and a validation loss below 7)\n",
|
||||||
|
"- Small differences can often be due to different GPU hardware and CUDA versions or small changes in newer PyTorch versions\n",
|
||||||
|
"- Even if you are running the example on a CPU, you may observe slight differences; a possible reason for a discrepancy is the differing behavior of `nn.Dropout` across operating systems, depending on how PyTorch was compiled, as discussed [here on the PyTorch issue tracker](https://github.com/pytorch/pytorch/issues/121595)"
|
||||||
|
]
|
||||||
|
},
|
||||||
{
|
{
|
||||||
"cell_type": "code",
|
"cell_type": "code",
|
||||||
"execution_count": 28,
|
"execution_count": 28,
|
||||||
|
Loading…
x
Reference in New Issue
Block a user