Update README.md
Browse files
README.md
CHANGED
@@ -10,13 +10,10 @@ architecture based on Torchvision's Resnet152 default implementation
|
|
10 |
|
11 |
hyperparameters:
|
12 |
|
13 |
-
|
14 |
-
-
|
15 |
-
-
|
16 |
-
-
|
17 |
-
-
|
18 |
-
|
19 |
-
-
|
20 |
-
- num_epochs = 16 (higher would be even better, but maybe by <1%)
|
21 |
-
- crossmax_k = 2 (difference between crossmax_k=2 and crossmax_k=3 is about 1-2%, so it's not a big deal)
|
22 |
-
```
|
|
|
10 |
|
11 |
hyperparameters:
|
12 |
|
13 |
+
- criterion: `torch.nn.CrossEntropyLoss()`
|
14 |
+
- optimizer: `torch.optim.AdamW`
|
15 |
+
- scaler: `GradScaler`
|
16 |
+
- datasets: `["cifar10", "cirfar100"]`
|
17 |
+
- lr: `0.0001`
|
18 |
+
- num_epochs: `16` (higher would be even better, but maybe by <1%)
|
19 |
+
- crossmax_k: `2` (difference between `crossmax_k=2` and `crossmax_k=3` is about 1-2%, so it's not a big deal)
|
|
|
|
|
|