Update README.md
README.md
CHANGED
@@ -12,7 +12,7 @@ This network uses the optimal transport flow matching objective outlined [Flow M
 xATGLU Layers are used instead of linears for entry into the transformer MLP layer [Expanded Gating Ranges
 Improve Activation Functions](https://arxiv.org/pdf/2405.20768)
 
-```python train.py``` will train a new image network on the provided dataset
+```python train.py``` will train a new image network on the provided dataset (Currently the dataset is being fully rammed into GPU and is defined in the preload_dataset function)
 
 ```python test_sample.py step_1799.safetensors``` Where step_1799.safetensors is the desired model to test inference on. This will always generate a sample grid of 16x16 images.
 
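Since the updated README line says the whole dataset is loaded onto the GPU, a dataset swapped in through `preload_dataset` has to fit in GPU memory alongside the model and optimizer state. The following is a rough back-of-envelope sketch for checking that, not code from the repository: the helper names, the image sizes, the 24 GiB card, and the 50% reserve for training are all hypothetical assumptions.

```python
def dataset_bytes(num_images, channels, height, width, bytes_per_value=4):
    """Bytes needed for the raw image tensor, ignoring labels and allocator overhead."""
    return num_images * channels * height * width * bytes_per_value


def fits_on_gpu(required_bytes, gpu_bytes, reserve_fraction=0.5):
    """True if the data fits in the share of GPU memory not reserved for training."""
    budget = gpu_bytes * (1.0 - reserve_fraction)
    return required_bytes <= budget


# Hypothetical example: 50,000 RGB images at 64x64, stored as float32.
need = dataset_bytes(50_000, 3, 64, 64)
print(f"{need / 2**30:.2f} GiB needed")
print("fits on a 24 GiB card:", fits_on_gpu(need, gpu_bytes=24 * 2**30))
```

Storing the images as `uint8` (`bytes_per_value=1`) and converting to float per batch would cut the estimate by a factor of four, which may matter for larger datasets.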