Update README.md
Browse files
README.md
CHANGED
@@ -6,7 +6,7 @@ license: apache-2.0
|
|
6 |
|
7 |
This is are model weights originally provided by the authors of the paper [Text-Only Training for Image Captioning using Noise-Injected CLIP](https://arxiv.org/pdf/2211.00575.pdf).
|
8 |
|
9 |
-
|
10 |
|
11 |
In their words:
|
12 |
*Specifically, we assume that the visual embedding corresponding to a text embedding
|
|
|
6 |
|
7 |
This is are model weights originally provided by the authors of the paper [Text-Only Training for Image Captioning using Noise-Injected CLIP](https://arxiv.org/pdf/2211.00575.pdf).
|
8 |
|
9 |
+
Their method aims to train CLIP with only text samples. Therefore they are injecting zero-mean Gaussian Noise with a standard-deviation(STD) of into the text embeddings before decoding.
|
10 |
|
11 |
In their words:
|
12 |
*Specifically, we assume that the visual embedding corresponding to a text embedding
|