ydshieh HF staff commited on
Commit
d12ecc2
1 Parent(s): 3ea1ff8

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -4,4 +4,4 @@ Part of the [Huggingface JAX/Flax event](https://discuss.huggingface.co/t/open-t
4
 
5
  The GPT2 model source code is modified so it can accept an encoder's output.
6
  The pretained weights of both models are loaded, with a set of randomly initialized cross-attention weigths.
7
- The model is trained on 65000 images from the COCO dataset for about 1500 steps, with the original english cpationis are translated to french for training purpose.
 
4
 
5
  The GPT2 model source code is modified so it can accept an encoder's output.
6
  The pretained weights of both models are loaded, with a set of randomly initialized cross-attention weigths.
7
+ The model is trained on 65000 images from the COCO dataset for about 1500 steps (batch\_size=256), with the original english cpationis being translated to french for training purpose.