selfmaker
/

image_caption

Model card Files Files and versions Community

selfmaker commited on Jan 27

Commit

7461ae2

·

verified ·

1 Parent(s): 74fa892

Update README.md

Files changed (1) hide show

README.md +12 -1

README.md CHANGED Viewed

@@ -5,4 +5,15 @@ license: cc-by-nc-nd-4.0
 ## Introduction
 This model is defined as proposed in the book "mastering pytorch".
-It is based on CNN-encoder and a LSTM-decoder.

 ## Introduction
 This model is defined as proposed in the book "mastering pytorch".
+It is based on CNN-encoder and a LSTM-decoder.
+The CNN-encoder is based on a pretrained RESNET-152. The last layer of the resnet is replaced by a vector embedding layer of 256 elements.
+The LSTM-decoder use an input of 256, a hidden layer of 512, and uses the vocabulary size.
+The model has been trained as a pure learning exercise, and so the model performances remain relatively mean.
+## Training procedure
+For the sake of the exercise, the model has been trained for only 5 epochs.
+It has been trained on the COCO dataset.