tarekziade commited on
Commit
3dccd4d
1 Parent(s): 9b9dca5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +21 -3
README.md CHANGED
@@ -11,10 +11,28 @@ widget:
11
  - src: https://huggingface.co/datasets/mishig/sample_images/resolve/main/airport.jpg
12
  example_title: Airport
13
  base_model:
14
- - distilbert/distilgpt2
15
  - google/vit-base-patch16-224-in21k
16
  ---
17
 
18
- Variation of https://huggingface.co/tarekziade/distilvit
19
 
20
- Further fine-tuned on Flickr
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
11
  - src: https://huggingface.co/datasets/mishig/sample_images/resolve/main/airport.jpg
12
  example_title: Airport
13
  base_model:
 
14
  - google/vit-base-patch16-224-in21k
15
  ---
16
 
17
+ This model is a work in progress.
18
 
19
+ You can find the code used to create the model here: https://github.com/mozilla/distilvit
20
+
21
+ Results after after 3 epochs (and ~45 hours of training)
22
+
23
+ - eval_loss: 0.19939416646957397
24
+ - eval_rouge1: 43.006
25
+ - eval_rouge2: 16.9939
26
+ - eval_rougeL: 38.8923
27
+ - eval_rougeLsum: 38.8877
28
+ - eval_gen_len: 11.327256736227712
29
+ - eval_runtime: 1816.5255
30
+ - eval_samples_per_second: 13.77
31
+ - eval_steps_per_second': 1.721
32
+ - train_runtime: 46263.3695
33
+ - train_samples_per_second: 38.373
34
+ - train_steps_per_second: 4.797
35
+ - train_loss: 0.05974134062104816
36
+
37
+
38
+