AhmedSSabir commited on
Commit
ee67b3c
1 Parent(s): 4699d07

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -3
README.md CHANGED
@@ -1,12 +1,16 @@
1
 
2
  # Visual semantic with BERT-CNN
3
- To take advantage of the overlapping between the visual context and the caption, and to extract global information from each visual, we use BERT as an embedding layer followed by a shallow CNN (tri-gram kernel) (Kim, 2014).
 
 
 
 
4
 
 
5
  For datasets that are less than 100K please have look at our [shallow model](https://github.com/ahmedssabir/Semantic-Relatedness-Based-Reranker-for-Text-Spotting)
6
 
7
 
8
- This model can be used to assign an object-to-caption relatedness score, which is valuable for
9
- (1) caption diverse re-ranking, and (2) generate soft labels for caption filtering when scraping text-to-captions from the internet.
10
 
11
  The model is trained with a strict filter of 0.4 similarity distance thresholds between the object and its related caption.
12
 
 
1
 
2
  # Visual semantic with BERT-CNN
3
+
4
+ This model can be used to assign an object-to-caption relatedness score, which is valuable for
5
+ (1) caption diverse re-ranking, and (2) generate soft labels for caption filtering when scraping text-to-captions from the internet.
6
+
7
+ To take advantage of the overlapping between the visual context and the caption, and to extract global information from each visual (i.e., object, scene, etc) we use BERT as an embedding layer followed by a shallow CNN (tri-gram kernel) (Kim, 2014).
8
 
9
+
10
  For datasets that are less than 100K please have look at our [shallow model](https://github.com/ahmedssabir/Semantic-Relatedness-Based-Reranker-for-Text-Spotting)
11
 
12
 
13
+
 
14
 
15
  The model is trained with a strict filter of 0.4 similarity distance thresholds between the object and its related caption.
16