File size: 585 Bytes
3c8e76e
a425565
 
 
9265b5f
a425565
 
 
 
 
 
 
c2447c0
a425565
c2447c0
a425565
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
Model	Human Metric	Auto Metric	Identify (Binary Accuracy)
Humans	95	92	
Ground-truth Caption β†’ Llama-2-7b (Oracle)		71	
Ground-truth Caption β†’ GPT3 (Oracle)	68	70	74
Ground-truth Caption β†’ Llama-2-13b (Oracle)		70	
Ground-truth Caption β†’ GPT4 (Oracle)		69	
Predicted Caption β†’ GPT3	33	36	59
Predicted Caption β†’ Llama-2-7b		36	
Predicted Caption β†’ Llama-2-13b		36	
Predicted Caption β†’ GPT4		36	
InstructBLIP 		31	
LLaVA 		31	
BLIP2 FlanT5-XXL (Fine-tuned)	27	27	73
mPLUG-Owl 		24	
BLIP2 FlanT5-XL (Fine-tuned)	15	18	60
BLIP2 FlanT5-XXL (Zero-shot)	0	12	50