sashakunitsyn committed on
Commit
67c80a5
1 Parent(s): 8f90d90

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -12,7 +12,7 @@ base_model: Salesforce/blip2-opt-2.7b
 ---
 # VLRM
 This repository contains the weights of BLIP-2 OPT-2.7B model fine-tuned by reinforcement learning method introduced in the paper [VLRM: Vision-Language Models act as
-Reward Models for Image Captioning (on submission ATM)](https://arxiv.org/submit/5511483/view).
+Reward Models for Image Captioning](https://arxiv.org/abs/2404.01911).
 
 The RL-tuned model is able to generate longer and more comprehensive descriptions with zero computational overhead compared to the original model.
 