File size: 525 Bytes
d497325 3b33e89 d497325 3b33e89 |
1 2 3 4 5 6 7 8 9 10 11 12 |
# VLM-RLAIF: Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
## Model Summary
This Hub repository contains a HuggingFace's `transformers` implementation of VLM-RLAIF model of SNUMPR lab.
* VLM-RLAIF-7b [[HF]](https://huggingface.co/SNUMPR/vlm_rlaif_video_llava_7b): 7B RLAIF model
<!-- | Model | Model size | Model Description |
| ------- | ------------- | ------------- |
| VLM-RLAIF-7b [[HF]](https://huggingface.co/SNUMPR/vlm_rlaif_video_llava_7b) | 7B | RLAIF model
-->
|