VLM-RLAIF: Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
Model Summary
This Hub repository contains a HuggingFace's transformers
implementation of VLM-RLAIF model of SNUMPR lab.
- VLM-RLAIF-7b [HF]: 7B RLAIF model