VLM-RLAIF: Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback

Model Summary

This Hub repository contains a Hugging Face transformers implementation of the VLM-RLAIF model from the SNUMPR lab.

  • VLM-RLAIF-7b [HF]: 7B RLAIF model