# VLM-RLAIF: Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback

## Model Summary

This Hub repository contains a Hugging Face `transformers` implementation of the VLM-RLAIF model from the SNUMPR lab.

* VLM-RLAIF-7b [[HF]](https://huggingface.co/SNUMPR/vlm_rlaif_video_llava_7b): 7B RLAIF model
<!-- | Model   | Model size | Model Description | 
| ------- | ------------- |   ------------- |  
| VLM-RLAIF-7b [[HF]](https://huggingface.co/SNUMPR/vlm_rlaif_video_llava_7b) | 7B | RLAIF model
 -->
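
As a minimal sketch of how the checkpoint listed above might be fetched locally, the snippet below derives the repository id from the model URL and downloads the files with `huggingface_hub.snapshot_download`. It assumes the `huggingface_hub` package is installed; the helper names `repo_id_from_url` and `download_checkpoint` are hypothetical and not part of this repository. It only downloads the files and does not show how to load the model for inference, since this README does not specify the model class.

```python
from urllib.parse import urlparse

# URL of the checkpoint as listed in this README.
MODEL_URL = "https://huggingface.co/SNUMPR/vlm_rlaif_video_llava_7b"


def repo_id_from_url(url: str) -> str:
    """Return the `owner/name` repo id from a Hub model URL (hypothetical helper)."""
    return urlparse(url).path.strip("/")


def download_checkpoint(url: str = MODEL_URL) -> str:
    """Download all files of the repo and return the local directory (hypothetical helper)."""
    # Imported lazily so repo_id_from_url works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(repo_id=repo_id_from_url(url))


# Example usage (downloads several GB of weights):
# local_dir = download_checkpoint()
```

Downloading the snapshot first keeps the step independent of whichever loading code is used afterwards.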