SNUMPR
/

vlm_rlaif_video_llava_7b

Text Generation

Inference Endpoints

Model card Files Files and versions Community

SNUMPR commited on Jun 28

Commit

d497325

•

1 Parent(s): 2067236

Create README.md

Files changed (1) hide show

README.md +9 -0

README.md ADDED Viewed

	@@ -0,0 +1,9 @@

+# VLM-RLAIF: Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback
+## Model Summary
+This Hub repository contains a HuggingFace's `transformers` implementation of VLM-RLAIF model of SNUMPR lab.
+| Model   | Model size | Model Description |
+| ------- | ------------- |   ------------- |
+| VLM-RLAIF-7b [[HF]](https://huggingface.co/SNUMPR/vlm_rlaif_video_llava_7b) | 7B | RLAIF model |