mmaaz60 commited on
Commit
1b89a97
1 Parent(s): 050e059

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -2
README.md CHANGED
@@ -8,5 +8,7 @@ library_name: transformers
8
  pipeline_tag: visual-question-answering
9
  ---
10
 
11
- + Video-ChatGPT is a large vision-language model with a visual-encoder and large language model (LLM), enabling video understanding and conversation about videos.
12
- + A simple and scalable multimodal design on top of pretrained visual and language encoders that adapts only a linear projection layer for multimodal alignment.
 
 
 
8
  pipeline_tag: visual-question-answering
9
  ---
10
 
11
+ Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos.
12
+ It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation.
13
+
14
+ **GitHub:** [https://github.com/mbzuai-oryx/Video-ChatGPT](https://github.com/mbzuai-oryx/Video-ChatGPT)