File size: 943 Bytes
04eb460 beefb8e c024a5b 0b66d3a 5ff2646 beefb8e 5ff2646 beefb8e 6d611d1 beefb8e |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 |
---
license: apache-2.0
---
# AskVideos-7B-Instruct-v0.1
## Model details
**Model type:**
AskVideos-7B-Instruct-v0.1 is an open-source chatbot trained by fine-tuning a Video-LLaMA variant on additional video Q&A data.
It uses a frozen Vicuna 7B v1.1 LLM to answer Video-Text queries and a frozen BLIP style image encoder.
A video feature is derived from the encoded image using a video-QFormer and the result is projected onto the LLM space.
**Github repo for demo:**
https://github.com/AskYoutubeAI/AskVideos-Instruct
**Acknowledgement**
This model is based on Video-LLaMA. Check out the original work here: https://github.com/DAMO-NLP-SG/Video-LLaMA
## License
AskVideos-7B-Instruct-v0.1 code and models are distributed under the Apache License 2.0.
## Training dataset
- Finetuned with 50K video synthetic Q&A pairs mined from videos.
- For each Q&A pair, 16 frames are sampled over a 30s video.
- Finetuned on Video-LLaAMA Vicuna 7B. |