KangarooGroup/kangaroo

WEBing committed on Jul 19

Commit 0a3336d • 1 Parent(s): 1f1db3a

update example

Files changed (1)

README.md +42 -2


@@ -7,11 +7,50 @@ pipeline_tag: visual-question-answering
 ---
 # Kangaroo: A Powerful Video-Language Model Supporting Long-context Video Input
-## Release
+# Release
 - [2024/07/17] 🔥 **Kangaroo** has been released. We release [blog](https://kangaroogroup.github.io/Kangaroo.github.io/) and [model](https://huggingface.co/KangarooGroup/kangaroo). Please check out the blog for details.
+# Get Started with the Model
+```python
+import torch
+from transformers import AutoTokenizer, AutoModelForCausalLM
+tokenizer = AutoTokenizer.from_pretrained("/path/to/kangaroo")
+model = AutoModelForCausalLM.from_pretrained(
+    "/path/to/kangaroo",
+    torch_dtype=torch.bfloat16,
+    trust_remote_code=True,
+)
+model = model.to("cuda")
+terminators = [tokenizer.eos_token_id, tokenizer.convert_tokens_to_ids("<|eot_id|>")]
+video_path = "path/to/video"
+query = "Please describe this video"
+out, history = model.chat(video_path=video_path,
+                          query=query,
+                          tokenizer=tokenizer,
+                          max_new_tokens=512,
+                          eos_token_id=terminators,
+                          do_sample=True,
+                          temperature=0.6,
+                          top_p=0.9,)
+print(out)
+query = "What happened at the end of the video?"
+out, history = model.chat(video_path=video_path,
+                          query=query,
+                          history=history,
+                          tokenizer=tokenizer,
+                          max_new_tokens=512,
+                          eos_token_id=terminators,
+                          do_sample=True,
+                          temperature=0.6,
+                          top_p=0.9,)
+print(out)
+```
-## Citation
+# Citation
 If you find it useful for your research, please cite related papers/blogs using this BibTeX:
 ```bibtex
@@ -23,3 +62,4 @@ If you find it useful for your research, please cite related papers/blogs using
 	month={July},
 	year={2024}
 }
+```
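
Note on the added example: the two `model.chat` calls differ only in the query and in passing `history` on the second turn. A minimal sketch of folding that pattern into a loop follows. `run_conversation` is a hypothetical helper, not part of the repository, and it assumes `model.chat` takes the keyword arguments shown in the diff and returns an `(answer, history)` pair; this has not been checked against the model's remote code.

```python
def run_conversation(model, tokenizer, video_path, queries, **gen_kwargs):
    """Ask several questions about one video, carrying history between turns.

    Hypothetical helper. Assumes model.chat(...) accepts video_path, query,
    tokenizer, an optional history, and generation kwargs, and returns
    (answer, history), as in the README example in this commit.
    """
    history = None
    answers = []
    for query in queries:
        call_kwargs = dict(
            video_path=video_path,
            query=query,
            tokenizer=tokenizer,
            **gen_kwargs,
        )
        # The first README call omits `history`, so pass it only once a
        # previous turn has produced one.
        if history is not None:
            call_kwargs["history"] = history
        out, history = model.chat(**call_kwargs)
        answers.append(out)
    return answers, history
```

With the objects from the README example, this might be called as `run_conversation(model, tokenizer, video_path, ["Please describe this video", "What happened at the end of the video?"], max_new_tokens=512, eos_token_id=terminators, do_sample=True, temperature=0.6, top_p=0.9)`.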