---
license: bsd-3-clause
---

# E.T. Chat

[arXiv](https://arxiv.org/abs/2409.18111) | [Project Page](https://polyu-chenlab.github.io/etbench) | [GitHub](https://github.com/PolyU-ChenLab/ETBench)

E.T. Chat is a time-sensitive Video-LLM that reformulates timestamp prediction as an embedding matching problem, serving as a strong baseline on E.T. Bench. It consists of a visual encoder, a frame compressor, and an LLM. A special token \<vid\> is introduced to trigger frame embedding matching for timestamp prediction.
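The embedding-matching idea can be sketched as follows. This is a minimal illustration under our own assumptions, not the actual E.T. Chat implementation: the function name, cosine-similarity scoring, and frame-rate conversion are all hypothetical, and the real model produces the \<vid\> embedding with its LLM rather than taking it as input.

```python
import numpy as np

def match_timestamp(vid_embedding, frame_embeddings, fps=1.0):
    """Predict a timestamp by matching a <vid> token embedding
    against per-frame embeddings (hypothetical sketch).

    vid_embedding:    (d,) embedding produced at the <vid> token
    frame_embeddings: (T, d) compressed embeddings for T frames
    fps:              frames per second of the sampled frames
    """
    # Cosine similarity between the <vid> embedding and every frame
    v = vid_embedding / np.linalg.norm(vid_embedding)
    f = frame_embeddings / np.linalg.norm(frame_embeddings, axis=1, keepdims=True)
    scores = f @ v  # shape (T,)

    # The best-matching frame index is converted to seconds
    best_frame = int(np.argmax(scores))
    return best_frame / fps

# Toy example: frame 2 is constructed to be most similar to <vid>
rng = np.random.default_rng(0)
frames = rng.normal(size=(8, 16))
vid = frames[2] + 0.01 * rng.normal(size=16)
print(match_timestamp(vid, frames, fps=1.0))  # prints 2.0
```

Framing timestamps as matching against frame embeddings, instead of regressing or generating numbers as text, keeps the prediction grounded in positions the model has actually seen.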
## 🔖 Model Details

### Model Description

- **Developed by:** Ye Liu
- **Model type:** Multi-modal Large Language Model
- **Language(s):** English
- **License:** BSD-3-Clause

### Training Data

The stage-3 checkpoint of E.T. Chat was trained on the [ET-Instruct-164K](https://huggingface.co/datasets/PolyU-ChenLab/ET-Instruct-164K) dataset.

### More Details

Please refer to our [GitHub repository](https://github.com/PolyU-ChenLab/ETBench) for more details about this model.

## 📖 Citation

Please kindly cite our paper if you find this project helpful.

```bibtex
@inproceedings{liu2024etbench,
  title={E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding},
  author={Liu, Ye and Ma, Zongyang and Qi, Zhongang and Wu, Yang and Chen, Chang Wen and Shan, Ying},
  booktitle={Neural Information Processing Systems (NeurIPS)},
  year={2024}
}
```