AdamG012/Llama-2-13b-deepspeed-visualchat

AdamG012 committed on Nov 10, 2023

Commit 127f7fa, 1 parent: 0a9e3b3

Update README.md

Files changed (1)

README.md +3 -0

@@ -15,6 +15,9 @@ datasets:
 # Llama-2-13b-deepspeed-visualchat
+> [!NOTE]
+> ATTENTION: this encoder needs QwenCLIP model
 DeepSpeed-VisualChat is a scalable, efficient, and user-friendly multi-modal training pipeline that leverages a novel multi-modal causal attention mechanism for better alignment of visual and text features. It uses data blending techniques to address the scarcity of interleaved text-and-image inputs in datasets.