YanweiLi/llama-vid-13b-pretrain-336
Tags: Text Generation, Transformers, llava, vision-language model, llama, video understanding, Inference Endpoints, arxiv:2311.17043
Branch: main
1 contributor
History: 3 commits
Latest commit: Create README.md (da09ec4), by YanweiLi, 11 months ago
File | Scan | Size | Last commit | Updated
.gitattributes | Safe | 1.52 kB | initial commit | 11 months ago
README.md | Safe | 1.52 kB | Create README.md | 11 months ago
config.json | Safe | 1.23 kB | Upload 3 files | 11 months ago
mm_projector.bin | Safe (pickle) | 459 MB (LFS) | Upload 3 files | 11 months ago
trainer_state.json | Safe | 263 kB | Upload 3 files | 11 months ago

Detected pickle imports in mm_projector.bin (3): "torch._utils._rebuild_tensor_v2", "torch.BFloat16Storage", "collections.OrderedDict"
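
The pickle imports reported for mm_projector.bin are the module-level names a pickle stream references when it is loaded. A minimal sketch of listing such names without unpickling anything is shown below. It assumes the stream was written with pickle protocol 2, which records imports as GLOBAL opcodes; `list_pickle_imports` is a hypothetical helper for illustration, not part of the Hub or of PyTorch, and it is not claimed to be how the Hub's scanner works. Applying it to a real checkpoint would additionally require extracting the pickle stream from the file, which is not shown here.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> list[str]:
    """Return sorted "module.name" globals referenced by a pickle stream.

    Hypothetical helper: it only walks the opcodes and never unpickles.
    Assumption: imports appear as GLOBAL opcodes (pickle protocol <= 3).
    """
    found = set()
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the GLOBAL argument as "module name".
            module, name = arg.split(" ", 1)
            found.add(f"{module}.{name}")
    return sorted(found)


# Small stand-in payload; a real checkpoint would also reference torch names.
payload = pickle.dumps(
    collections.OrderedDict(weight=1.0), protocol=2, fix_imports=False
)
print(list_pickle_imports(payload))
```

Streams written with protocol 4 or later use STACK_GLOBAL instead, which this sketch deliberately does not handle.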