SakanaAI/Llama-3-EvoVLM-JP-v2
Tags: Image-to-Text, Transformers, Safetensors, Japanese, llava, pretraining, multimodal, vision-language, mantis, llama3, siglip, Inference Endpoints
arxiv: 2403.13187
License: llama3
Branch: main
1 contributor
History: 10 commits
Latest commit: Inoichan, "Delete the repository", b34a669 (verified), 13 days ago
File                               Size       LFS  Last commit                            Updated
.gitattributes                     1.52 kB    -    initial commit                         16 days ago
README.md                          5.6 kB     -    Delete the repository                  13 days ago
config.json                        1.16 kB    -    Upload LlavaForConditionalGeneration   16 days ago
generation_config.json             147 Bytes  -    Upload LlavaForConditionalGeneration   16 days ago
model-00001-of-00004.safetensors   4.89 GB    LFS  Upload LlavaForConditionalGeneration   16 days ago
model-00002-of-00004.safetensors   5 GB       LFS  Upload LlavaForConditionalGeneration   16 days ago
model-00003-of-00004.safetensors   4.92 GB    LFS  Upload LlavaForConditionalGeneration   16 days ago
model-00004-of-00004.safetensors   2.16 GB    LFS  Upload LlavaForConditionalGeneration   16 days ago
model.safetensors.index.json       76.1 kB    -    Upload LlavaForConditionalGeneration   16 days ago