Post
868
π llava-calm2-siglip
CyberAgent Inc. has announced the public release of "llava-calm2-siglip," a 7.5 billion parameter Vision Language Model (VLM) for Japanese, available for commercial use. This model, trained primarily on a high-quality Japanese dataset, is accessible on Hugging Face Hub under an Apache-2.0 license. The advancement aims to improve Japanese language-specific VLMs, which are fewer compared to English-centric models.
Model URL:
cyberagent/llava-calm2-siglip
Demo URL:
cyberagent/llava-calm2-preview
Detailed press release (in Japanese): https://www.cyberagent.co.jp/news/detail/id=30344
CyberAgent Inc. has announced the public release of "llava-calm2-siglip," a 7.5 billion parameter Vision Language Model (VLM) for Japanese, available for commercial use. This model, trained primarily on a high-quality Japanese dataset, is accessible on Hugging Face Hub under an Apache-2.0 license. The advancement aims to improve Japanese language-specific VLMs, which are fewer compared to English-centric models.
Model URL:
cyberagent/llava-calm2-siglip
Demo URL:
cyberagent/llava-calm2-preview
Detailed press release (in Japanese): https://www.cyberagent.co.jp/news/detail/id=30344