metrics:
- accuracy
pipeline_tag: image-text-to-text
---

We use the powerful [TinyLLaVA Factory](https://github.com/TinyLLaVA/TinyLLaVA_Factory) to create a very small image-text-to-text model with only 296M parameters.

The goal is to make it possible to run LLaVA models on edge devices with only a few gigabytes of memory.

For the LLM and vision tower, we choose [OpenELM-270M-Instruct](https://huggingface.co/apple/OpenELM-270M-Instruct) and [facebook/dinov2-small](https://huggingface.co/facebook/dinov2-small), respectively.
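As a rough sanity check on the parameter budget, the 296M total approximately decomposes into the two components plus a small multimodal connector. The component sizes below are approximations taken from the component model names, and the connector size is inferred here, not an official figure:

```python
# Rough parameter-budget sanity check (all figures approximate).
llm_params = 270_000_000      # apple/OpenELM-270M-Instruct (language model)
vision_params = 22_000_000    # facebook/dinov2-small (vision tower, ~22M)
total_params = 296_000_000    # total reported for this model

# Whatever is left over is roughly the multimodal connector.
connector_params = total_params - llm_params - vision_params
print(f"multimodal connector: ~{connector_params / 1_000_000:.0f}M parameters")
# prints: multimodal connector: ~4M parameters
```

This illustrates why the model stays small: nearly all of the budget sits in the two pretrained backbones, with only a few million parameters connecting them.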