innocent-charles commited on
Commit
5ab56ef
1 Parent(s): 9da0b83

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -81
README.md CHANGED
@@ -1,81 +0,0 @@
1
- ---
2
- language:
3
- - multilingual
4
- - en
5
- - sw
6
- - ha
7
- - yo
8
- - ig
9
- - zu
10
- - sn
11
- - ar
12
- - am
13
- - fr
14
- - pt
15
- tags:
16
- - zero-shot-image-classification
17
- - image generation
18
- - visual qa
19
- - text-image embedding
20
- - image-text embedding
21
- - pytorch
22
- - sartify
23
- - visual conversional ai
24
- - image semantic retrival
25
- - african raw resourced languages
26
- - safetensors
27
- - vision-text-dual-encoder
28
- license: apache-2.0
29
- library_name: transformers
30
- ---
31
-
32
- # AViLaMa : African Vision-Languages Aligment Pre-Training Model.
33
- Learning Visual Concepts Directly From African Languages Supervision. [Paper is coming]()
34
-
35
- ## Model Details
36
- AViLaMa is the large open-source text-vision alignment pre-training model in African languages. It brings a way to learn visual concepts directly from African languages supervision. Inspired from OpenAI CLIP, but with more based on African languages to capture the nuances, cultural context, and social aspect use of our languages that are so impossible to get just from machine translation. It includes techniques like agnostic languages encoding, data filtering network etc... All for more than 12 African languages, trained on the #AViLaDa-2B datasets of filtered image-text pairs.
37
-
38
- - **Developed by :** Sartify LLC (www.sartify.com)
39
- - **Authors :** Innocent Charles, Zephania Reuben
40
- - **Funded by :** Sartify LLC, Open Source Community, etc..(We always welcome other donors)
41
- - **Model type :** multilingual & multimodality transformer
42
- - **Language(s) :** en (English), sw (Swahili), ha (Hausa), yo (Yoruba), ig (Igbo), zu (Zulu), sn (Shona), ar (Arabic), am (Amharic), fr (French), pt (Portuguese)
43
- - **License:** apache 2.0
44
-
45
- ## Load model from hugging face.
46
- ```python
47
- import torch
48
- from transformers import AutoModel, AutoTokenizer
49
-
50
- model = AutoModel.from_pretrained("sartifyllc/AViLaMa")
51
- tokenizer = AutoTokenizer.from_pretrained("sartifyllc/AViLaMa")
52
- model = model.eval()
53
- ```
54
- ## Model Sources
55
- - **Repository :** [AViLaMa-Sources](https://github.com/Sartify/AViLaMa-Sources)
56
- - **Datasets :** Coming...
57
- - **Paper :** Coming...
58
- - **Demo :** Coming...
59
-
60
- ## Direct & Downstream Use In African Languages:
61
- 1. zero shot semantic image retrieval and ranking tasks.
62
- 4. zero shot image classification tasks.
63
- 7. visual QA tasks with African languages.
64
- 8. visual conversional GenAI tasks.
65
- 9. image capturing tasks.
66
- 10. images and art generation guiding and conditioning tasks.
67
- 11. text-images analysis tasks.
68
- 12. content moderation task etc....
69
-
70
- ## Citation
71
-
72
- **BibTeX:**
73
- ```bibtex
74
- AViLaMa paper
75
- @article{sartifyllc2023africanvision,
76
- title={AViLaMa: Learning Visual Concepts Directly From African Languages Supervision},
77
- author={Innocent Charles, Zephania Reuben},
78
- journal={To be inserted},
79
- year={2023}
80
- }
81
- ```