File size: 1,423 Bytes
d296320
75c50ec
 
 
 
d296320
75c50ec
 
 
 
 
 
1ff094a
75c50ec
d296320
00fc1c1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
---
language:
- en
- hi
license: llama2
library_name: transformers
tags:
- hindi
- 'english '
- Bilingual
datasets:
- sarvamai/samvaad-hi-v1
- aaditya/orca_dpo_pairs-Hindi_
pipeline_tag: text-generation
---
# Model 

# 🐘 Gaja
Gaja is a Hindi/Hinglish chat model, initially trained on SarvamAI's OpenHathi model and further fine-tuned for conversational interactions. 
<img src="https://cdn-uploads.huggingface.co/production/uploads/6487239cca30096ea9f52115/sLgskU1h2Tih_bJo0ktYR.jpeg" width="600" alt="Image">

# Additional Information
- It outperforms Airavata, AI4Bharat's chat version, on Huggingface OpenLLM benchmark suite.
- It was fine-tuned on only 1k samples 


# 💬 Prompt template 

```
<|im_start|>user
{}<|im_end|> 
<|im_start|>assistant
{}<|im_end|> 
```

# 😎 Features:
- Language Support: Gaja is designed to understand and generate responses in both Hindi and Hinglish, catering to a diverse range of users.
- Base Model: Built upon SarvamAI's OpenHathi model, Gaja inherits its foundational capabilities while being optimized for conversational tasks.
- Fine-tuning: Gaja has undergone fine-tuning specifically for chat-based interactions, enhancing its ability to engage in meaningful conversations with users.
- Experimental Platform: With its flexibility and adaptability, Gaja serves as a valuable platform for conducting experiments and exploring innovative approaches to chatbot development.