Gaja-v2.00-dpo / README.md
metadata
language:
  - en
  - hi
license: llama2
library_name: transformers
tags:
  - hindi
  - english
  - Bilingual
datasets:
  - sarvamai/samvaad-hi-v1
  - aaditya/orca_dpo_pairs-Hindi_
pipeline_tag: text-generation

Model

🐘 Gaja

Gaja is a Hindi/Hinglish chat model built on SarvamAI's OpenHathi base model and further fine-tuned for conversational interactions.

Additional Information

  • It outperforms Airavata, AI4Bharat's chat model, on the Hugging Face Open LLM benchmark suite.
  • It was fine-tuned on only 1k samples.

💬 Prompt template

<|im_start|>user
{}<|im_end|> 
<|im_start|>assistant
{}<|im_end|> 
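
The template above can be filled in programmatically. The following is a minimal sketch in plain Python, not the author's own code: the helper names `format_turn` and `build_prompt` are hypothetical, and it assumes each turn ends with `<|im_end|>` followed by a newline and that generation should start right after an open `assistant` header. Only the `<|im_start|>` and `<|im_end|>` markers and the `user`/`assistant` roles come from the template itself.

```python
# Markers taken from the prompt template; everything else here is illustrative.
IM_START = "<|im_start|>"
IM_END = "<|im_end|>"


def format_turn(role: str, content: str) -> str:
    """Render one completed turn: header line, content, end marker."""
    return f"{IM_START}{role}\n{content}{IM_END}\n"


def build_prompt(messages: list[dict[str, str]]) -> str:
    """Join prior turns and leave an open assistant turn for the model to fill.

    `messages` is a list of {"role": ..., "content": ...} dicts
    (an assumed input shape, not something the model card specifies).
    """
    history = "".join(format_turn(m["role"], m["content"]) for m in messages)
    return history + f"{IM_START}assistant\n"


if __name__ == "__main__":
    prompt = build_prompt([{"role": "user", "content": "Aap kaise hain?"}])
    print(prompt)
```

The resulting string would be passed to the tokenizer as-is. Whether the model expects exactly this whitespace around the markers is an assumption, so it is worth comparing against the tokenizer's own chat template if one is shipped with the model.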

😎 Features:

  • Language Support: Gaja is designed to understand and generate responses in both Hindi and Hinglish, catering to a diverse range of users.
  • Base Model: Built upon SarvamAI's OpenHathi model, Gaja inherits its foundational capabilities while being optimized for conversational tasks.
  • Fine-tuning: Gaja has undergone fine-tuning specifically for chat-based interactions, enhancing its ability to engage in meaningful conversations with users.
  • Experimental Platform: With its flexibility and adaptability, Gaja serves as a valuable platform for conducting experiments and exploring innovative approaches to chatbot development.