pranavajay
commited on
Commit
•
056adc2
1
Parent(s):
1588b29
Update README.md
Browse files
README.md
CHANGED
@@ -1,39 +1,68 @@
|
|
1 |
---
|
2 |
-
|
3 |
-
|
4 |
-
|
|
|
5 |
tags:
|
6 |
-
-
|
7 |
-
|
8 |
-
|
|
|
9 |
---
|
10 |
-
# merge
|
11 |
|
12 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
13 |
|
14 |
-
|
15 |
-
|
|
|
|
|
|
|
16 |
|
17 |
-
|
18 |
|
19 |
-
|
20 |
|
21 |
-
|
22 |
-
|
23 |
|
24 |
-
|
|
|
|
|
25 |
|
26 |
-
|
|
|
|
|
|
|
27 |
|
28 |
-
|
29 |
-
|
30 |
-
|
31 |
-
|
32 |
-
|
33 |
-
|
34 |
-
|
35 |
-
|
36 |
-
|
37 |
-
dtype: float16
|
38 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
39 |
```
|
|
|
|
|
|
1 |
---
|
2 |
+
license: apache-2.0
|
3 |
+
pipeline_tag: text-generation
|
4 |
+
language:
|
5 |
+
- hi
|
6 |
tags:
|
7 |
+
- pretrained
|
8 |
+
inference:
|
9 |
+
parameters:
|
10 |
+
temperature: 0.7
|
11 |
---
|
|
|
12 |
|
13 |
+
# pranavajay/hindi-8b
|
14 |
+
|
15 |
+
## Overview
|
16 |
+
|
17 |
+
🌟 Introducing the "pranavajay/hindi-8b" model, a text generator language model specifically fine-tuned for Hindi language conversational tasks. This model is designed to engage in fluent and contextually relevant conversations, making it an ideal tool for chatbots, language translation, and more.
|
18 |
+
|
19 |
+
## Model Details
|
20 |
|
21 |
+
- **Base Model**: A robust pre-trained language model, adapted for Hindi.
|
22 |
+
- **Fine-Tuning**: The model has been further fine-tuned on a diverse dataset of Hindi conversations to enhance its conversational capabilities.
|
23 |
+
- **Language Support**: Primarily Hindi, with the ability to understand and respond in contextually appropriate English when necessary.
|
24 |
+
- **Model Size**: 10.2B parameters for rich and nuanced responses.
|
25 |
+
- **Usage**: Ideal for building interactive AI applications that require natural language understanding and generation in Hindi.
|
26 |
|
27 |
+
## Quick Start
|
28 |
|
29 |
+
To use the "pranavajay/hindi-8b" model with the Hugging Face Transformers library, follow these steps:
|
30 |
|
31 |
+
```python
|
32 |
+
from transformers import AutoTokenizer, AutoModelForCausalLM
|
33 |
|
34 |
+
# Load the tokenizer and model
|
35 |
+
tokenizer = AutoTokenizer.from_pretrained("pranavajay/hindi-8b")
|
36 |
+
model = AutoModelForCausalLM.from_pretrained("pranavajay/hindi-8b")
|
37 |
|
38 |
+
# Example conversation
|
39 |
+
input_text = "हमारे देश का नवाब कौन है?"
|
40 |
+
input_ids = tokenizer.encode(input_text, return_tensors="pt")
|
41 |
+
output = model.generate(input_ids, max_length=50, num_return_sequences=1)
|
42 |
|
43 |
+
# Decode the response
|
44 |
+
response = tokenizer.decode(output[0], skip_special_tokens=True)
|
45 |
+
print(response)
|
46 |
+
```
|
47 |
+
Training
|
48 |
+
The "pranavajay/hindi-8b" model was trained using a combination of supervised learning and reinforcement learning techniques. The training dataset consisted of a wide range of Hindi conversations, including dialogues from various domains such as customer service, education, and entertainment.
|
49 |
+
|
50 |
+
License
|
51 |
+
The "pranavajay/hindi-8b" model is released under the Apache License 2.0. Please review the license for details regarding the use, modification, and distribution of the model.
|
|
|
52 |
|
53 |
+
Citation
|
54 |
+
If you use the "pranavajay/hindi-8b" model in your research or applications, please cite it as follows:
|
55 |
+
|
56 |
+
bibtex
|
57 |
+
```
|
58 |
+
@misc{hindi-8b,
|
59 |
+
author = {Pranav Ajay},
|
60 |
+
title = {Hindi Chat 8B Model},
|
61 |
+
year = {2024},
|
62 |
+
publisher = {Hugging Face},
|
63 |
+
journal = {Hugging Face Model Repository},
|
64 |
+
howpublished = {\url{https://huggingface.co/pranavajay/hindi-chat-8b}}
|
65 |
+
}
|
66 |
```
|
67 |
+
Feedback and Support
|
68 |
+
For any questions, feedback, or support regarding the "pranavajay/hindi-chat-8b" model, please contact us at pranavajay74@gmail.com
|