oguzhandoganoglu
committed on
Commit ac03e55 • 1 Parent(s): a92a309
Update README.md
README.md
CHANGED
@@ -3,16 +3,16 @@ license: llama3
 language:
 - tr
 ---
-CERE-LLMA-3-8b-TR
+# CERE-LLMA-3-8b-TR
 
 This model is a fine-tuned version of a Llama3 8b Large Language Model (LLM) for Turkish. It was trained on high-quality Turkish instruction sets created from various open-source and internal resources. The Turkish instruction dataset was carefully annotated to carry out Turkish instructions in an accurate and organized manner.
 
+## Model Details
 
-Model
-
-
-Training
-
+- **Base Model**: Llama 3 8B-based LLM
+- **Tokenizer Extension**: Specifically extended for Turkish
+- **Training Dataset**: 5 billion tokens of cleaned raw Turkish data, plus custom Turkish instruction sets
+- **Training Method**: Initially trained with DoRA, followed by fine-tuning with LoRA
 
 [Open LLM Turkish Leaderboard v0.2 Evaluation Results]
 