oguzhandoganoglu commited on
Commit
ac03e55
1 Parent(s): a92a309

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -6
README.md CHANGED
@@ -3,16 +3,16 @@ license: llama3
3
  language:
4
  - tr
5
  ---
6
- CERE-LLMA-3-8b-TR
7
 
8
  This model is an fine-tuned version of a Llama3 8b Large Language Model (LLM) for Turkish. It was trained on a high quality Turkish instruction sets created from various open-source and internal resources. Turkish Instruction dataset carefully annotated to carry out Turkish instructions in an accurate and organized manner.
9
 
 
10
 
11
- Model Details
12
-
13
- Base Model: Llama3 8B based LLM
14
- Training Dataset: High Quality Turkish instruction sets
15
-
16
 
17
  [Open LLM Turkish Leaderboard v0.2 Evaluation Results]
18
 
 
3
  language:
4
  - tr
5
  ---
6
+ # CERE-LLMA-3-8b-TR
7
 
8
  This model is an fine-tuned version of a Llama3 8b Large Language Model (LLM) for Turkish. It was trained on a high quality Turkish instruction sets created from various open-source and internal resources. Turkish Instruction dataset carefully annotated to carry out Turkish instructions in an accurate and organized manner.
9
 
10
+ ## Model Details
11
 
12
+ - **Base Model**: LLMA 3 7B based LLM
13
+ - **Tokenizer Extension**: Specifically extended for Turkish
14
+ - **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
15
+ - **Training Method**: Initially with DORA, followed by fine-tuning with LORA
 
16
 
17
  [Open LLM Turkish Leaderboard v0.2 Evaluation Results]
18