CerebrumTech
/

cere-llama-3-8b-tr

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

oguzhandoganoglu commited on Jun 12

Commit

ac03e55

•

1 Parent(s): a92a309

Update README.md

Files changed (1) hide show

README.md +6 -6

README.md CHANGED Viewed

@@ -3,16 +3,16 @@ license: llama3
 language:
 - tr
 ---
-CERE-LLMA-3-8b-TR
 This model is an fine-tuned version of a Llama3 8b Large Language Model (LLM) for Turkish. It was trained on a high quality Turkish instruction sets created from various open-source and internal resources. Turkish Instruction dataset carefully annotated to carry out Turkish instructions in an accurate and organized manner.
-Model Details
-Base Model: Llama3 8B based LLM
-Training Dataset: High Quality Turkish instruction sets
 [Open LLM Turkish Leaderboard v0.2 Evaluation Results]

 language:
 - tr
 ---
+# CERE-LLMA-3-8b-TR
 This model is an fine-tuned version of a Llama3 8b Large Language Model (LLM) for Turkish. It was trained on a high quality Turkish instruction sets created from various open-source and internal resources. Turkish Instruction dataset carefully annotated to carry out Turkish instructions in an accurate and organized manner.
+## Model Details
+- **Base Model**: LLMA 3 7B based LLM
+- **Tokenizer Extension**: Specifically extended for Turkish
+- **Training Dataset**: Cleaned Turkish raw data with 5 billion tokens, custom Turkish instruction sets
+- **Training Method**: Initially with DORA, followed by fine-tuning with LORA
 [Open LLM Turkish Leaderboard v0.2 Evaluation Results]