Commit 6bbf3e4
Parent(s): b74a303
Update README.md
README.md CHANGED
````diff
@@ -49,23 +49,21 @@ To enhance output quality and thematic consistency, custom stopping strings incl
 - "\n"
 
 ## Training Hyperparameters and Fine-Tuning Details:
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-train_loss: 1.7511341453808817
-```
+- micro_batch_size: 1
+- batch_size: 0
+- epochs: 1
+- learning_rate: "2e-5"
+- lr_scheduler_type: "linear"
+- lora_rank: 8
+- lora_alpha: 16
+- lora_dropout: 0.05
+- cutoff_len: 256
+- warmup_steps: 8
+- optimizer: "adamw_torch"
+- grad_accumulation: 1
+- train_runtime: 1697.081 seconds
+- total_flos: 1.3663655883177984e+16
+- train_loss: 1.7511341453808817
 
 ## Testing and Evaluation:
 During the testing phase, we conducted a series of evaluations to compare Llama3-Pirate-Talk-8b-v0.1 against the base Llama3 model. These tests involved complex navigational and general knowledge questions designed to assess the model's ability to maintain its thematic integrity while responding accurately to technically demanding prompts. The model demonstrated a strong thematic presence with consistent use of pirate vernacular. However, it showed limitations in handling high-precision technical content, which is an expected trade-off given its thematic specialization. These insights have been instrumental in identifying areas for further model refinement.
````
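As a quick sanity check on the hyperparameters this commit adds, the sketch below collects the listed bullets into a plain Python dict and derives the average training throughput implied by `total_flos` and `train_runtime`. The dict keys mirror the README bullets; the throughput figure is our own derivation and is not stated anywhere in the commit.

```python
# Fine-tuning settings as listed in the committed README (verbatim values).
config = {
    "micro_batch_size": 1,
    "batch_size": 0,
    "epochs": 1,
    "learning_rate": 2e-5,
    "lr_scheduler_type": "linear",
    "lora_rank": 8,
    "lora_alpha": 16,
    "lora_dropout": 0.05,
    "cutoff_len": 256,
    "warmup_steps": 8,
    "optimizer": "adamw_torch",
    "grad_accumulation": 1,
    "train_runtime_s": 1697.081,
    "total_flos": 1.3663655883177984e16,
    "train_loss": 1.7511341453808817,
}

# Average training throughput in FLOP/s implied by the logged totals
# (derived here for illustration; not part of the original README).
throughput = config["total_flos"] / config["train_runtime_s"]
print(f"average throughput: {throughput / 1e12:.2f} TFLOP/s")
```

Running this prints roughly 8.05 TFLOP/s of sustained training compute over the ~28-minute run.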