esfrankel17
/

llama3_8b_baseline_instructskillmix

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

esfrankel17 commited on Oct 31, 2024

Commit

993a4ae

·

verified ·

1 Parent(s): c2a1984

Model save

Files changed (1) hide show

README.md +6 -6

README.md CHANGED Viewed

@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
 # llama3_8b_baseline_instructskillmix
-This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on the PrincetonPLI/Instruct-SkillMix-SDD dataset.
 It achieves the following results on the evaluation set:
-- Loss: nan
 ## Model description
@@ -50,19 +50,19 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_ratio: 0.1
 - lr_scheduler_warmup_steps: 1738
-- num_epochs: 3.0
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| No log        | 0.5333 | 1    | nan             |
-| No log        | 1.6    | 3    | nan             |
 ### Framework versions
 - Transformers 4.45.2
-- Pytorch 2.5.0+cu124
 - Datasets 2.21.0
 - Tokenizers 0.20.1

 # llama3_8b_baseline_instructskillmix
+This model is a fine-tuned version of [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.7085
 ## Model description
 - lr_scheduler_type: constant
 - lr_scheduler_warmup_ratio: 0.1
 - lr_scheduler_warmup_steps: 1738
+- num_epochs: 3
 ### Training results
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| No log        | 0.5333 | 1    | 1.8346          |
+| No log        | 1.6    | 3    | 1.7085          |
 ### Framework versions
 - Transformers 4.45.2
+- Pytorch 2.4.0+cu121
 - Datasets 2.21.0
 - Tokenizers 0.20.1