helixx999
/

gemma-2-9b-bnb-absa_v2

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

helixx999 commited on Aug 23, 2024

Commit

8783b54

·

verified ·

1 Parent(s): faef20e

Update README.md

Files changed (1) hide show

README.md +3 -12

README.md CHANGED Viewed

@@ -19,19 +19,11 @@ tags:
 This is gemma2 trained on semeval restaurant data 2014 using unsloth framework.
-`trainer = SFTTrainer(
-    model = model,
-    tokenizer = tokenizer,
-    train_dataset = dataset,
-    dataset_text_field = "text_new",
-    max_seq_length = max_seq_length,
-    dataset_num_proc = 2,
-    packing = False, # Can make training 5x faster for short sequences.
-    args = TrainingArguments(
         per_device_train_batch_size = 2,
         gradient_accumulation_steps = 4,
         warmup_steps = 6, #Previous 5
-        #num_train_epochs = 1, # Set this for 1 full training run.
         max_steps = 60,
         #learning_rate = 2e-4,
         learning_rate = 1e-4,
@@ -44,7 +36,6 @@ This is gemma2 trained on semeval restaurant data 2014 using unsloth framework.
         seed = 3407,
         output_dir = "./tensorLog",
         report_to="wandb"
-    ),
-)`
 [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 This is gemma2 trained on semeval restaurant data 2014 using unsloth framework.
+Training Parameters:
         per_device_train_batch_size = 2,
         gradient_accumulation_steps = 4,
         warmup_steps = 6, #Previous 5
         max_steps = 60,
         #learning_rate = 2e-4,
         learning_rate = 1e-4,
         seed = 3407,
         output_dir = "./tensorLog",
         report_to="wandb"
 [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)