LLaMA-8b fine-tuned for 1 epoch on grimulkan/LimaRP-augmented via Unsloth on Colab, using the llama-chat template.
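The README doesn't show the data-prep step, so below is a minimal sketch of how the llama-chat template is typically applied in an Unsloth Colab run. The base checkpoint name, `max_seq_length` value, LoRA hyperparameters, and the ShareGPT-style `conversations` schema assumed for grimulkan/LimaRP-augmented are illustrative guesses, not details taken from this repo.

```python
from unsloth import FastLanguageModel
from unsloth.chat_templates import get_chat_template
from datasets import load_dataset

max_seq_length = 4096  # assumed; the README does not state the context length used

# Load the base model in 4-bit (checkpoint name is illustrative, not confirmed by the README).
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name = "unsloth/llama-3-8b-bnb-4bit",
    max_seq_length = max_seq_length,
    load_in_4bit = True,
)

# Attach LoRA adapters (hyperparameters assumed; needed to train a 4-bit base model).
model = FastLanguageModel.get_peft_model(
    model,
    r = 16,
    lora_alpha = 16,
    target_modules = ["q_proj", "k_proj", "v_proj", "o_proj",
                      "gate_proj", "up_proj", "down_proj"],
)

# Patch the tokenizer with the llama-chat prompt format; the mapping converts
# ShareGPT-style {"from": ..., "value": ...} turns into role/content messages.
tokenizer = get_chat_template(
    tokenizer,
    chat_template = "llama",
    mapping = {"role": "from", "content": "value", "user": "human", "assistant": "gpt"},
)

# Render each conversation into the single "text" field the trainer reads.
def formatting_prompts_func(examples):
    texts = [
        tokenizer.apply_chat_template(convo, tokenize = False, add_generation_prompt = False)
        for convo in examples["conversations"]  # field name assumed
    ]
    return {"text": texts}

dataset = load_dataset("grimulkan/LimaRP-augmented", split = "train")
dataset = dataset.map(formatting_prompts_func, batched = True)
```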
Trainer configuration for the run:

```python
import torch
from transformers import TrainingArguments
from trl import SFTTrainer

trainer = SFTTrainer(
    model = model,
    tokenizer = tokenizer,
    train_dataset = dataset,
    dataset_text_field = "text",
    max_seq_length = max_seq_length,
    dataset_num_proc = 2,
    packing = False,  # Packing can make training ~5x faster for short sequences; left off here.
    args = TrainingArguments(
        per_device_train_batch_size = 2,
        gradient_accumulation_steps = 4,  # effective batch size of 8
        warmup_steps = 5,
        num_train_epochs = 1,
        learning_rate = 2e-4,
        fp16 = not torch.cuda.is_bf16_supported(),  # fall back to fp16 on pre-Ampere GPUs
        bf16 = torch.cuda.is_bf16_supported(),
        logging_steps = 1,
        optim = "adamw_8bit",
        weight_decay = 0.01,
        lr_scheduler_type = "linear",
        seed = 3407,
        output_dir = "outputs",
    ),
)
```
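With the trainer configured, the single epoch is launched with the standard TRL call:

```python
trainer_stats = trainer.train()
```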