iamplus/bloomz-7b1-v3


manojpreveen committed on Mar 21, 2023

Commit 4f1cef7 · 1 Parent(s): 56460db

Create README.md

Files changed (1)

README.md +25 -0

README.md ADDED

	@@ -0,0 +1,25 @@

+---
+license: bigscience-openrail-m
+datasets:
+- manojpreveen/Instruction_Tuning
+---
+Bloomz-7B1 model instruction-tuned on a ChatGPT dataset (85k examples) using ***Colossal AI***
+**Base Model:** bigscience/bloomz-7b1
+**Training Details :**
+* Epochs: 5
+* Batch Size : 32 per device x 1 gradient accumulation step x 8 GPUs = 256 (effective)
+* Max Length : 512
+* Weight Decay : 0
+* Learning Rate : 2e-5
+* Learning Rate Scheduler Type : Cosine
+* Number of warmup steps : 0
+* Machine : 8xA100 80GB
+**Dataset Details :**
+Dataset : manojpreveen/Instruction_Tuning
+Files :
+* chat_gpt_v1.csv
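
The batch-size arithmetic and the cosine schedule listed under Training Details can be sketched in plain Python. This is a minimal illustration, not the training code behind this commit. Several things here are assumptions: the dataset size of exactly 85,000 examples (the README says only "85k"), rounding the last partial batch up to a full step, a decay to a final learning rate of 0, and the hypothetical helper names `effective_batch_size` and `cosine_lr`.

```python
import math

# Values taken from the Training Details list in the README.
PER_DEVICE_BATCH = 32
GRAD_ACCUM_STEPS = 1
NUM_GPUS = 8
EPOCHS = 5
PEAK_LR = 2e-5
WARMUP_STEPS = 0

# Assumption: the README says "85k"; the exact count is not given.
NUM_EXAMPLES = 85_000


def effective_batch_size(per_device: int, grad_accum: int, num_gpus: int) -> int:
    """Number of examples that contribute to one optimizer step."""
    return per_device * grad_accum * num_gpus


def cosine_lr(step: int, total_steps: int, peak_lr: float,
              warmup_steps: int = 0, min_lr: float = 0.0) -> float:
    """Cosine decay from peak_lr to min_lr, with optional linear warmup.

    Assumption: min_lr defaults to 0; the README does not state a final value.
    """
    if warmup_steps > 0 and step < warmup_steps:
        return peak_lr * (step + 1) / warmup_steps
    decay_steps = max(1, total_steps - warmup_steps)
    progress = min(1.0, (step - warmup_steps) / decay_steps)
    return min_lr + 0.5 * (peak_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


batch = effective_batch_size(PER_DEVICE_BATCH, GRAD_ACCUM_STEPS, NUM_GPUS)
# Assumption: the last partial batch still counts as one step.
steps_per_epoch = math.ceil(NUM_EXAMPLES / batch)
total_steps = steps_per_epoch * EPOCHS

print("effective batch size:", batch)
print("steps per epoch:", steps_per_epoch)
print("total optimizer steps:", total_steps)
for step in (0, total_steps // 2, total_steps):
    print(f"lr at step {step}: {cosine_lr(step, total_steps, PEAK_LR, WARMUP_STEPS):.3e}")
```

With zero warmup steps, the schedule starts at the peak learning rate on the first step and decays from there, so under these assumptions roughly half of the peak rate remains at the midpoint of training.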