YzZ-George
/

output-1.3b-RM_350m-nokl

YzZ-George commited on Jul 1, 2023

Commit

5c081c6

•

1 Parent(s): 9492584

Upload 5 files

Files changed (5) hide show

actor_ema/config.json ADDED Viewed

+{
+  "_name_or_path": "/home/zhaiyuanzhao/code/DeepSpeedExamples-4datasets/applications/DeepSpeed-Chat/training/step1_supervised_finetuning/output-1.3b",
+  "_remove_final_layer_norm": false,
+  "activation_dropout": 0.0,
+  "activation_function": "relu",
+  "architectures": [
+    "OPTForCausalLM"
+  ],
+  "attention_dropout": 0.0,
+  "bos_token_id": 2,
+  "do_layer_norm_before": true,
+  "dropout": 0.1,
+  "enable_bias": true,
+  "end_token_id": 2,
+  "eos_token_id": 2,
+  "ffn_dim": 8192,
+  "hidden_size": 2048,
+  "init_std": 0.02,
+  "layer_norm_elementwise_affine": true,
+  "layerdrop": 0.0,
+  "max_position_embeddings": 2048,
+  "model_type": "opt",
+  "num_attention_heads": 32,
+  "num_hidden_layers": 24,
+  "pad_token_id": 2,
+  "prefix": "</s>",
+  "torch_dtype": "float16",
+  "transformers_version": "4.30.0.dev0",
+  "use_cache": true,
+  "vocab_size": 50272,
+  "word_embed_proj_dim": 2048
+}

actor_ema/merges.txt ADDED Viewed

The diff for this file is too large to render. See raw diff

actor_ema/pytorch_model.bin ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:963f891adc23b037faa3fdfe71fb679cc752b43b07bfb638bf98cddf333d2aec
+size 2631663773

actor_ema/vocab.json ADDED Viewed

The diff for this file is too large to render. See raw diff

training.log ADDED Viewed

The diff for this file is too large to render. See raw diff