Text Generation
Transformers
PyTorch
English
opt
deepspeed
chatgpt
sft
Inference Endpoints
text-generation-inference
Adam commited on
Commit
8322d15
1 Parent(s): 13f157d

feat: updating model

Browse files
Files changed (3) hide show
  1. config.json +1 -1
  2. pytorch_model.bin +1 -1
  3. training.log +0 -0
config.json CHANGED
@@ -9,7 +9,7 @@
9
  "attention_dropout": 0.0,
10
  "bos_token_id": 2,
11
  "do_layer_norm_before": true,
12
- "dropout": 0.0,
13
  "enable_bias": true,
14
  "end_token_id": 2,
15
  "eos_token_id": 2,
 
9
  "attention_dropout": 0.0,
10
  "bos_token_id": 2,
11
  "do_layer_norm_before": true,
12
+ "dropout": 0.1,
13
  "enable_bias": true,
14
  "end_token_id": 2,
15
  "eos_token_id": 2,
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:eabca84fb352bba1f3f26c7f7990350cf1e1da46dfee7ec8b4ff64be08949957
3
  size 2631585575
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d741ef95b3410f0046271703547f328272d26f1c89d62a0ac718e825246e669a
3
  size 2631585575
training.log CHANGED
The diff for this file is too large to render. See raw diff