arxyzan commited on
Commit
975e75b
1 Parent(s): 559b014

Hezar: Upload model and config

Browse files
Files changed (1) hide show
  1. model_config.yaml +17 -0
model_config.yaml ADDED
@@ -0,0 +1,17 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ name: distilbert_lm
2
+ config_type: model
3
+ task: language_modeling
4
+ activation: gelu
5
+ attention_dropout: 0.1
6
+ dim: 768
7
+ dropout: 0.1
8
+ hidden_dim: 3072
9
+ initializer_range: 0.02
10
+ max_position_embeddings: 512
11
+ n_heads: 12
12
+ n_layers: 6
13
+ output_past: true
14
+ pad_token_id: 0
15
+ qa_dropout: 0.1
16
+ tie_weights_: true
17
+ vocab_size: 42000