alistairewj commited on
Commit
4821da3
1 Parent(s): d673b88

add model_max_length

Browse files

model_max_length is present in the BERT base tokenizer config, but not here, and it should be

without this, the model_max_length for the tokenizer defaults to a very large int, and inference crashes if not handled properly

Files changed (1) hide show
  1. tokenizer_config.json +2 -1
tokenizer_config.json CHANGED
@@ -1,3 +1,4 @@
1
  {
2
- "do_lower_case": true
 
3
  }
 
1
  {
2
+ "do_lower_case": true,
3
+ "model_max_length": 512
4
  }