add model_max_length

#3

If model_max_length is not specified, the tokenizer defaults to a very large int and inference crashes.

I believe this may be due to a recent change to transformers; the bert-large-uncased config was updated 2 months ago: https://huggingface.co/google-bert/bert-large-uncased/commit/6da4b6a26a1877e173fca3225479512db81a5e5b
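
For anyone who hits the crash before this is merged, here is a minimal sketch of a client-side workaround. It is not the change made in this PR. The helper name `clamp_model_max_length` and the limit of 512 are assumptions for illustration (512 is the usual position-embedding limit for BERT-style checkpoints, so check `max_position_embeddings` in the model config). The stand-in object below only mimics a tokenizer whose length fell back to a very large integer.

```python
from types import SimpleNamespace

# Assumption: 512 is the position-embedding limit of the underlying model.
# Check max_position_embeddings in the model config for the real value.
ASSUMED_MODEL_LIMIT = 512


def clamp_model_max_length(tokenizer, limit=ASSUMED_MODEL_LIMIT):
    """Cap tokenizer.model_max_length at `limit` and return the value in effect.

    `tokenizer` can be any object exposing a `model_max_length` attribute,
    such as a Hugging Face tokenizer. (Hypothetical helper, not a library API.)
    """
    current = getattr(tokenizer, "model_max_length", None)
    if current is None or current > limit:
        tokenizer.model_max_length = limit
    return tokenizer.model_max_length


# Stand-in for a tokenizer whose config does not set model_max_length,
# so the attribute holds a very large integer instead of a usable limit.
unset_tokenizer = SimpleNamespace(model_max_length=int(1e30))
clamped = clamp_model_max_length(unset_tokenizer)
```

With a real tokenizer the same effect should be available by passing `model_max_length=512` to `AutoTokenizer.from_pretrained(...)`, together with `truncation=True` when encoding, so that over-long inputs are cut to the limit.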

Ready to merge
This branch is ready to get merged automatically.
