codeparrot-ds-500sample-gpt-neo-2ep / tokenizer_config.json
Pavithra: Training in progress, step 1000 (commit 99bce8d)
276 Bytes
{"unk_token": "<|endoftext|>", "bos_token": "<|endoftext|>", "eos_token": "<|endoftext|>", "add_prefix_space": false, "model_max_length": 1024, "special_tokens_map_file": null, "name_or_path": "huggingface-course/code-search-net-tokenizer", "tokenizer_class": "GPT2Tokenizer"}