wbxlala / gptmedium

Text Classification

text-generation-inference

Inference Endpoints

gptmedium / tokenizer_config.json

Training in progress, step 500

3b070f5 10 months ago

234 Bytes

	{
	"add_prefix_space": false,
	"bos_token": "<|endoftext|>",
	"clean_up_tokenization_spaces": true,
	"eos_token": "<|endoftext|>",
	"model_max_length": 1024,
	"tokenizer_class": "GPT2Tokenizer",
	"unk_token": "<|endoftext|>"
	}
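
As a quick sanity check, the configuration can be parsed with Python's standard `json` module. The sketch below is illustrative only: the helper name `summarize_tokenizer_config` and the embedded `CONFIG_TEXT` string are assumptions made for this example, not part of the repository or of any library. It collects the special-token entries and reports whether they all share one string, here `<|endoftext|>`.

```python
import json

# The tokenizer_config.json contents, embedded as a string for illustration.
CONFIG_TEXT = """
{
  "add_prefix_space": false,
  "bos_token": "<|endoftext|>",
  "clean_up_tokenization_spaces": true,
  "eos_token": "<|endoftext|>",
  "model_max_length": 1024,
  "tokenizer_class": "GPT2Tokenizer",
  "unk_token": "<|endoftext|>"
}
"""


def summarize_tokenizer_config(text):
    """Hypothetical helper: parse the config and collect its special tokens."""
    config = json.loads(text)
    # Keys such as bos_token, eos_token, and unk_token all end in "_token".
    special = {k: v for k, v in config.items() if k.endswith("_token")}
    return {
        "tokenizer_class": config["tokenizer_class"],
        "model_max_length": config["model_max_length"],
        "special_tokens": special,
        # True when every special-token entry maps to the same string.
        "shared_special_token": len(set(special.values())) == 1,
    }


summary = summarize_tokenizer_config(CONFIG_TEXT)
print(summary)
```

Because `bos_token`, `eos_token`, and `unk_token` are identical in this file, `shared_special_token` comes out true, and `model_max_length` gives the 1024-token limit the tokenizer will report.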