Flan-T5 tokenizer supports neither Chinese nor many code-related tokens despite being advertised as such

#33
by michaelroyzen - opened

It seems that the Flan-T5 tokenizer can't handle Chinese (despite the model card advertising Chinese support), nor many programming-related tokens such as "{", "\n", or "\t". How can this be if Flan-T5 was truly fine-tuned on coding datasets as described in the paper? Is the publicly available tokenizer flawed, or was the entire fine-tuning procedure flawed?
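
For reference, here's a quick way to see the behavior (a minimal sketch, assuming the `transformers` library and the public `google/flan-t5-base` checkpoint):

```python
# Minimal sketch: inspect how the Flan-T5 tokenizer treats Chinese
# text and code-style characters.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("google/flan-t5-base")

for text in ["你好，世界", "def f(x):\n\treturn {x}"]:
    tokens = tokenizer.tokenize(text)
    round_trip = tokenizer.decode(
        tokenizer(text)["input_ids"], skip_special_tokens=True
    )
    print(text, "->", tokens)
    print("round trip:", round_trip)
    # Chinese characters and "{", "\n", "\t" are not in the T5
    # SentencePiece vocabulary, so they come back as <unk> or are
    # dropped during whitespace normalization.
```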

@ybelkada

@michaelroyzen Flan-T5 uses the T5 tokenizer, which is English-only and not well suited to coding tasks. We do include multilingual and coding tasks in the Flan Collection, which play well with multilingual models and appropriate tokenizers, but of course cannot overcome the limitations of the T5 tokenizer. In limited experiments we did not see any evidence that including coding or multilingual tasks hurt Flan-T5 on English held-in and held-out evaluation tasks (not including program synthesis eval); they may even have helped by adding task diversity.

If you'd like to do multilingual or coding tasks, we recommend applying Flan tuning on top of a more appropriate model/tokenizer, and even up-sampling the data sources you're targeting.
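
For illustration, a minimal sketch of that kind of up-sampling with the `datasets` library (the file names and mixture weights below are hypothetical placeholders):

```python
# Minimal sketch: mix two instruction-tuning sets and up-sample the
# one you're targeting by giving it a larger sampling probability.
from datasets import load_dataset, interleave_datasets

flan_mix = load_dataset("json", data_files="flan_mix.jsonl", split="train")
code_mix = load_dataset("json", data_files="code_mix.jsonl", split="train")

# Give the coding data 40% of the mixture regardless of its raw size;
# tune these weights for your own use case.
train_mix = interleave_datasets(
    [flan_mix, code_mix],
    probabilities=[0.6, 0.4],
    seed=0,
    stopping_strategy="all_exhausted",
)
```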
