gradio tqdm tiktoken transformers torch numpy datasets