
wangchanberta-base-wiki-20210520-spm-finetune-qa

This model is airesearchth/wangchanberta-base-wiki-20210520-spm fine-tuned on the training sets of iapp_wiki_qa_squad, thaiqa_squad, and nsc_qa. Training examples whose cosine similarity with any validation or test example exceeds 0.8 were removed, and the contexts of the latter two datasets were trimmed to around 300 newmm words. Benchmarks on the validation and test sets of iapp_wiki_qa_squad are shared on wandb. Trained with thai2transformers.
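The similarity-based filtering described above can be sketched roughly as follows. This is a minimal illustration, not the thai2transformers implementation: the card does not say which text representation the cosine similarity was computed on, so the bag-of-words vectors, the function names `cosine_similarity` and `filter_leaky_examples`, and the whitespace tokenizer default are all assumptions made for this example.

```python
import math
from collections import Counter
from typing import Callable, List


def cosine_similarity(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words count vectors."""
    if not a or not b:
        return 0.0
    dot = sum(count * b[token] for token, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b)


def filter_leaky_examples(
    train: List[str],
    heldout: List[str],
    threshold: float = 0.8,  # cutoff stated in the card
    tokenize: Callable[[str], List[str]] = str.split,  # assumption: placeholder tokenizer
) -> List[str]:
    """Keep training texts whose similarity to every held-out text is <= threshold."""
    heldout_vecs = [Counter(tokenize(text)) for text in heldout]
    kept = []
    for text in train:
        vec = Counter(tokenize(text))
        if all(cosine_similarity(vec, h) <= threshold for h in heldout_vecs):
            kept.append(text)
    return kept


train = ["who wrote the book", "where is the capital city", "who wrote the book today"]
heldout = ["who wrote the book"]
kept = filter_leaky_examples(train, heldout)
```

For Thai text, whitespace splitting is not meaningful; a word tokenizer such as PyThaiNLP's `word_tokenize` with `engine="newmm"` could be passed as `tokenize` instead.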

Run with:

export MODEL_NAME=airesearchth/wangchanberta-base-wiki-20210520-news-spm
CUDA_LAUNCH_BLOCKING=1 python train_question_answering_lm_finetuning.py \
  --model_name $MODEL_NAME \
  --dataset_name chimera_qa \
  --output_dir $MODEL_NAME-finetune-chimera_qa-model \
  --log_dir $MODEL_NAME-finetune-chimera_qa-log \
  --model_max_length 400 \
  --pad_on_right \
  --fp16