Accuracy

#1
by ayazdan - opened

With this checkpoint I get 88.09 / 84.83 (F1 / exact match). The original repo reports 89.8 / 86.9, so my numbers are about 1.7 F1 and 2.1 exact-match points lower.

I used the following command to measure accuracy:

python3 transformer-sparsity/examples/pytorch/question-answering/run_qa.py \
                        --model_name_or_path ${model} \
                        --dataset_name squad_v2 \
                        --do_eval \
                        --per_device_eval_batch_size 8 \
                        --max_seq_length 512 \
                        --doc_stride 128 \
                        --fp16 \
                        --fp16_opt_level O1 \
                        --version_2_with_negative \
                        --overwrite_output_dir \
                        --output_dir ${ckpt_path} 2>&1 | tee ~/${ckpt_path}/eval_run_$(date +"%Y_%m_%d_%I_%M_%p").log
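
For anyone comparing numbers: the two scores are SQuAD-style token-level F1 and exact match. Below is a minimal sketch of how one prediction is scored against one reference answer, assuming the usual SQuAD normalization (lowercase, strip punctuation, drop the articles a/an/the). The function names are illustrative, and this is not the code that run_qa.py itself calls; it only shows the idea.

```python
import re
import string
from collections import Counter


def normalize_answer(text: str) -> str:
    """Lowercase, remove punctuation and articles, collapse whitespace."""
    text = text.lower()
    punctuation = set(string.punctuation)
    text = "".join(ch for ch in text if ch not in punctuation)
    text = re.sub(r"\b(a|an|the)\b", " ", text)
    return " ".join(text.split())


def exact_match(prediction: str, reference: str) -> int:
    """1 if the normalized strings are identical, else 0."""
    return int(normalize_answer(prediction) == normalize_answer(reference))


def f1_score(prediction: str, reference: str) -> float:
    """Token-level F1 between the normalized prediction and reference."""
    pred_tokens = normalize_answer(prediction).split()
    ref_tokens = normalize_answer(reference).split()
    # Assumption: an unanswerable question is represented by an empty string,
    # so two empty answers score 1.0 and a one-sided empty answer scores 0.0.
    if not pred_tokens or not ref_tokens:
        return float(pred_tokens == ref_tokens)
    overlap = sum((Counter(pred_tokens) & Counter(ref_tokens)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred_tokens)
    recall = overlap / len(ref_tokens)
    return 2 * precision * recall / (precision + recall)
```

The dataset-level numbers are averages of these per-question scores (taking the best score over the reference answers when a question has several), reported as percentages.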
