Add evaluation results on conll2003 dataset
Beep boop, I am a bot from Hugging Face's automatic model evaluator π!
Your model has been evaluated on the conll2003 dataset by
@douwekiela
, using the predictions stored here.
Accept this pull request to see the results displayed on the Hub leaderboard.
Evaluate your model on more datasets here.
@douwekiela
@lewtun
. QQ: Why is the f1
so much higher than on when i evaluated?
Good question :) This job was run on the validation
split, so maybe the results you have were reported on the test
split instead?
on the model card there are results for validation
and test
could you please share the evaluation function you used @philschmid ? :)
thats the script i used for training evaluation: https://github.com/philschmid/distilroberta-token-classification/blob/master/src/training/train.py
π