autoevaluate/model-evaluator · How does the space know whether a model is fine-tuned or not?

Hey @patrickvonplaten, the list of compatible models is determined by two criteria:

- Whether the pipeline_tag in the model card matches the selected task
- Whether the selected dataset is one of the datasets listed on the model card

So yes, you won't find fill-mask models in the list right now, as we don't support this (yet) in the backend. Do you see a good use case for evaluating pretrained models?

For reference, here's the filter I apply on the models: https://huggingface.co/spaces/autoevaluate/model-evaluator/blob/main/utils.py#L89
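
In case it helps, here is a rough sketch of the two criteria in plain Python. It is not the actual code in utils.py: the dict layout, the function names and the model ids are made up for illustration, and only the pipeline_tag and datasets fields come from the model card metadata.

```python
from typing import Any

# Hypothetical model-card metadata. The ids and the dict layout are
# assumptions for illustration; only the "pipeline_tag" and "datasets"
# field names correspond to real model-card metadata keys.
MODELS: list[dict[str, Any]] = [
    {"id": "example/bert-glue", "pipeline_tag": "text-classification", "datasets": ["glue"]},
    {"id": "example/bert-base", "pipeline_tag": "fill-mask", "datasets": ["wikipedia"]},
    {"id": "example/distil-imdb", "pipeline_tag": "text-classification", "datasets": ["imdb"]},
]


def is_compatible(model: dict[str, Any], task: str, dataset: str) -> bool:
    """Return True only if both criteria hold for this model."""
    # Criterion 1: the pipeline_tag on the model card matches the selected task.
    tag_matches = model.get("pipeline_tag") == task
    # Criterion 2: the selected dataset is listed on the model card.
    # A missing or empty "datasets" field means the model never matches.
    dataset_listed = dataset in (model.get("datasets") or [])
    return tag_matches and dataset_listed


def compatible_model_ids(models: list[dict[str, Any]], task: str, dataset: str) -> list[str]:
    """Collect the ids of all models that pass both criteria."""
    return [m["id"] for m in models if is_compatible(m, task, dataset)]


if __name__ == "__main__":
    print(compatible_model_ids(MODELS, "text-classification", "glue"))
```

With this kind of filter, a pretrained checkpoint tagged fill-mask never shows up for a task like text-classification, which matches what you are seeing in the list.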