Token Classification
GLiNER
PyTorch
English

Change license to non commercial

#3

The dataset you have trained your model on has a non commercial license.

Owner

Thanks for raising this issues. I am working on training GLiNER on permissible dataset (https://huggingface.co/datasets/numind/NuNER)

@urchade That sounds great! I think you should still change the license on the models trained on the current dataset though.

EDIT: I just had a look at NuNER, and it’s generated by GPT-3.5, which also means that it can’t be used commercially. So it’s the same conclusion in that case. They mention it in their paper: https://arxiv.org/pdf/2402.15343.pdf

Hi @saattrupdan . Is datasets generated by GPT-3.5 automatically non-commercial ? For instance, https://huggingface.co/microsoft/phi-2 is a trained using GPT3.5/GPT-4 output but has an mit licence

@urchade Yep, as otherwise it would be a direct competitor to OpenAI, which is exactly what is forbidden. And you’re right that many datasets and models have wrong licenses here on HF. It doesn’t help that the default license when you create new datasets and models here on the hub is fully commercially viable, so people choose them without thinking about it, unfortunately!

urchade changed pull request status to closed

Sign up or log in to comment