The PR makes data collection via mturk work. Here is the live code from this PR:

https://huggingface.co/spaces/Tristan/dadc

And here you can see the dataset that is generated:

https://huggingface.co/datasets/Tristan/dadc-data/blob/main/data.jsonl

Data is added to this dataset whenever a turker enters data in a preview (without an assignment ID), whenever a turker enters data in a HIT (with an assignment ID), and whenever a random person enters data in the hugging face space (without an assignment ID).

Let me know if you have time to try it out both on mturk sandbox and as a visitor to the hugging face space! Your suggestions would probably be helpful.

Tristan changed pull request status to open

Awesome thanks!

douwekiela changed pull request status to merged

Sign up or log in to comment