AITA multiclass dataset

#1
by justinxzhao - opened

Hi @MattBoraske,

Which dataset are you using to train this model? Would you be able to kindly point me to it?

Thanks in advance!

--Justin

Hey @justinxzhao, you can check out the dataset here - https://huggingface.co/datasets/MattBoraske/reddit-AITA-submissions-and-comments-multiclass. It's actually part of a set of four datasets - you can find them all in this collection - https://huggingface.co/collections/MattBoraske/reddit-aita-finetuning-66038dc9281f16df5a9bab7f

I'm also writing a paper on the dataset and on finetuning flan-t5 (encoder-decoder) and llama-2 (decoder-only) on it to evaluate the capabilities of each for interpersonal conflict resolution. Happy to share it with you when it's finished!

Very cool!

Thank you for the pointers, @MattBoraske. I'll check out the links, and I look forward to hearing about your work.

I'm working on a paper myself. Any chance you'd be interested in meeting virtually?

I guess there aren't any DMs in HF yet! Hopefully that doesn't come across as too forward in a public forum 😮. Feel free to email me at justinxzhao@gmail.com if that's something you'd have time for!

(At a quick glance, the data looks awesome, by the way. I have more questions about it, which I can share in a bit. Thank you for putting it up!)
