A new synthetic preference dataset built using distilabel on top of the awesome LDJnr/Capybara from @LDJnr
The current dataset combines the already generated alternative completions from argilla/distilabel-capybara-dpo-7k-binarized, while also adding the remaining ones using the same approach!
Here are some key features on how we built it:
- 🧹 Duplicate removal, keeping the conversation besides the last assistant response, and some slight pre-processing