Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
aari1995 
posted an update 27 days ago
Post
2732
ARABIC CHINESE FRENCH GERMAN RUSSIAN SPANISH TURKISH

mLLM - first release:
orca_dpo_pairs by Intel (translated into 7 languages)

ARABIC CHINESE FRENCH GERMAN RUSSIAN SPANISH TURKISH

Upcoming:
- more datasets
- cleaning steps
- a blogpost
- stay updated at https://hf.co/multilingual

multilingual/orca_dpo_pairs

Is the dataset translated by GPT or by humans (or human validated)?

·

no there will be some filtering happening, working on the algorithm currently to do so.

the German translation is fairly bad unfortunately

Hi! We in Vikhr works on same, can help with russian aligment and evaluation