No, there will be some filtering. I'm currently working on the algorithm for it.
Aaron Chibb
aari1995
AI & ML interests
Multilinguality and German LLMs
aari1995's activity
replied to their post 2 months ago
posted an update 2 months ago
mLLM - first release:
orca_dpo_pairs by Intel (translated into 7 languages)
ARABIC CHINESE FRENCH GERMAN RUSSIAN SPANISH TURKISH
Upcoming:
- more datasets
- cleaning steps
- a blog post
- stay updated at https://hf.co/multilingual
multilingual/orca_dpo_pairs