No, there will be some filtering; I'm currently working on the algorithm for it.
Aaron Chibb
aari1995
AI & ML interests
Multilinguality and German LLMs
aari1995's activity
replied to their post 27 days ago
posted an update 27 days ago
mLLM - first release:
orca_dpo_pairs by Intel, translated into 7 languages:
Arabic, Chinese, French, German, Russian, Spanish, Turkish
Upcoming:
- more datasets
- cleaning steps
- a blogpost
- stay updated at https://hf.co/multilingual
multilingual/orca_dpo_pairs