Aaron Chibb

aari1995

AI & ML interests

Multilinguality and German LLMs

Organizations

aari1995's activity

replied to their post 2 months ago
view reply

no there will be some filtering happening, working on the algorithm currently to do so.

posted an update 2 months ago
view post
Post
2787
ARABIC CHINESE FRENCH GERMAN RUSSIAN SPANISH TURKISH

mLLM - first release:
orca_dpo_pairs by Intel (translated into 7 languages)

ARABIC CHINESE FRENCH GERMAN RUSSIAN SPANISH TURKISH

Upcoming:
- more datasets
- cleaning steps
- a blogpost
- stay updated at https://hf.co/multilingual

multilingual/orca_dpo_pairs
ยท
posted an update 3 months ago
view post
Post
looking at the tokenizer and the naming (โ€œ_enโ€œ), Google Gemma is very likely to have a multilingual counterpart. ๐Ÿ‘€

Thoughts?
  • 3 replies
ยท
posted an update 4 months ago
view post
Post
@clem ist das der erste nicht Englische post auf huggingface?๐Ÿ‘‹๐Ÿฝ ๐Ÿ‡ฉ๐Ÿ‡ช๐Ÿ‡ซ๐Ÿ‡ท๐Ÿ‡ฎ๐Ÿ‡น๐Ÿ‡ช๐Ÿ‡ธ๐Ÿ‡ฎ๐Ÿ‡ณโ€ฆ
  • 1 reply
ยท