Aaron Chibb

aari1995

AI & ML interests

Multilinguality and German LLMs

Organizations

aari1995's activity

replied to their post 27 days ago
view reply

no there will be some filtering happening, working on the algorithm currently to do so.

posted an update 27 days ago
view post
Post
2732
ARABIC CHINESE FRENCH GERMAN RUSSIAN SPANISH TURKISH

mLLM - first release:
orca_dpo_pairs by Intel (translated into 7 languages)

ARABIC CHINESE FRENCH GERMAN RUSSIAN SPANISH TURKISH

Upcoming:
- more datasets
- cleaning steps
- a blogpost
- stay updated at https://hf.co/multilingual

multilingual/orca_dpo_pairs
ยท
posted an update 2 months ago
view post
Post
looking at the tokenizer and the naming (โ€œ_enโ€œ), Google Gemma is very likely to have a multilingual counterpart. ๐Ÿ‘€

Thoughts?
  • 3 replies
ยท
posted an update 3 months ago
view post
Post
@clem ist das der erste nicht Englische post auf huggingface?๐Ÿ‘‹๐Ÿฝ ๐Ÿ‡ฉ๐Ÿ‡ช๐Ÿ‡ซ๐Ÿ‡ท๐Ÿ‡ฎ๐Ÿ‡น๐Ÿ‡ช๐Ÿ‡ธ๐Ÿ‡ฎ๐Ÿ‡ณโ€ฆ
  • 1 reply
ยท