AbrutiNoir

Abru

AI & ML interests

None yet

Organizations

None yet

Abru's activity

reacted to ZennyKenny's post with 🔥 30 days ago

Post

1937

Besides being the coolest-named benchmark in the game, HellaSwag is an important measurement of здравый смысл (or common sense) in LLMs.

- More on HellaSwag: https://github.com/rowanz/hellaswag

I spent the afternoon benchmarking YandexGPT Pro 4th Gen, one of the Russian tech giant's premier models.

- Yandex HF Org: https://huggingface.co/yandex
- More on Yandex models: https://yandex.cloud/ru/docs/foundation-models/concepts/yandexgpt/models

The eval notebook is available on GitHub and the resulting dataset is already on the HF Hub!

- Eval Notebook: https://github.com/kghamilton89/ai-explorer/blob/main/yandex-hellaswag/hellaswag-assess.ipynb
- Eval Dataset: https://huggingface.co/datasets/ZennyKenny/yandexgptpro_4th_gen-hellaswag

And of course, everyone wants to see the results, so have a look at them in the context of the other zero-shot experiments I was able to find!

2 replies

liked a Space 10 months ago

Gemma 10m (archit11/gemma-10m)

Generate detailed responses to text prompts

upvoted a paper 10 months ago

Mistral 7B

Paper • 2310.06825 • Published Oct 10, 2023 • 48

reacted to turiabu's post with 🤗 11 months ago

Post

2208

Can anyone see my post on 🤗?
Reply with 🤗

4 replies

updated a Space over 1 year ago

Mistralai Mixtral 8x7B V0.1