---
title: Spanish LLM Benchmark Annotation with Argilla
pin: true
emoji: 🏷️
colorFrom: purple
colorTo: red
sdk: docker
app_port: 6900
fullWidth: true
tags:
- argilla
short_description: Collaborative effort on Spanish ARC-C, HellaSwag, and MMLU
---

## Español

Este Space de Argilla se ha creado para realizar un esfuerzo colaborativo para anotar o mejorar algunos conjuntos de datos sintéticamente traducidos al Español, utilizados para evaluar LLMs (HellaSwag, ARC-C y MMLU, creados originalmente en Inglés).

Los conjuntos de datos utilizados surgen de la combinación de los originales en Inglés, con sus respectivas alternativas traducidas al Español por medio de ChatGPT, gracias al esfuerzo del grupo de Procesamiento Natural de la Universidad de Oregón.

Para obtener más información sobre los conjuntos de datos, consultad las guías de anotación para cada uno de ellos en Argilla.

## English

This Argilla Space has been created to start a collaborative effort to annotate / curate some existing synthetically generated Spanish translations from existing LLM benchmarks (HellaSwag, ARC-C, and MMLU) orginally written in English.

The datasets used are ports from a combination of the original datasets (English ones) with their counterparts translated to Spanish using ChatGPT by the Natural Language Processing group of the University of Oregon.

Read more about the datasets within their respective guidelines.