Spaces:
Runtime error
Runtime error
title: Shopping MMLU Leaderboard | |
emoji: 🌎 | |
colorFrom: blue | |
colorTo: green | |
sdk: gradio | |
sdk_version: 4.44.1 | |
app_file: app.py | |
pinned: true | |
license: apache-2.0 | |
tags: | |
- leaderboard | |
short_description: 'Massive Multi-Task LLM Benchmark for Online Shopping' | |
In this leaderboard, we display evaluation results obtained with Shopping MMLU. The space provides an overall leaderboard, consisting of 4 main online shopping skills: | |
- Shopping Concept Understanding | |
- Shopping Knowledge Reasoning | |
- User Behavior Alignment | |
- Multi-lingual Abilities | |
Github: https://github.com/KL4805/ShoppingMMLU | |
Report: https://arxiv.org/abs/2410.20745 | |
Please consider to cite the report if the resource is useful to your research: | |
```BibTex | |
``` |