File size: 732 Bytes
6a15afe 1874081 287e410 c0fabde 287e410 3a4a78e 9af5dce 6a15afe 77c1fdd 1590d6b 77c1fdd 1874081 77c1fdd 1590d6b 77c1fdd |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 |
---
title: Shopping MMLU Leaderboard
emoji: 🌎
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: true
license: apache-2.0
tags:
- leaderboard
short_description: 'Massive Multi-Task LLM Benchmark for Online Shopping'
---
In this leaderboard, we display evaluation results obtained with Shopping MMLU. The space provides an overall leaderboard, consisting of 4 main online shopping skills:
- Shopping Concept Understanding
- Shopping Knowledge Reasoning
- User Behavior Alignment
- Multi-lingual Abilities
Github: https://github.com/KL4805/ShoppingMMLU
Report: https://arxiv.org/abs/2410.20745
Please consider to cite the report if the resource is useful to your research:
```BibTex
``` |