---
title: Shopping MMLU Leaderboard
emoji: 🌎
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: true
license: apache-2.0
tags:
- leaderboard
short_description: 'Massive Multi-Task LLM Benchmark for Online Shopping'
---


This leaderboard displays evaluation results obtained with Shopping MMLU. The space provides an overall leaderboard covering 4 main online shopping skills:
- Shopping Concept Understanding
- Shopping Knowledge Reasoning
- User Behavior Alignment
- Multi-lingual Abilities  

- GitHub: https://github.com/KL4805/ShoppingMMLU
- Report: https://arxiv.org/abs/2410.20745

Please consider citing the report if this resource is useful to your research:

```BibTex

```