---
title: Shopping MMLU Leaderboard
emoji: 🌎
colorFrom: blue
colorTo: green
sdk: gradio
sdk_version: 4.44.1
app_file: app.py
pinned: true
license: apache-2.0
tags:
  - leaderboard
short_description: Massive Multi-Task LLM Benchmark for Online Shopping
---
This leaderboard displays evaluation results obtained with Shopping MMLU. The Space provides an overall leaderboard covering four main online shopping skills:
- Shopping Concept Understanding
- Shopping Knowledge Reasoning
- User Behavior Alignment
- Multi-lingual Abilities
- GitHub: https://github.com/KL4805/ShoppingMMLU
- Report: https://arxiv.org/abs/2410.20745
Please consider citing the report if this resource is useful to your research: