Datasets used for the OLMo experiments in the "Not All Data are Unlearned Equally" paper https://arxiv.org/abs/2504.05058

McGill NLP Group
university
AI & ML interests
computational linguistics, natural language processing
Recent Activity
Collections
12
-
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
Paper • 2504.08942 • Published • 25 -
McGill-NLP/agent-reward-bench
Viewer • Updated • 1.41k • 1.21k • 2 -
3
Agent Reward Bench Demo
💻Visualize agent interactions with WebArena tasks
-
Agent Reward Bench Leaderboard
🥇Leaderboard for AgentRewardBench
spaces
5
pinned
Running
15
WebLINX Explorer
😻
Browse and visualize web demonstration recordings
Running
Agent Reward Bench Leaderboard
🥇
Leaderboard for AgentRewardBench
Running
3
Agent Reward Bench Demo
💻
Visualize agent interactions with WebArena tasks
Running
2
Safearena Leaderboard
🏃
SafeArena Leaderboard
Runtime error
5
AURORA
🌖
models
58

McGill-NLP/nano-aha-moment-3b
Text Generation
•
Updated
•
100
•
2

McGill-NLP/AURORA
Image-to-Image
•
Updated
•
66
•
4

McGill-NLP/pix2act-large-weblinx
Text Generation
•
Updated
•
20
•
1

McGill-NLP/LLM2Vec-Meta-Llama-31-8B-Instruct-mntp
Sentence Similarity
•
Updated
•
257
•
2

McGill-NLP/LLM2Vec-Meta-Llama-31-8B-Instruct-mntp-supervised
Sentence Similarity
•
Updated
•
159
•
4

McGill-NLP/LLM2Vec-Meta-Llama-31-8B-Instruct-mntp-unsup-simcse
Sentence Similarity
•
Updated
•
154
•
2

McGill-NLP/LLM2Vec-Mistral-7B-Instruct-v2-mntp
Sentence Similarity
•
Updated
•
2.8k
•
10

McGill-NLP/LLM2Vec-Llama-2-7b-chat-hf-mntp
Sentence Similarity
•
Updated
•
78

McGill-NLP/LLM2Vec-Sheared-LLaMA-mntp
Sentence Similarity
•
Updated
•
2.54k
•
5

McGill-NLP/LLM2Vec-Meta-Llama-3-8B-Instruct-mntp
Sentence Similarity
•
Updated
•
8.18k
•
16
datasets
26
McGill-NLP/zsre_qa
Viewer
•
Updated
•
1.52k
•
30
McGill-NLP/book_author_qa
Viewer
•
Updated
•
1.48k
•
28
McGill-NLP/country_capital_qa
Viewer
•
Updated
•
1.39k
•
31
McGill-NLP/agent-reward-bench
Viewer
•
Updated
•
1.41k
•
1.21k
•
2
McGill-NLP/MultiDigit-20
Viewer
•
Updated
•
16k
•
63
McGill-NLP/AdvBench-IR
Viewer
•
Updated
•
520
•
110
•
3
McGill-NLP/safearena
Updated
•
43
•
2
McGill-NLP/WebLINX-full
Updated
•
102k
•
6
McGill-NLP/CHASE-Code
Viewer
•
Updated
•
500
•
94
McGill-NLP/CHASE-Math
Viewer
•
Updated
•
500
•
87