Each dataset is divided into easy, medium, and difficult splits using the familiarity metric. Please see our paper for details.
Jonas Golde
whoisjones
AI & ML interests
Data-efficient transfer learning
Recent Activity
updated a model 5 days ago: whoisjones/fineweb-edu-scorer-mdeberta-binary
updated a model 6 days ago: whoisjones/fineweb-edu-scorer-xlm-binary
Collections: 2

Models: 4
whoisjones/fineweb-edu-scorer-mdeberta-binary • Text Classification • Updated • 2
whoisjones/fineweb-edu-scorer-xlm-binary • Text Classification • Updated • 4
whoisjones/fineweb-edu-scorer-xlm-multi • Text Classification • Updated
whoisjones/fineweb-edu-scorer-xlm-binary-0.0003 • Text Classification • Updated

Datasets: 9
whoisjones/mastermind_24_random • Updated • 53
whoisjones/pilener_max_splits • Viewer • Updated • 43.7k • 47
whoisjones/pilener_entropy_splits • Viewer • Updated • 78.8k • 49
whoisjones/nuner_max_splits • Viewer • Updated • 546k • 51
whoisjones/nuner_entropy_splits • Viewer • Updated • 618k • 43
whoisjones/mastermind_46 • Viewer • Updated • 36.1k • 52
whoisjones/mastermind_35 • Viewer • Updated • 37.1k • 40
whoisjones/mastermind_24 • Viewer • Updated • 30.4k • 52
whoisjones/litset • Viewer • Updated • 1.81M • 52 • 2