Update README.md
Browse files
README.md
CHANGED
|
@@ -107,7 +107,8 @@ pipeline_tag: text-classification
|
|
| 107 |
# Multilingual Educational Content Classifier
|
| 108 |
|
| 109 |
Trained on full documents of up to 8192 tokens in total. The train set of [tartuNLP/fineweb-c-combined-resample](https://huggingface.co/datasets/tartuNLP/fineweb-c-combined-resample)
|
| 110 |
-
was used.
|
|
|
|
| 111 |
|
| 112 |
## Labels
|
| 113 |
|
|
|
|
| 107 |
# Multilingual Educational Content Classifier
|
| 108 |
|
| 109 |
Trained on full documents of up to 8192 tokens in total. The train set of [tartuNLP/fineweb-c-combined-resample](https://huggingface.co/datasets/tartuNLP/fineweb-c-combined-resample)
|
| 110 |
+
was used, which itself is a mix and a resample of [HuggingFaceFW/fineweb-edu-llama3-annotations](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu-llama3-annotations) and
|
| 111 |
+
[https://huggingface.co/datasets/data-is-better-together/fineweb-c](https://huggingface.co/datasets/data-is-better-together/fineweb-c).
|
| 112 |
|
| 113 |
## Labels
|
| 114 |
|