Text Classification
Transformers
Safetensors
modernbert
adorkin commited on
Commit
cbdd329
·
verified ·
1 Parent(s): 45fa765

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -107,7 +107,8 @@ pipeline_tag: text-classification
107
  # Multilingual Educational Content Classifier
108
 
109
  Trained on full documents of up to 8192 tokens in total. The train set of [tartuNLP/fineweb-c-combined-resample](https://huggingface.co/datasets/tartuNLP/fineweb-c-combined-resample)
110
- was used.
 
111
 
112
  ## Labels
113
 
 
107
  # Multilingual Educational Content Classifier
108
 
109
  Trained on full documents of up to 8192 tokens in total. The train set of [tartuNLP/fineweb-c-combined-resample](https://huggingface.co/datasets/tartuNLP/fineweb-c-combined-resample)
110
+ was used, which itself is a mix and a resample of [HuggingFaceFW/fineweb-edu-llama3-annotations](https://huggingface.co/datasets/HuggingFaceFW/fineweb-edu-llama3-annotations) and
111
+ [https://huggingface.co/datasets/data-is-better-together/fineweb-c](https://huggingface.co/datasets/data-is-better-together/fineweb-c).
112
 
113
  ## Labels
114