Update README.md
Browse files
README.md
CHANGED
@@ -246,7 +246,7 @@ Feel free to click the expand button below to see the full list of sources.
|
|
246 |
| proof-pile | en | [Link](https://huggingface.co/datasets/hoskinson-center/proof-pile) |
|
247 |
| RedPajama-Data T1 (StackExchange subset) | en | Computer, 2023 |
|
248 |
| The Pile (PhilPapers subset) | en | Gao et al., 2021 |
|
249 |
-
| Biomedical | es | Internally generated
|
250 |
| HPLTDatasets v1 - Spanish | es | de Gibert et al., 2024 |
|
251 |
| Legal | es | Internally generated legal dataset: BOE, BORME, Senado, Congreso, Spanish court orders, DOGC |
|
252 |
| Scientific | es | Internally generated scientific dataset: Dialnet, Scielo, CSIC, TDX, BSC, UCM |
|
|
|
246 |
| proof-pile | en | [Link](https://huggingface.co/datasets/hoskinson-center/proof-pile) |
|
247 |
| RedPajama-Data T1 (StackExchange subset) | en | Computer, 2023 |
|
248 |
| The Pile (PhilPapers subset) | en | Gao et al., 2021 |
|
249 |
+
| Biomedical | es | Internally generated biomedical dataset: Wikipedia LS, Pubmed, MeSpEn, patents, clinical cases, medical crawler |
|
250 |
| HPLTDatasets v1 - Spanish | es | de Gibert et al., 2024 |
|
251 |
| Legal | es | Internally generated legal dataset: BOE, BORME, Senado, Congreso, Spanish court orders, DOGC |
|
252 |
| Scientific | es | Internally generated scientific dataset: Dialnet, Scielo, CSIC, TDX, BSC, UCM |
|