s-lilo's picture
Update README.md
d6eca51 verified
---
license: cc-by-4.0
language:
- es
base_model: PlanTL-GOB-ES/bsc-bio-ehr-es
---
# Training data
Model trained on the anonymization part of [CARMEN-I](https://zenodo.org/records/10171540) and [MEDDOCAN](https://zenodo.org/records/4279323).
# Citation
Please cite the following works:
```
@inproceedings{meddocan,
title={{Automatic De-identification of Medical Texts in Spanish: the MEDDOCAN Track, Corpus, Guidelines, Methods and Evaluation of Results}},
author={Marimon, Montserrat and Gonzalez-Agirre, Aitor and Intxaurrondo, Ander and Villegas, Marta and Krallinger, Martin},
booktitle="Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2019)",
year={2019}
}
@misc{carmen_physionet,
author = {Farre Maduell, Eulalia and Lima-Lopez, Salvador and Frid, Santiago Andres and Conesa, Artur and Asensio, Elisa and Lopez-Rueda, Antonio and Arino, Helena and Calvo, Elena and Bertran, Maria Jesús and Marcos, Maria Angeles and Nofre Maiz, Montserrat and Tañá Velasco, Laura and Marti, Antonia and Farreres, Ricardo and Pastor, Xavier and Borrat Frigola, Xavier and Krallinger, Martin},
title = {{CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools (version 1.0.1)}},
year = {2024},
publisher = {PhysioNet},
url = {https://doi.org/10.13026/x7ed-9r91}
}
@article{physionet,
author = {Ary L. Goldberger and Luis A. N. Amaral and Leon Glass and Jeffrey M. Hausdorff and Plamen Ch. Ivanov and Roger G. Mark and Joseph E. Mietus and George B. Moody and Chung-Kang Peng and H. Eugene Stanley },
title = {PhysioBank, PhysioToolkit, and PhysioNet },
journal = {Circulation},
volume = {101},
number = {23},
pages = {e215-e220},
year = {2000},
doi = {10.1161/01.CIR.101.23.e215},
URL = {https://www.ahajournals.org/doi/abs/10.1161/01.CIR.101.23.e215}
}
```
# Contacting authors
jan.rodriguez [at] bsc.es
## More information on data, usage, limitations, and performance metrics soon