s-lilo's picture
Update README.md
d6eca51 verified
metadata
license: cc-by-4.0
language:
  - es
base_model: PlanTL-GOB-ES/bsc-bio-ehr-es

Training data

Model trained on the anonymization part of CARMEN-I and MEDDOCAN.

Citation

Please cite the following works:

@inproceedings{meddocan,
  title={{Automatic De-identification of Medical Texts in Spanish: the MEDDOCAN Track, Corpus, Guidelines, Methods and Evaluation of Results}},
  author={Marimon, Montserrat and Gonzalez-Agirre, Aitor and Intxaurrondo, Ander and Villegas, Marta and Krallinger, Martin},
  booktitle="Proceedings of the Iberian Languages Evaluation Forum (IberLEF 2019)",
  year={2019}
}

@misc{carmen_physionet, 
  author = {Farre Maduell, Eulalia and Lima-Lopez, Salvador and Frid, Santiago Andres and Conesa, Artur and Asensio, Elisa and Lopez-Rueda, Antonio and Arino, Helena and Calvo, Elena and Bertran, Maria Jesús and Marcos, Maria Angeles and Nofre Maiz, Montserrat and Tañá Velasco, Laura and Marti, Antonia and Farreres, Ricardo and Pastor, Xavier and Borrat Frigola, Xavier and Krallinger, Martin}, 
  title = {{CARMEN-I: A resource of anonymized electronic health records in Spanish and Catalan for training and testing NLP tools (version 1.0.1)}}, 
  year = {2024}, 
  publisher = {PhysioNet}, 
  url = {https://doi.org/10.13026/x7ed-9r91} 
}

@article{physionet,
  author = {Ary L. Goldberger  and Luis A. N. Amaral  and Leon Glass  and Jeffrey M. Hausdorff  and Plamen Ch. Ivanov  and Roger G. Mark  and Joseph E. Mietus  and George B. Moody  and Chung-Kang Peng  and H. Eugene Stanley },
  title = {PhysioBank, PhysioToolkit, and PhysioNet  },
  journal = {Circulation},
  volume = {101},
  number = {23},
  pages = {e215-e220},
  year = {2000},
  doi = {10.1161/01.CIR.101.23.e215},
  URL = {https://www.ahajournals.org/doi/abs/10.1161/01.CIR.101.23.e215}
}

Contacting authors

jan.rodriguez [at] bsc.es

More information on data, usage, limitations, and performance metrics soon