mdm-code/me-lemmatize-byt5-small

mdm-code committed on Jul 30, 2023

Commit 50b114e (1 parent: 4471561)

Update readme

Files changed (1)

README.md +8 -7

@@ -3,6 +3,7 @@ license: gpl-3.0
 ---
 [MANX](https://github.com/mdm-code/manx)
 This is a ByT5-small model fine-tuned for early Middle English lemmatization.
 This is a PoC. The model has been fed series of 11-grams extracted from eLAEME
@@ -11,14 +12,14 @@ lemmatizer for all sorts of Middle English texts because eLAEME employes bespoke
 transcription rules that diverge from your regular transcript rules.
 The `manx` package that you can use the model with can be found here:
-`https://github.com/mdm-code/manx`. The package will give a more in-depth look
 at the data used to fine-tune the model. It lets you download corpus files, parse
 them and get them ready for fine-tuning the base model checkpoint.
-It has links to Colab notebooks and ready-made API that lets you feed
 texts to have them lemmatized.
-Make sure to reference this Huggingface repository (`https://huggingface.co/mdm-code/me-lemmatize-byt5-small`)
-and the Github repository (`https://github.com/mdm-code/manx`) for
-`manx` whenever you use this model for your own research. The model and package
-are published under the GPL-3 license, so make sure any research output and codebase
-are made publicly available. Do not violate this license.

 ---
 [MANX](https://github.com/mdm-code/manx)
+[Colab Notebook](https://colab.research.google.com/drive/1qpd4F8BoHMGzZnSqrGxZe-1YyX9IhVHc?usp=sharing)
 This is a ByT5-small model fine-tuned for early Middle English lemmatization.
 This is a PoC. The model has been fed series of 11-grams extracted from eLAEME
 transcription rules that diverge from your regular transcript rules.
 The `manx` package that you can use the model with can be found here:
+`https://github.com/mdm-code/manx`. The package will give a more general look
 at the data used to fine-tune the model. It lets you download corpus files, parse
 them and get them ready for fine-tuning the base model checkpoint.
+It has links to Colab notebook and ready-made API that lets you feed
 texts to have them lemmatized.
+Make sure to reference this Huggingface repository
+(`https://huggingface.co/mdm-code/me-lemmatize-byt5-small`) and the Github
+repository (`https://github.com/mdm-code/manx`) for `manx` whenever you use
+this model for your own research. The model and package are published under the
+GPL-3 license.
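
The README text in this diff says the model was fed series of 11-grams extracted from eLAEME, but it does not say how those windows were built. As a rough illustration only, the sketch below assumes plain overlapping windows of 11 consecutive tokens. The function name `sliding_ngrams` and the placeholder tokens are hypothetical and are not part of `manx`; the package's actual preprocessing may differ.

```python
from typing import List, Sequence


def sliding_ngrams(tokens: Sequence[str], n: int = 11) -> List[List[str]]:
    """Return every run of n consecutive tokens, in order of appearance."""
    if n <= 0:
        raise ValueError("n must be positive")
    # Assumption: a text with at most n tokens yields one (possibly shorter)
    # window; an empty text yields no windows at all.
    if len(tokens) <= n:
        return [list(tokens)] if tokens else []
    # Slide a window of width n one token at a time.
    return [list(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


# Placeholder tokens, not real eLAEME data.
words = [f"w{i}" for i in range(13)]
windows = sliding_ngrams(words)
```

With 13 tokens and a window of 11, this produces three overlapping windows, each shifted by one token. Whether the model expects input shaped like this is an open question that the `manx` repository would settle.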