PantagrueLLM
/

jargon-general-biomed

Model card Files Files and versions Community

jargon-general-biomed / README.md

a-mannion's picture

Update README.md

27896c2 verified 2 months ago

|

No virus

1.98 kB

	---
	license: mit
	language:
	- fr
	library_name: transformers
	tags:
	- linformer
	- legal
	- medical
	- RoBERTa
	- pytorch
	---

	# Jargon-general-base

	[Jargon](https://hal.science/hal-04535557/file/FB2_domaines_specialises_LREC_COLING24.pdf) is an efficient transformer encoder LM for French, combining the LinFormer attention mechanism with the RoBERTa model architecture.

	Jargon is available in several versions with different context sizes and types of pre-training corpora.

	<!-- Provide a quick summary of what the model is/does. -->

	<!-- This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).
	-->



	## Using Jargon models with HuggingFace transformers

	You can get started with `jargon-general-base` using the code snippet below:

	```python
	from transformers import AutoModelForMaskedLM, AutoTokenizer, pipeline

	tokenizer = AutoTokenizer.from_pretrained("PantagrueLLM/jargon-general-base", trust_remote_code=True)
	model = AutoModelForMaskedLM.from_pretrained("PantagrueLLM/jargon-general-base", trust_remote_code=True)

	jargon_maskfiller = pipeline("fill-mask", model=model, tokenizer=tokenizer)
	output = jargon_maskfiller("Il est allé au <mask> hier")
	```

	- Funded by
	- GENCI-IDRIS (Grant 2022 A0131013801)
	- French National Research Agency: Pantagruel grant ANR-23-IAS1-0001
	- MIAI@Grenoble Alpes ANR-19-P3IA-0003
	- PROPICTO ANR-20-CE93-0005
	- Lawbot ANR-20-CE38-0013
	- Swiss National Science Foundation (grant PROPICTO N°197864)
	<!-- - Shared by [optional]: [More Information Needed] -->
	<!-- - Model type: [More Information Needed] -->
	- Language(s): French
	- License: MIT
	- Developed by: Vincent Segonne
	<!-- - Finetuned from model [optional]: [More Information Needed] -->
	<!--
	### Model Sources [optional]

	<!-- Provide the basic links for the model. -->