ccdv
/

lsg-bart-base-16384-arxiv

text2text-generation

Model card Files Files and versions Community

ccdv commited on May 30, 2022

Commit

0a7d026

·

1 Parent(s): a917f9e

readme

Files changed (1) hide show

README.md +19 -1

README.md CHANGED Viewed

@@ -18,6 +18,24 @@ should probably proofread and complete it, then remove this comment. -->
 **This model relies on a custom modeling file, you need to add trust_remote_code=True**\
 **See [\#13467](https://github.com/huggingface/transformers/pull/13467)**
 # ccdv/lsg-bart-base-16384-arxiv
 This model is a fine-tuned version of [ccdv/lsg-bart-base-4096-arxiv](https://huggingface.co/ccdv/lsg-bart-base-4096-arxiv) on the scientific_papers arxiv dataset. \
@@ -70,7 +88,7 @@ The following hyperparameters were used during training:
 The following hyperparameters were used during generation:
 - dataset_name: scientific_papers
 - dataset_config_name: arxiv
-- eval_batch_size: 8
 - eval_samples: 6440
 - early_stopping: True
 - ignore_pad_token_for_loss: True

 **This model relies on a custom modeling file, you need to add trust_remote_code=True**\
 **See [\#13467](https://github.com/huggingface/transformers/pull/13467)**
+```python
+from transformers import AutoTokenizer, AutoModelForSeq2SeqLM, pipeline
+tokenizer = AutoTokenizer.from_pretrained("ccdv/lsg-bart-base-16384-arxiv", trust_remote_code=True)
+model = AutoModelForSeq2SeqLM.from_pretrained("ccdv/lsg-bart-base-16384-arxiv", trust_remote_code=True)
+text = "Replace by what you want."
+pipe = pipeline("text2text-generation", model=model, tokenizer=tokenizer, device=0)
+generated_text = pipe(
+  text,
+  truncation=True,
+  max_length=64,
+  no_repeat_ngram_size=7,
+  num_beams=2,
+  early_stopping=True
+  )
+```
 # ccdv/lsg-bart-base-16384-arxiv
 This model is a fine-tuned version of [ccdv/lsg-bart-base-4096-arxiv](https://huggingface.co/ccdv/lsg-bart-base-4096-arxiv) on the scientific_papers arxiv dataset. \
 The following hyperparameters were used during generation:
 - dataset_name: scientific_papers
 - dataset_config_name: arxiv
+- eval_batch_size: 4
 - eval_samples: 6440
 - early_stopping: True
 - ignore_pad_token_for_loss: True