hyesunyun committed on
Commit
5410d3d
1 Parent(s): 1966a75

Create README.md

  1. README.md +72 -0
README.md ADDED
@@ -0,0 +1,72 @@
+ ---
+ language:
+ - en
+ tags:
+ - update summarization
+ - longformer
+ - transformers
+ - BART
+ - PyTorch
+ - Tensorboard
+ - led
+ metrics:
+ - edit distance
+ - ROUGE
+ - BertScore
+ ---
+
+ # Update Summarization with BART Large and Longformer Encoder Decoder
+
+ ## Model description
+
+ This is a Transformer-based sequence-to-sequence model that generates text from long input documents.
+
+ It is based on [BART Large](https://huggingface.co/transformers/model_doc/bart.html) combined with the [Longformer Encoder-Decoder (LED)](https://huggingface.co/transformers/model_doc/led.html) architecture to allow for longer inputs.
+
+ ## Intended uses & limitations
+
+ #### How to use
+
+ Format your data so that each new article or piece of evidence to add is preceded by an `<EV>` token, with each title prefixed by `<t>` and each abstract prefixed by `<abs>`. Put the original summary in the same format. The list of articles and the original summary can be concatenated in any order as long as they carry the correct separator tokens.
+
+ ```python
+ import torch
+ from transformers import LEDTokenizer, LEDForConditionalGeneration
+
+ tokenizer = LEDTokenizer.from_pretrained("hyesunyun/update-summarization-led-edit-at-a-time")
+ model = LEDForConditionalGeneration.from_pretrained("hyesunyun/update-summarization-led-edit-at-a-time")
+
+ input = "<EV> <t> Hypoglycemic effect of bitter melon compared with metformin in newly diagnosed type 2 diabetes patients. <abs> ETHNOPHARMACOLOGICAL RELEVANCE: Bitter melon (Momordica charantia L.) has been widely used as an traditional medicine treatment for diabetic patients in Asia. In vitro and animal studies suggested its hypoglycemic activity, but limited human studies are available to support its use. AIM OF STUDY: This study was conducted to assess the efficacy and safety of three doses of bitter melon compared with metformin. MATERIALS AND METHODS: This is a 4-week, multicenter, randomized, double-blind, active-control trial. Patients were randomized into 4 groups to receive bitter melon 500 mg/day, 1,000 mg/day, and 2,000 mg/day or metformin 1,000 mg/day. All patients were followed for 4 weeks. RESULTS: There was a significant decline in fructosamine at week 4 of the metformin group (-16.8; 95% CI, -31.2, -2.4 mumol/L) and the bitter melon 2,000 mg/day group (-10.2; 95% CI, -19.1, -1.3 mumol/L). Bitter melon 500 and 1,000 mg/day did not significantly decrease fructosamine levels (-3.5; 95% CI -11.7, 4.6 and -10.3; 95% CI -22.7, 2.2 mumol/L, respectively). CONCLUSIONS: Bitter melon had a modest hypoglycemic effect and significantly reduced fructosamine levels from baseline among patients with type 2 diabetes who received 2,000 mg/day. However, the hypoglycemic effect of bitter melon was less than metformin 1,000 mg/day. <EV> <t> Momordica charantia for type 2 diabetes mellitus. <abs> There is insufficient evidence to recommend momordica charantia for type 2 diabetes mellitus. Further studies are therefore required to address the issues of standardization and the quality control of preparations. For medical nutritional therapy, further observational trials evaluating the effects of momordica charantia are needed before RCTs are established to guide any recommendations in clinical practice."
+ inputs_dict = tokenizer(input, padding="max_length", max_length=10240, return_tensors="pt", truncation=True)
+ input_ids = inputs_dict.input_ids
+ attention_mask = inputs_dict.attention_mask
+ global_attention_mask = torch.zeros_like(attention_mask)
+ # put global attention on <s> token
+ global_attention_mask[:, 0] = 1
+
+ predicted_summary_ids = model.generate(input_ids, attention_mask=attention_mask, global_attention_mask=global_attention_mask)
+ print(tokenizer.batch_decode(predicted_summary_ids, skip_special_tokens=False))
+ ```
+
+ The output should look something like `<s> insertion <edit_pad> zero <edit_pad> bla bla bla some text </s>`, where `bla bla bla some text` stands in for the generated text.
+
+ #### Limitations and bias
+
+ Provide examples of latent issues and potential remediations.
+
+ ## Training data
+
+ The model starts from the pre-trained [LED model](https://huggingface.co/transformers/model_doc/led.html) and was fine-tuned on the dataset found in [this GitHub repo](https://github.com/hyesunyun/update_summarization_data).
+
+ ## Training procedure
+
+ Preprocessing, hardware used, hyperparameters...
+
+ ## Eval results
+
+ ### BibTeX entry and citation info
+
+ ```bibtex
+ @inproceedings{...,
+ year={2022}
+ }
+ ```
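
The `<EV>`, `<t>`, and `<abs>` layout described under "How to use" can be assembled with a small helper. This is a minimal sketch, not part of the model repository: `format_evidence` and `build_model_input` are hypothetical names, and the single-space separators are an assumption taken from the README's example string. The README does not spell out how the original summary should be tagged, so the sketch covers evidence articles only.

```python
from typing import Iterable, Tuple

# Separator tokens named in the README's "How to use" section.
EV_TOKEN = "<EV>"
TITLE_TOKEN = "<t>"
ABSTRACT_TOKEN = "<abs>"


def format_evidence(title: str, abstract: str) -> str:
    """Format one article as '<EV> <t> title <abs> abstract' (hypothetical helper)."""
    return f"{EV_TOKEN} {TITLE_TOKEN} {title.strip()} {ABSTRACT_TOKEN} {abstract.strip()}"


def build_model_input(articles: Iterable[Tuple[str, str]]) -> str:
    """Join formatted (title, abstract) pairs with single spaces (assumed separator)."""
    return " ".join(format_evidence(title, abstract) for title, abstract in articles)


articles = [
    (
        "Momordica charantia for type 2 diabetes mellitus.",
        "There is insufficient evidence to recommend momordica charantia "
        "for type 2 diabetes mellitus.",
    ),
]
model_input = build_model_input(articles)
print(model_input)
```

The resulting string can be passed to the tokenizer in place of a hand-written input, which keeps the separator tokens consistent when the article list changes.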