ISLAB309 commited on
Commit
a4b6510
1 Parent(s): 6677417

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +38 -85
README.md CHANGED
@@ -1,41 +1,35 @@
1
  ---
2
- library_name: peft
3
- base_model: EleutherAI/pythia-160m-deduped
4
  ---
5
 
6
  # Model Card for Model ID
7
 
8
- <!-- Provide a quick summary of what the model is/does. -->
9
-
10
-
11
 
12
  ## Model Details
13
 
14
  ### Model Description
15
 
16
- <!-- Provide a longer summary of what this model is. -->
17
-
18
 
 
 
 
 
 
19
 
20
- - **Developed by:** [More Information Needed]
21
- - **Funded by [optional]:** [More Information Needed]
22
- - **Shared by [optional]:** [More Information Needed]
23
- - **Model type:** [More Information Needed]
24
- - **Language(s) (NLP):** [More Information Needed]
25
- - **License:** [More Information Needed]
26
- - **Finetuned from model [optional]:** [More Information Needed]
27
 
28
- ### Model Sources [optional]
29
-
30
- <!-- Provide the basic links for the model. -->
31
-
32
- - **Repository:** [More Information Needed]
33
- - **Paper [optional]:** [More Information Needed]
34
- - **Demo [optional]:** [More Information Needed]
35
 
36
  ## Uses
37
 
38
- <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 
 
39
 
40
  ### Direct Use
41
 
@@ -51,27 +45,33 @@ base_model: EleutherAI/pythia-160m-deduped
51
 
52
  ### Out-of-Scope Use
53
 
54
- <!-- This section addresses misuse, malicious use, and uses that the model will not work well for. -->
55
 
56
- [More Information Needed]
57
-
58
- ## Bias, Risks, and Limitations
59
 
60
- <!-- This section is meant to convey both technical and sociotechnical limitations. -->
61
 
62
- [More Information Needed]
63
 
64
- ### Recommendations
65
 
66
- <!-- This section is meant to convey recommendations with respect to the bias, risk, and technical limitations. -->
67
 
68
- Users (both direct and downstream) should be made aware of the risks, biases and limitations of the model. More information needed for further recommendations.
69
 
70
- ## How to Get Started with the Model
 
 
71
 
72
- Use the code below to get started with the model.
 
 
 
73
 
74
- [More Information Needed]
 
 
 
75
 
76
  ## Training Details
77
 
@@ -81,7 +81,7 @@ Use the code below to get started with the model.
81
 
82
  [More Information Needed]
83
 
84
- ### Training Procedure
85
 
86
  <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
87
 
@@ -126,47 +126,11 @@ Use the code below to get started with the model.
126
 
127
  ### Results
128
 
129
- [More Information Needed]
130
-
131
- #### Summary
132
-
133
-
134
-
135
- ## Model Examination [optional]
136
-
137
- <!-- Relevant interpretability work for the model goes here -->
138
-
139
- [More Information Needed]
140
-
141
- ## Environmental Impact
142
-
143
- <!-- Total emissions (in grams of CO2eq) and additional considerations, such as electricity usage, go here. Edit the suggested text below accordingly -->
144
-
145
- Carbon emissions can be estimated using the [Machine Learning Impact calculator](https://mlco2.github.io/impact#compute) presented in [Lacoste et al. (2019)](https://arxiv.org/abs/1910.09700).
146
-
147
- - **Hardware Type:** [More Information Needed]
148
- - **Hours used:** [More Information Needed]
149
- - **Cloud Provider:** [More Information Needed]
150
- - **Compute Region:** [More Information Needed]
151
- - **Carbon Emitted:** [More Information Needed]
152
-
153
- ## Technical Specifications [optional]
154
-
155
- ### Model Architecture and Objective
156
-
157
- [More Information Needed]
158
-
159
- ### Compute Infrastructure
160
-
161
- [More Information Needed]
162
-
163
- #### Hardware
164
 
165
- [More Information Needed]
166
 
167
- #### Software
168
 
169
- [More Information Needed]
170
 
171
  ## Citation [optional]
172
 
@@ -180,12 +144,6 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
180
 
181
  [More Information Needed]
182
 
183
- ## Glossary [optional]
184
-
185
- <!-- If relevant, include terms and calculations in this section that can help readers understand the model or model card. -->
186
-
187
- [More Information Needed]
188
-
189
  ## More Information [optional]
190
 
191
  [More Information Needed]
@@ -196,9 +154,4 @@ Carbon emissions can be estimated using the [Machine Learning Impact calculator]
196
 
197
  ## Model Card Contact
198
 
199
- [More Information Needed]
200
-
201
-
202
- ### Framework versions
203
-
204
- - PEFT 0.7.1
 
1
  ---
2
+ library_name: transformers
3
+ tags: []
4
  ---
5
 
6
  # Model Card for Model ID
7
 
8
+ The OpenDeid AICUP Suite is a collection of models developed to facilitate deidentification and temporal normalization research (see paper). It contains sets of eight models of sizes 70M, 160M, 410M, 1B, 1.4B, 2.8B, 6.9B, and 12B based on the [Pythia Scaling Suite](https://huggingface.co/collections/EleutherAI/pythia-scaling-suite-64fb5dfa8c21ebb3db7ad2e1). For the 70m size, there are two sets of models: one trained on the original OpenDeid-AICUP corpus, and one trained on the corpus generated by the previous model.
 
 
9
 
10
  ## Model Details
11
 
12
  ### Model Description
13
 
14
+ This model is trained on the full OpenDeid-AICUP corpus released in the [ACIUP 2023 competition](https://codalab.lisn.upsaclay.fr/competitions/15425).
 
15
 
16
+ - **Developed by:** [ISLab](https://nkustislab.github.io/)
17
+ - **Model type:** Transformer-based Language Model
18
+ - **Language:** English
19
+ - **License:** Apache 2.0
20
+ - **Finetuned from model:** [EleutherAI/pythia-160m](https://huggingface.co/EleutherAI/pythia-160m)
21
 
22
+ ### Model Sources
 
 
 
 
 
 
23
 
24
+ - **Repository:** [ISLab-git](https://islab.ee.nkust.edu.tw:40000/hjdai/opendeid)
25
+ - **Paper:** [More Information Needed]
26
+ - **Demo:** [More Information Needed]
 
 
 
 
27
 
28
  ## Uses
29
 
30
+ The primary intended use of the OpenDeid AICUP Suite is research on the behavior, functionality, and limitations of large language models for the deidentification and normalization tasks proposed in the [ACIUP 2023 competition](https://codalab.lisn.upsaclay.fr/competitions/15425). This suite is intended to provide a controlled setting for performing scientific experiments.
31
+
32
+ The models in the suite work with the Hugging Face Transformers Library. You may also further fine-tune and adapt the model for deployment, as long as your use is in accordance with the Apache 2.0 license and conduct your own risk and bias assessment.
33
 
34
  ### Direct Use
35
 
 
45
 
46
  ### Out-of-Scope Use
47
 
48
+ Similar to the original Pythia Suite, the OpenDeid AICUP Suite is not intended for deployment. It is not a in itself a product and cannot be used for human-facing interactions. For example, the model may generate harmful or offensive text. Please evaluate the risks associated with your particular use case.
49
 
50
+ The OpenDeid models are English-language only, and are not suitable for translation or generating text in other languages.
 
 
51
 
52
+ OpenDeid-160M has been fine-tuned for the sensitive health information recognition and normalization tasks based on a pre-defined format. This means the OpenDeid AICUP Suite will not respond to a given prompt the way a product like ChatGPT does, which was fine-tuned using methods such as Reinforcement Learning from Human Feedback (RLHF) to better “follow” human instructions.
53
 
54
+ ## Bias, Risks, and Limitations
55
 
56
+ This OpenDeid AICUP models are based on the Pythia models which were pre-trained on the Pile and further fine-tuned on the OpenDeid AICUP corpus, a dataset compiled for the sensitive health information and normalization tasks. The fine-tuned models tend to generate outputs in the manner of a pre-defined output layout which may not suiable for downstream tasks like text summarization or translation.
57
 
58
+ ## How to Get Started with the Model
59
 
60
+ Use the code (based on [vLLM](https://github.com/vllm-project/vllm)) below to get started with the model.
61
 
62
+ ```
63
+ from vllm import LLM, SamplingParams
64
+ from transformers import AutoTokenizer
65
 
66
+ model = LLM('ISLabResearch/opendeid-160m-ft-full')
67
+ seed = 309
68
+ tokenizer = AutoTokenizer.from_pretrained(Name)
69
+ eos = tokenizer.eos_token
70
 
71
+ params = SamplingParams(max_tokens = 50, include_stop_str_in_output = True, temperature = 0,
72
+ ignore_eos = False, stop = [eos], seed=seed)
73
+ preds = model.generate("Hello", params, use_tqdm = False)
74
+ ```
75
 
76
  ## Training Details
77
 
 
81
 
82
  [More Information Needed]
83
 
84
+ ### Training Procedure
85
 
86
  <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
87
 
 
126
 
127
  ### Results
128
 
129
+ #### Sensitive Health Information Recognition Results
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
130
 
 
131
 
132
+ #### Temporal Information Normalization Results
133
 
 
134
 
135
  ## Citation [optional]
136
 
 
144
 
145
  [More Information Needed]
146
 
 
 
 
 
 
 
147
  ## More Information [optional]
148
 
149
  [More Information Needed]
 
154
 
155
  ## Model Card Contact
156
 
157
+ [More Information Needed]