unikei commited on
Commit
a59eab3
1 Parent(s): 637af1c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -7
README.md CHANGED
@@ -2,10 +2,7 @@
2
  license: bigscience-openrail-m
3
  widget:
4
  - text: >-
5
- wnt signalling orchestrates a number of developmental programs in response
6
- to this stimulus cytoplasmic beta catenin (encoded by ctnnb1) is stabilized
7
- enabling downstream transcriptional activation by members of the lef/tcf
8
- family
9
  datasets:
10
  - bigbio/drugprot
11
  - bigbio/ncbi_disease
@@ -20,11 +17,25 @@ tags:
20
  # DistilBERT base model for restoring punctuation of medical/biotech speed-to-text transcripts
21
  E.g.:
22
  ```
23
- EXAMPLE
 
 
 
 
 
 
 
24
  ```
25
  will be punctuated as follows:
26
  ```
27
- EXAMPLE
 
 
 
 
 
 
 
28
  ```
29
 
30
  ## How to use it in your code:
@@ -137,6 +148,13 @@ def punctuate(text, tokenizer, model):
137
  #
138
  # Example
139
  #
140
- text = ""
141
  result = punctuate(text, tokenizer, model)
142
  print(result)
 
 
 
 
 
 
 
 
2
  license: bigscience-openrail-m
3
  widget:
4
  - text: >-
5
+ the atm protein is a single high molecular weight protein predominantly confined to the nucleus of human fibroblasts but is present in both nuclear and microsomal fractions from human lymphoblast cells and peripheral blood lymphocytes atm protein levels and localization remain constant throughout all stages of the cell cycle truncated atm protein was not detected in lymphoblasts from ataxia telangiectasia patients homozygous for mutations leading to premature protein termination exposure of normal human cells to gamma irradiation and the radiomimetic drug neocarzinostatin had no effect on atm protein levels in contrast to a noted rise in p53 levels over the same time interval these findings are consistent with a role for the atm protein in ensuring the fidelity of dna repair and cell cycle regulation following genome damage
 
 
 
6
  datasets:
7
  - bigbio/drugprot
8
  - bigbio/ncbi_disease
 
17
  # DistilBERT base model for restoring punctuation of medical/biotech speed-to-text transcripts
18
  E.g.:
19
  ```
20
+ the atm protein is a single high molecular weight protein predominantly confined to the nucleus of human
21
+ fibroblasts but is present in both nuclear and microsomal fractions from human lymphoblast cells and peripheral
22
+ blood lymphocytes atm protein levels and localization remain constant throughout all stages of the cell cycle
23
+ truncated atm protein was not detected in lymphoblasts from ataxia telangiectasia patients homozygous
24
+ for mutations leading to premature protein termination exposure of normal human cells to gamma irradiation and the
25
+ radiomimetic drug neocarzinostatin had no effect on atm protein levels in contrast to a noted rise in p53 levels
26
+ over the same time interval these findings are consistent with a role for the atm protein in ensuring the fidelity
27
+ of dna repair and cell cycle regulation following genome damage
28
  ```
29
  will be punctuated as follows:
30
  ```
31
+ The ATM protein is a single, high-molecular-weight protein predominantly confined to the nucleus of human
32
+ fibroblasts, but is present in both nuclear and microsomal fractions from human lymphoblast cells and peripheral
33
+ blood lymphocytes. ATM protein levels and localization remain constant throughout all stages of the cell cycle.
34
+ Truncated ATM protein was not detected in lymphoblasts from ataxia-telangiectasia-patients homozygous
35
+ for mutations leading to premature protein termination. Exposure of normal human cells to gamma-irradiation and the
36
+ radiomimetic drug neocarzinostatin had no effect on ATM protein levels, in contrast to a noted rise in p53 levels
37
+ over the same time interval. These findings are consistent with a role for the ATM protein in ensuring the fidelity
38
+ of DNA repair and cell-cycle regulation following genome damage.
39
  ```
40
 
41
  ## How to use it in your code:
 
148
  #
149
  # Example
150
  #
151
+ text = "the atm protein is a single high molecular weight protein predominantly confined to the nucleus of human fibroblasts but is present in both nuclear and microsomal fractions from human lymphoblast cells and peripheral blood lymphocytes atm protein levels and localization remain constant throughout all stages of the cell cycle truncated atm protein was not detected in lymphoblasts from ataxia telangiectasia patients homozygous for mutations leading to premature protein termination exposure of normal human cells to gamma irradiation and the radiomimetic drug neocarzinostatin had no effect on atm protein levels in contrast to a noted rise in p53 levels over the same time interval these findings are consistent with a role for the atm protein in ensuring the fidelity of dna repair and cell cycle regulation following genome damage"
152
  result = punctuate(text, tokenizer, model)
153
  print(result)
154
+
155
+
156
+ """
157
+ Output:
158
+ The ATM protein is a single, high-molecular-weight protein predominantly confined to the nucleus of human fibroblasts, but is present in both nuclear and microsomal fractions from human lymphoblast cells and peripheral blood lymphocytes. ATM protein levels and localization remain constant throughout all stages of the cell cycle. Truncated ATM protein was not detected in lymphoblasts from ataxia-telangiectasia-patients homozygous for mutations leading to premature protein termination. Exposure of normal human cells to gamma-irradiation and the radiomimetic drug neocarzinostatin had no effect on ATM protein levels, in contrast to a noted rise in p53 levels over the same time interval. These findings are consistent with a role for the ATM protein in ensuring the fidelity of DNA repair and cell-cycle regulation following genome damage.
159
+ """
160
+ ```