avuhong committed on
Commit
e6ec0ef
1 Parent(s): 83386b9

Update README.md

Files changed (1)
  1. README.md +10 -2
README.md CHANGED
@@ -24,12 +24,20 @@ It achieves the following results on the evaluation set:
24
 
25
  ## Model description
26
 
27
- More information needed
28
 
29
  ## Intended uses & limitations
30
 
 
 
31
  ### Generate novel sequences for viral capsid proteins
32
 
 
 
 
 
 
 
33
  ### Calculate the perplexity of a protein sequence
34
 
35
  ```python
@@ -67,7 +75,7 @@ print(f"{calculatePerplexity(seq, model, tokenizer):.2f}")
67
 
68
  ## Training and evaluation data
69
 
70
- More information needed
71
 
72
  ## Training procedure
73
 
 
24
 
25
  ## Model description
26
 
27
+ This is a GPT2-like model for generating capsid amino acid sequences. It was trained exclusively on capsid amino acid sequences (aa_seqs) of Piccovirales members.
28
 
29
  ## Intended uses & limitations
30
 
31
+ Like any GPT-style model, it can be used to generate new sequences or to evaluate the perplexity of given sequences.
32
+
33
  ### Generate novel sequences for viral capsid proteins
34
 
35
+ ```python
36
+ from transformers import pipeline
37
+ protgpt2 = pipeline('text-generation', model="avuhong/PiccoviralesGPT")
38
+ sequences = protgpt2("<|endoftext|>", max_length=750, do_sample=True, top_k=950, repetition_penalty=1.2, num_return_sequences=10, eos_token_id=0)
39
+ ```
40
+
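The generation call above returns a list of dicts, each with a `generated_text` key. A minimal sketch of turning that output into FASTA records follows. It is not part of this commit: `to_fasta` and its `prefix` argument are hypothetical names, and the cleanup assumes the generated text may still contain the `<|endoftext|>` marker and line breaks, which should be checked against real output from this model.

```python
def to_fasta(outputs, prefix="generated"):
    """Convert text-generation pipeline outputs into a FASTA-formatted string.

    `outputs` is expected to be a list of dicts with a "generated_text" key,
    as returned by a transformers text-generation pipeline.
    """
    records = []
    for i, item in enumerate(outputs, start=1):
        # Assumption: the prompt marker may be echoed in the generated text.
        text = item["generated_text"].replace("<|endoftext|>", "")
        # Assumption: sequences may be wrapped over several lines; join them.
        seq = "".join(text.split())
        if not seq:
            continue  # skip generations that contain no residues
        records.append(f">{prefix}_{i}\n{seq}")
    return "\n".join(records)


# Usage with the `sequences` variable from the snippet above:
# with open("generated.fasta", "w") as handle:
#     handle.write(to_fasta(sequences))
```

Keeping the original index in each header (even when an empty generation is skipped) makes it easier to trace a record back to its position in the pipeline output.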
41
  ### Calculate the perplexity of a protein sequence
42
 
43
  ```python
 
75
 
76
  ## Training and evaluation data
77
 
78
+ The training script is included as a bash file in this repository.
79
 
80
  ## Training procedure
81