lintang committed
Commit 2f07ae1
1 Parent: bd3df82

Update README.md

Files changed (1): README.md (+4 -2)
README.md CHANGED
@@ -97,8 +97,8 @@ Pile-T5 can be loaded using the `AutoModelForSeq2SeqLM` functionality:
 ```python
 from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
 
-tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-base")
-model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-base")
+tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-xl")
+model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-xl")
 ```
 
 ### Training
@@ -132,6 +132,8 @@ There are in total 200 checkpoints that are spaced 10,000 steps. For T5x-native
 checkpoints that can be used for finetuning with the T5x library, refer to [here](https://huggingface.co/lintang/pile-t5-base-t5x/tree/main)
 
 
+The training loss (in tfevent format) and validation perplexity (in jsonl) can be found [here](https://huggingface.co/EleutherAI/pile-t5-xl/blob/main/xl.zip).
+
 ### Evaluations
 
 Pile-T5 XL was evaluated on SuperGLUE, CodeXGLUE. A Flan-finetuned version was evaluated on Flan Held In tasks, MMLU and BBH.
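Beyond the diff: once the checkpoint name is updated, the model loads like any other `transformers` seq2seq model. A minimal usage sketch, not part of the commit; the `<extra_id_0>` sentinel assumes Pile-T5 keeps upstream T5's span-corruption convention, so check the tokenizer's special tokens if generation looks off:

```python
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM

tokenizer = AutoTokenizer.from_pretrained("EleutherAI/pile-t5-xl")
model = AutoModelForSeq2SeqLM.from_pretrained("EleutherAI/pile-t5-xl")

# Fill-in-the-blank prompt; "<extra_id_0>" is the standard T5 sentinel
# token and is an assumption here, not something this commit specifies.
inputs = tokenizer("The Pile is a large <extra_id_0> dataset.", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The same call pattern should cover the intermediate checkpoints mentioned in the second hunk, e.g. `from_pretrained("EleutherAI/pile-t5-xl", revision=...)` if they are published as branch revisions; the revision names are not stated in this commit.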
 
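The newly linked xl.zip is described as training loss in tfevent format plus validation perplexity in jsonl. A hedged sketch for inspecting both after unzipping, assuming standard TensorBoard event files and one JSON object per line; the file names and the "loss" scalar tag are guesses, so list the archive contents and print the available tags first:

```python
import json
from tensorboard.backend.event_processing.event_accumulator import EventAccumulator

# Path to an event file from the unzipped archive; the exact name inside
# xl.zip is an assumption -- list the directory to find the real one.
ea = EventAccumulator("xl/events.out.tfevents.example")
ea.Reload()
print(ea.Tags()["scalars"])  # discover the actual scalar tag names

# "loss" is a hypothetical tag; substitute one of the tags printed above.
for event in ea.Scalars("loss"):
    print(event.step, event.value)

# Validation perplexity: one JSON record per line (keys are assumptions).
with open("xl/validation.jsonl") as f:
    for line in f:
        print(json.loads(line))
```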