declare-lab
/

flan-alpaca-xl

Text2Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

chiayewken commited on Mar 28, 2023

Commit

a57e4b6

·

1 Parent(s): 69557a0

Update README.md

Files changed (1) hide show

README.md +7 -4

README.md CHANGED Viewed

@@ -9,10 +9,13 @@ datasets:
 Our [repository](https://github.com/declare-lab/flan-alpaca) contains code for extending the [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca)
 synthetic instruction tuning to existing instruction-tuned models such as [Flan-T5](https://arxiv.org/abs/2210.11416).
 The pretrained models and demos are available on HuggingFace 🤗 :
-[Base](https://huggingface.co/declare-lab/flan-alpaca-base) (220M),
-[Large](https://huggingface.co/declare-lab/flan-alpaca-large) (770M),
-[XL](https://huggingface.co/declare-lab/flan-alpaca-xl) (3B),
-XXL (11B, Coming soon)
 ### Why?

 Our [repository](https://github.com/declare-lab/flan-alpaca) contains code for extending the [Stanford Alpaca](https://github.com/tatsu-lab/stanford_alpaca)
 synthetic instruction tuning to existing instruction-tuned models such as [Flan-T5](https://arxiv.org/abs/2210.11416).
 The pretrained models and demos are available on HuggingFace 🤗 :
+| Model                                                                     | Parameters | Training GPUs   |
+|---------------------------------------------------------------------------|------------|-----------------|
+| [Flan-Alpaca-Base](https://huggingface.co/declare-lab/flan-alpaca-base)   | 220M       | 1x A6000        |
+| [Flan-Alpaca-Large](https://huggingface.co/declare-lab/flan-alpaca-large) | 770M       | 1x A6000        |
+| [Flan-Alpaca-XL](https://huggingface.co/declare-lab/flan-alpaca-xl)       | 3B         | 1x A6000        |
+| [Flan-Alpaca-XXL](https://huggingface.co/declare-lab/flan-alpaca-xxl)     | 11B        | 4x A6000 (FSDP) |
 ### Why?