garage-bAInd/Platypus-30B

ArielleE committed on Jun 26, 2023

Commit 655c2c5 • 1 Parent(s): 761ac41

Update README.md

Files changed (1)

README.md +13 -23

@@ -3,34 +3,26 @@ language:
 - en
 tags:
 - llama
-license: apache-2.0
 metrics:
 - MMLU
 - ARC
 - HellaSwag
 - TruthfulQA
-- ReClor
 ---
-# 🥳 Platypus30B has arrived!
 | Metric                | Value |
 |-----------------------|-------|
-| MMLU (5-shot)         | 64.2  |
-| ARC (25-shot)         | 76.7  |
 | HellaSwag (10-shot)   | 84.3  |
-| TruthfulQA (0-shot)   | 37.4  |
-| ReClor (0-shot)       | 70    |
-## Model Description
-Platypus30B is an instruction fine-tuned LlaMa model.
-## Apply Delta Weights
-```sh
-ADD
-```
 ## Usage
@@ -41,7 +33,7 @@ ADD
 ## Model Details
 * **Trained by**: [Ariel Lee & Cole Hunter, LINK TO WEBSITES]
-* **Model type:**  **Platypus30B** is an auto-regressive language model based on the LLaMA transformer architecture.
 * **Language(s)**: English
 * **License for base weights**: License for the base LLaMA model's weights is Meta's [non-commercial bespoke license](https://github.com/facebookresearch/llama/blob/main/MODEL_CARD.md).
@@ -52,15 +44,13 @@ ADD
 | \\(n_\text{layers}\\)     | 60    |
 | \\(n_\text{heads}\\)      | 52    |
-## Training
-### Training Dataset
 Dataset of highly filtered and curated question and answer pairs. Release TBD.
-### Training Procedure
-`lilloukas/Platypus30b` was instruction fine-tuned using lora [CITE REPO] on 2 A100 80GB with the following configuration:
 | Hyperparameter      | Value |
 |---------------------|-------|

 - en
 tags:
 - llama
+license: other
 metrics:
 - MMLU
 - ARC
 - HellaSwag
 - TruthfulQA
 ---
+# 🥳 Platypus-30B has arrived!
+Platypus-30B is an instruction fine-tuned model based on the LLaMA-30b transformer architecture.
 | Metric                | Value |
 |-----------------------|-------|
+| MMLU (5-shot)         | 65.4  |
+| ARC (25-shot)         | 64.6  |
 | HellaSwag (10-shot)   | 84.3  |
+| TruthfulQA (0-shot)   | 45.8  |
+|-----------------------|-------|
+| Avg.                  | 65    | 💥
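As a quick sanity check on the `Avg.` row added in this commit, the four scores on the new side of the diff can be averaged directly. This is only a sketch: the variable names are hypothetical, and it assumes the average is an unweighted mean of the four listed metrics, which the README does not state.

```python
# Scores copied from the updated metrics table in this commit.
scores = {
    "MMLU (5-shot)": 65.4,
    "ARC (25-shot)": 64.6,
    "HellaSwag (10-shot)": 84.3,
    "TruthfulQA (0-shot)": 45.8,
}

# Assumption: "Avg." is the plain arithmetic mean of the four benchmarks.
average = sum(scores.values()) / len(scores)

print(f"Unweighted mean: {average:.2f}")
print(f"Rounded to the nearest integer: {round(average)}")
```

Under that assumption the mean is about 65.0, which is consistent with the `65` reported in the table.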
 ## Usage
 ## Model Details
 * **Trained by**: [Ariel Lee & Cole Hunter, LINK TO WEBSITES]
+* **Model type:**  **Platypus-30B** is an auto-regressive language model based on the LLaMA transformer architecture.
 * **Language(s)**: English
 * **License for base weights**: License for the base LLaMA model's weights is Meta's [non-commercial bespoke license](https://github.com/facebookresearch/llama/blob/main/MODEL_CARD.md).
 | \\(n_\text{layers}\\)     | 60    |
 | \\(n_\text{heads}\\)      | 52    |
+## Training Dataset
 Dataset of highly filtered and curated question and answer pairs. Release TBD.
+## Training Procedure
+`lilloukas/Platypus-30b` was instruction fine-tuned using lora [CITE REPO] on 4 A100 80GB with the following configuration:
 | Hyperparameter      | Value |
 |---------------------|-------|