winglian committed
Commit c989146 (1 parent: cbd7499)

Update README.md

Files changed (1):
  1. README.md +3 -1
README.md CHANGED
@@ -31,7 +31,7 @@ Manticore 13B Chat is a Llama 13B model fine-tuned on the following datasets alo
 
 **Manticore 13B Chat was trained on 25% of the datasets below. The datasets were merged, shuffled, and then sharded into 4 parts.**
 
-- de-duped pygmalion dataset
+- de-duped pygmalion dataset, filtered down to RP data
 - [riddle_sense](https://huggingface.co/datasets/riddle_sense) - instruct augmented
 - hellaswag, updated for detailed explanations w 30K+ rows
 - [gsm8k](https://huggingface.co/datasets/gsm8k) - instruct augmented
@@ -52,7 +52,9 @@ Manticore 13B
 Not added from Manticore 13B:
 - mmlu - mmlu datasets were not added to this model as the `test` split is used for benchmarks
 
+# Shoutouts
 
+Special thanks to Nanobit for helping with Axolotl, TheBloke for quantizing these models to make them more accessible to all, ehartford for cleaned datasets, and 0x000011b for the RP dataset.
 # Demo
 
 Try out the model in HF Spaces. The demo uses a quantized GGML version of the model to quickly return predictions on smaller GPUs (and even CPUs). Quantized GGML may have some minimal loss of model quality.