Quantization made by Richard Erkhov.

[Github](https://github.com/RichardErkhov)

[Discord](https://discord.gg/pvy7H8DZMG)

[Request more models](https://github.com/RichardErkhov/quant_request)


Tinypus-1.5B - AWQ
- Model creator: https://huggingface.co/Ba2han/
- Original model: https://huggingface.co/Ba2han/Tinypus-1.5B/
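A quick loading sketch (not part of the original card): it assumes these AWQ weights load through `transformers` with `autoawq` installed, and the repo id below is a placeholder to replace with this repository's actual id.

```python
# pip install autoawq transformers accelerate
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "REPLACE_WITH_THIS_AWQ_REPO_ID"  # placeholder: substitute this repo's id

tokenizer = AutoTokenizer.from_pretrained(model_id)
# transformers picks up the AWQ quantization config stored with the weights
# and runs the quantized layers through autoawq's kernels
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "Explain what a passthrough merge is in one paragraph."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```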

Original model description:
---
license: mit
datasets:
- garage-bAInd/Open-Platypus
pipeline_tag: text-generation
---
\***drumroll please**\*

**Introducing Tinypus!**



I passthrough-merged the base TinyLlama Chat model with itself, then fine-tuned it on roughly one third of the Platypus dataset.

Observations:

- It's smarter (I think?)

- It sometimes throws in a "### Instruction:" line. This could be due to the Platypus dataset, or to the fact that I know jack shit about programming. You can add it to "custom stopping strings" in oobabooga; a transformers-side sketch follows this list.

- It may be possible to train very specialized mini experts and merge them???
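Outside the web UI, a comparable effect can be had with a custom stopping criterion in `transformers`. This is a hedged sketch, not the author's setup; it assumes `model` and `tokenizer` are already loaded as in the snippet above.

```python
from transformers import StoppingCriteria, StoppingCriteriaList

class StopOnSubstring(StoppingCriteria):
    """Stop generation once a given substring shows up in the newly generated text."""
    def __init__(self, tokenizer, stop_string, prompt_len):
        self.tokenizer = tokenizer
        self.stop_string = stop_string
        self.prompt_len = prompt_len  # number of prompt tokens to skip when decoding

    def __call__(self, input_ids, scores, **kwargs):
        generated = self.tokenizer.decode(input_ids[0, self.prompt_len:], skip_special_tokens=True)
        return self.stop_string in generated

prompt = "List three facts about platypuses."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
stop = StoppingCriteriaList([
    StopOnSubstring(tokenizer, "### Instruction:", inputs["input_ids"].shape[1])
])
# The returned text still ends with the stop string itself; trim it if needed.
output = model.generate(**inputs, max_new_tokens=256, stopping_criteria=stop)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```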

**Template**

Same as TinyLlama/TinyLlama-1.1B-Chat-v1.0.
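Since the template matches TinyLlama-1.1B-Chat-v1.0, the prompt can presumably be built with that model's chat template; a small sketch (the messages are just examples):

```python
from transformers import AutoTokenizer

# The card says the template is the same as TinyLlama-1.1B-Chat-v1.0,
# so that tokenizer's chat template should produce the expected format.
tokenizer = AutoTokenizer.from_pretrained("TinyLlama/TinyLlama-1.1B-Chat-v1.0")

messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Summarize the Open-Platypus dataset in two sentences."},
]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
print(prompt)  # <|system|> ... <|user|> ... <|assistant|> formatted prompt
```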

**Merge details**

    slices:
      - sources:
          - model: E://text-generation-webui//models//TinyLlama
            layer_range: [0, 12]
      - sources:
          - model: E://text-generation-webui//models//TinyLlama
            layer_range: [4, 22]
    merge_method: passthrough
    dtype: bfloat16

(Stacking layers 0-11 and 4-21 of the 22-layer TinyLlama gives a roughly 30-layer model, which is where the jump from ~1.1B to ~1.5B parameters comes from.)

**QLoRA Details**

- Chunk length: 1152
- LoRA r / alpha: 64 / 128
- Epochs: 1
- Target modules: q, k, v, o projections
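Read as LoRA rank 64, alpha 128, on the attention q/k/v/o projections with 1152-token chunks. A hedged `peft` sketch of a comparable adapter configuration (dropout and anything else not listed above are assumptions, not values from the card):

```python
from peft import LoraConfig

# Approximate reconstruction of the card's QLoRA hyperparameters:
# r/alpha = 64/128, attention projections only, trained for one epoch
# on ~1/3 of Open-Platypus with 1152-token chunks.
lora_config = LoraConfig(
    r=64,
    lora_alpha=128,
    lora_dropout=0.05,  # assumption: not stated on the card
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    bias="none",
    task_type="CAUSAL_LM",
)
```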