sydonayrex
/

Barely-Regal-10.7B-GGUF

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

sydonayrex commited on Jun 9

Commit

7eb54fe

•

1 Parent(s): a6ea651

Update README.md

Files changed (1) hide show

README.md +10 -4

README.md CHANGED Viewed

@@ -16,16 +16,22 @@ pipeline_tag: text-generation
 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63603f4a3605bd411c196eef/cU9HbxD9OYFlpSxj6k6Es.jpeg)
-The name of the model is not a ding against the models performance, more of a commentary of our current ISP infrastructure within the U.S. and the fact that many of our ISPs have not moved into the era of AI yet.
-They still are primarily monopolies, they put forth arbitrary limits on downloads, and they do not invest in improving services for customers in moderate to low density population areas.
-The base of this model is Mistral Instruct 0.3 that has been supersized using task arithmetic to combine layers, when folding it in on itself. In just me basic testing, this seems to have worked better than simple passthrough merging of layers, as the LLM has had less issues.
 In addition to the layer merging, the model has been further fine tuned using SFT using Unsloth to act as a base for further training and experimentation with DPO or ORPO (current DPO project in the process of being trained using Axolotl.)
 If you find the LLM is acting as if it has had a stroke, see if you have flash attn turned off and enable it is so. This seemed to correct any issues I had when running the model in LM Studio.
 # Uploaded  model
 - **Developed by:** sydonayrex

 ![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/63603f4a3605bd411c196eef/cU9HbxD9OYFlpSxj6k6Es.jpeg)
+The name of the model is not a ding against the models performance, more of a commentary of our current ISP infrastructure within the U.S. and the fact that many of our ISPs have not moved into the era of AI yet. They still are primarily monopolies, they put forth arbitrary caps on data transfer, and they do not invest in improving services for customers in moderate to low density population areas. Some of us hit those arbitrary data caps just trying to upload and download models.
+The base of this model is Mistral Instruct 0.3 that has been supersized using task arithmetic to combine layers, when folding it in on itself. This new model I call Artificial Innovation - Mistral 3, which will show as AI-M3-10.7B as the base model on hub. In just my basic testing, this seems to have worked better than simple passthrough merging of layers, as the LLM has had less issues.
 In addition to the layer merging, the model has been further fine tuned using SFT using Unsloth to act as a base for further training and experimentation with DPO or ORPO (current DPO project in the process of being trained using Axolotl.)
 If you find the LLM is acting as if it has had a stroke, see if you have flash attn turned off and enable it is so. This seemed to correct any issues I had when running the model in LM Studio.
+GGUFs are available here:
+Q4_K_M and Q8: https://huggingface.co/sydonayrex/Barely-Regal-10.7B-Q6_K-GGUF
+Q5_K_M: https://huggingface.co/sydonayrex/Barely-Regal-10.7B-Q5_K_M-GGUF
+Q6_K: https://huggingface.co/sydonayrex/Barely-Regal-10.7B-Q6_K-GGUF
 # Uploaded  model
 - **Developed by:** sydonayrex