DavidAU
/

D_AU-Mistral-7B-Instruct-v0.2-Bagel-DarkSapling-DPO-7B-v2.0-imat-plus-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

DavidAU commited on May 3

Commit

235e7d3

•

1 Parent(s): 42cbe9e

Update README.md

Files changed (1) hide show

README.md +31 -3

README.md CHANGED Viewed

@@ -1,3 +1,31 @@
----
-license: mit
----

+---
+license: mit
+---
+Imatrix compressions of FP Merge of "D_AU-Mistral-7B-Instruct-v0.2-Bagel-DarkSapling-DPO-7B-v2.0".
+"Imatrix Plus" is an upgraded form of Imatrix which using full precision for specific parts of the compression.
+As a result all compressions will be slightly larger in size than standard 13B compressions.
+This method results in a higher quality model, especially at lower compressions.
+This method is applied across all compressions from IQ1 to Q8.
+Even IQ1_S - the most compressed verison - works well, however IQ4/Q4 are suggested as minimums for quality.
+Highest quality will be Q6/Q8.
+In addition the Imatrix file used to "fix" the compressed files post compression resulted in
+over 2 whole points lower perplexity at IQ1_S vs some of the other "Imatrix" files currently in use.
+This merge was an experiment to test already established Roleplay, Fiction and Story
+generation of "DarkSapling" with a some of "Bagel"'s qualities with a Mistral Instruct Base.
+For Imatrix plus this was a test of high precision in specific areas of the model leading to a slightly larger compressed file.
+In addition the Imatrix process itself used a larger "calibration" file than standard to further enhance quality.
+The process added appoximately 310 MB to each compressed file.
+An additional enhancement added another 200 mb to each compressed file.
+A blank or standard Alpaca Template for text generation will work.
+Context length: 32768.
+Please see the orginal model card for specific details of use, additional credits and tips: