DavidAU committed on
Commit
cd68cc4
1 Parent(s): 6e3d8e8

Update README.md

Files changed (1)
  1. README.md +8 -0
README.md CHANGED
@@ -31,6 +31,14 @@ tags:
31
  pipeline_tag: text-generation
32
  ---
33
 
34
  <h2>L3-Dark-Planet-8B-GGUF</h2>
35
 
36
  <img src="dark-planet.jpg" style="float:right; width:300px; height:300px; padding:10px;">
 
31
  pipeline_tag: text-generation
32
  ---
33
 
34
+ Updates Dec 21 2024: (uploading quants ... refreshed, and new quants):
35
+ - New quants, quantized with the latest LLAMACPP improvements: better instruction following and output generation across all quants.
36
+ - New "ARM" quants for machines that can run them. (format: ".../Q4_0_4_4.gguf")
37
+ - All quants have also been upgraded with "more bits" for output tensor and embed for better performance.
38
+ - Specialized additional quants: "max" and "max-cpu" (the file name will include this) for quants "Q2K" (max-cpu only), "IQ4_XS", "Q6_K" and "Q8_0".
39
+ - "MAX" : output tensor / embed at float 16. (better instruction following/output generation than standard quants)
40
+ - "MAX-CPU" : output tensor / embed at bfloat 16, which forces these onto the CPU (Nvidia cards / others will vary). This frees up VRAM at the cost of tokens/second, and you also get better instruction following/output generation.
41
+
42
  <h2>L3-Dark-Planet-8B-GGUF</h2>
43
 
44
  <img src="dark-planet.jpg" style="float:right; width:300px; height:300px; padding:10px;">
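
The "MAX" and "MAX-CPU" variants described in the added lines differ only in the type used for the output tensor and the token embeddings. As a minimal sketch of how such quants could be produced, the Python helper below assembles a command line for llama.cpp's `llama-quantize` tool using its `--output-tensor-type` and `--token-embedding-type` options. The commit does not say which commands were actually used, so this is an assumption; the helper name and the file names are hypothetical and for illustration only.

```python
from typing import List

# Output tensor / embedding type per variant, taken from the update notes:
# "MAX" uses float 16, "MAX-CPU" uses bfloat 16.
VARIANT_TENSOR_TYPE = {"max": "f16", "max-cpu": "bf16"}


def build_quantize_command(src: str, dst: str, quant_type: str, variant: str) -> List[str]:
    """Return an argument list for llama-quantize (hypothetical helper).

    src and dst are placeholder GGUF paths; quant_type is a llama.cpp
    quantization name such as "Q6_K" or "Q8_0".
    """
    if variant not in VARIANT_TENSOR_TYPE:
        raise ValueError(f"unknown variant: {variant!r}")
    tensor_type = VARIANT_TENSOR_TYPE[variant]
    return [
        "llama-quantize",
        "--output-tensor-type", tensor_type,    # type for the output tensor
        "--token-embedding-type", tensor_type,  # type for the token embeddings
        src,
        dst,
        quant_type,
    ]


if __name__ == "__main__":
    # Placeholder file names, not the names used in this repository.
    cmd = build_quantize_command("model-f32.gguf", "model-Q6_K-max.gguf", "Q6_K", "max")
    print(" ".join(cmd))
```

The function only builds the argument list; running it requires a local llama.cpp build and a source GGUF file.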