DavidAU
/

L3-Dark-Planet-8B-GGUF

Model card Files Files and versions Community

DavidAU commited on Dec 21, 2024

Commit

7ea7df2

·

verified ·

1 Parent(s): 3b2cf8c

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -43,7 +43,7 @@ pipeline_tag: text-generation
 - "MAX": output tensor / embed at float 16. (better instruction following/output generation than standard quants)
 - "MAX-CPU": output tensor / embed at bfloat 16, which forces these on to the CPU (Nvidia cards / other will vary), this frees up vram at cost of token/second and you get better instruction following/output generation too.
 - Q8_0 (Max,Max-CPU) now clocks in at almost 10 bits (average).
--
 <h2>L3-Dark-Planet-8B-GGUF</h2>
 <img src="dark-planet.jpg" style="float:right; width:300px; height:300px; padding:10px;">

 - "MAX": output tensor / embed at float 16. (better instruction following/output generation than standard quants)
 - "MAX-CPU": output tensor / embed at bfloat 16, which forces these on to the CPU (Nvidia cards / other will vary), this frees up vram at cost of token/second and you get better instruction following/output generation too.
 - Q8_0 (Max,Max-CPU) now clocks in at almost 10 bits (average).
 <h2>L3-Dark-Planet-8B-GGUF</h2>
 <img src="dark-planet.jpg" style="float:right; width:300px; height:300px; padding:10px;">