Text Generation
GGUF
English
creative
creative writing
fiction writing
plot generation
sub-plot generation
story generation
scene continue
storytelling
fiction story
science fiction
romance
all genres
story
writing
vivid prose
vivid writing
fiction
roleplaying
bfloat16
swearing
rp
llama3
enhanced quants
max quants
maxcpu quants
horror
mergekit
Inference Endpoints
conversational
Update README.md
README.md
CHANGED
@@ -62,7 +62,9 @@ Example outputs below.
 - If you use rope to extend context, increase temp AND instruction detail levels to compensate for "rope issues".
 - Source code for this model (Bfloat16), Float 32 master GGUFs (and source), and Imatrix GGUF versions will be uploaded shortly at separate repos.
 
-Note the "float32" version of this model behaves VERY differently, which is why it was not uploaded first.
+Note the "float32" version of this model behaves VERY differently, which is why it was not uploaded first. Usually I would
+use the "float32" version only; however, the "character range" displayed by the Bfloat16 and Float32 versions of this model
+dictates that they have their own repos.
 
 The Imatrix versions of this model have even lower perplexity (1/2 level of magnitude lower than this model, 1 full level of magnitude
 lower than Llama3 Instruct) than both this model and Llama3 Instruct, with enhanced output.
@@ -70,6 +72,8 @@ lower than Llama3 Instruct) than both this model and Llama3 Instruct, with enhan
 This is a LLAMA3 model, and requires the Llama3 template, but may work with other template(s); it has a maximum context of 8k / 8192.
 However this can be extended using "rope" settings up to 32k.
 
+If you use the "Command-R" template your output will be very different from using the "Llama3" template.
+
 Here is the standard LLAMA3 template:
 
 <PRE>
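The "rope" extension mentioned in the README can be sketched as a llama.cpp-style invocation (the binary name, model filename, and exact flag names are assumptions and may differ by llama.cpp version; linear RoPE scaling from the native 8192 to 32768 tokens implies a frequency scale of 8192/32768 = 0.25):

```shell
# Sketch only: model filename is a placeholder, flags follow llama.cpp conventions.
# Linear RoPE scaling factor = native_ctx / target_ctx = 8192 / 32768 = 0.25
./llama-cli \
  -m model-Q4_K_M.gguf \
  -c 32768 \
  --rope-scaling linear \
  --rope-freq-scale 0.25 \
  --temp 1.2   # raised temp to compensate for "rope issues", per the note above
```

Per the README's note, when extending context this way you would also increase instruction detail in your prompts, not just temperature.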