Update README.md
README.md CHANGED
````diff
@@ -26,15 +26,17 @@ I did not expect this repo to blow up and now all the training scripts depend on
 
 Now for the magic trained finetune that runs at insane speeds:
 
-```verilog
-wget https://huggingface.co/nisten/Biggie-SmoLlm-0.15B-Base/resolve/main/biggie_groked_int8_q8_0.gguf
-```
 The settings are very finicky so be careful with your experimentation
-```
-./llama-cli -fa -b 512 -ctv q8_0 -ctk q8_0 --min-p 0.3 --top-p 0.85 --keep -1
+```verilog
+./llama-cli -fa -b 512 -ctv q8_0 -ctk q8_0 --min-p 0.3 --top-p 0.85 --keep -1 \
+-p "You are a NASA JPL Scientists. Human: I want to bring my cat to mars." \
+--in-prefix "<|im_start|>Human:" --reverse-prompt "Human:" \
+-m biggie_groked_int8_q8_0.gguf -co -cnv \
+-c 1024 -n 700 --temp 1.5 -ngl 0 -t 1
 ```
+Yup, that's no GPU, 1 CPU core.
 
-
+This base model was built via semi-automated continuous merging to figure out the recipe.
 Model is more coherent.
 
 The temperature settings and min-p etc. need to be adjusted, but even at default temp 0 it was coherent for the first 100 tokens.
````
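The `--temp`, `--min-p`, and `--top-p` flags in the command above configure llama.cpp's sampling chain. As a rough illustration of what those three knobs do (a toy sketch of the concepts only, not llama.cpp's actual implementation, and llama.cpp's sampler ordering is itself configurable):

```python
import math

def filter_candidates(logits, temp=1.5, min_p=0.3, top_p=0.85):
    """Toy sketch of temperature + min-p + top-p (nucleus) filtering.

    Returns the indices of tokens that survive filtering, best first.
    Defaults mirror the flags in the README's llama-cli command.
    """
    # Temperature scaling: higher temp flattens the distribution.
    scaled = [x / temp for x in logits]
    # Softmax, shifted by the max for numerical stability.
    m = max(scaled)
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # min-p: drop tokens whose probability is below min_p * p(best token).
    p_max = max(probs)
    kept = [i for i, p in enumerate(probs) if p >= min_p * p_max]
    # top-p: keep the smallest high-probability prefix whose cumulative
    # mass reaches top_p.
    kept.sort(key=lambda i: probs[i], reverse=True)
    out, cum = [], 0.0
    for i in kept:
        out.append(i)
        cum += probs[i]
        if cum >= top_p:
            break
    return out
```

This is also why the settings interact: raising `--temp` flattens the distribution first, so `--min-p 0.3` (discard anything less than 30% as likely as the top candidate) prunes less aggressively than it would at low temperature.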