Triangle104/Dumpling-Mistral-Nemo-8B-Q6_K-GGUF

Triangle104 committed 12 days ago

Commit 83f3713 · verified · 1 Parent(s): 77d6bd2

Update README.md

Files changed (1)

README.md +38 -0

@@ -24,6 +24,44 @@ tags:
 This model was converted to GGUF format from [`nbeerbower/Dumpling-Mistral-Nemo-8B`](https://huggingface.co/nbeerbower/Dumpling-Mistral-Nemo-8B) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/nbeerbower/Dumpling-Mistral-Nemo-8B) for more details on the model.
+---
+🧪 Experimental
+An attempt to recover intelligence with a quick train; results are meh.
+Dumpling-Mistral-Nemo-8B
+nbeerbower/mistral-nemo-kartoffel-PRUNE3 finetuned on:
+- nbeerbower/GreatFirewall-DPO
+- nbeerbower/Schule-DPO
+- nbeerbower/Purpura-DPO
+- nbeerbower/Arkhaios-DPO
+- jondurbin/truthy-dpo-v0.1
+- antiven0m/physical-reasoning-dpo
+- flammenai/Date-DPO-NoAsterisks
+- flammenai/Prude-Phi3-DPO
+- Atsunori/HelpSteer2-DPO (1,000 samples)
+- jondurbin/gutenberg-dpo-v0.1
+- nbeerbower/gutenberg2-dpo
+- nbeerbower/gutenberg-moderne-dpo
+Method
+---
+QLoRA ORPO tune with 2x RTX 3090 for 2 epochs.
+---
 ## Use with llama.cpp
 Install llama.cpp through brew (works on Mac and Linux)