Update README.md
README.md (CHANGED)
@@ -1,5 +1,9 @@
 ---
-base_model:
+base_model:
+- Qwen/Qwen2.5-72B-Instruct
+- huihui-ai/Qwen2.5-72B-Instruct-abliterated
+- Qwen/Qwen2.5-72B
+- spow12/ChatWaifu_72B_v2.2
 license: mit
 datasets:
 - arcee-ai/EvolKit-75K
@@ -14,12 +18,11 @@ Experimental commander model V1.
 
 Named it Zelensky in order to troll Uncle Elon on Twitter over how bad Grok-2 is.
 
-Training process, low 1 epoch learning rate and evolutionary-merged via https://github.com/arcee-ai/EvolKit
+Training process: a low learning rate for 1 epoch, then evolutionary merging with the three other listed models via https://github.com/arcee-ai/EvolKit
 
-Process on 8x AMD Mi300 192GB gpus.
+The process was repeated multiple times on 8x AMD MI300 192 GB GPUs while also running gpqa_diamond_zeroshot on the LM Eval harness.
 
 Thank you Vultr https://www.vultr.com/register/ for sponsoring the compute.
 
 
-
-Qwen License applies by default.
+The Qwen License still applies by default.
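The diff credits https://github.com/arcee-ai/EvolKit for the evolutionary merge over the four listed base models, but the actual merge recipe is not shown in the commit. As a loose illustration only, here is a hypothetical mergekit-style candidate config written from Python; the merge method, weights, and densities are assumptions, not the author's settings.

```python
# Hypothetical sketch: one merge candidate over the four base models listed in the
# README front matter, expressed as a mergekit-style YAML config. The merge method,
# weights, and densities are invented placeholders; the actual EvolKit search settings
# are not given in the commit.
import yaml  # PyYAML

candidate = {
    "base_model": "Qwen/Qwen2.5-72B-Instruct",
    "merge_method": "dare_ties",  # assumed method, not stated by the author
    "dtype": "bfloat16",
    "models": [
        {"model": "Qwen/Qwen2.5-72B-Instruct",
         "parameters": {"weight": 0.40, "density": 0.6}},
        {"model": "huihui-ai/Qwen2.5-72B-Instruct-abliterated",
         "parameters": {"weight": 0.25, "density": 0.6}},
        {"model": "Qwen/Qwen2.5-72B",
         "parameters": {"weight": 0.20, "density": 0.6}},
        {"model": "spow12/ChatWaifu_72B_v2.2",
         "parameters": {"weight": 0.15, "density": 0.6}},
    ],
}

# An evolutionary search would mutate the weight/density values, build each candidate,
# and keep whichever scores best on the chosen benchmark.
with open("merge-candidate.yaml", "w") as f:
    yaml.safe_dump(candidate, f, sort_keys=False)
```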
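The added line about running gpqa_diamond_zeroshot on the LM Eval harness presumably refers to EleutherAI's lm-evaluation-harness. A minimal sketch of scoring one merge candidate that way, assuming the harness's Python API and a placeholder checkpoint path:

```python
# Minimal sketch: evaluating a merge candidate on GPQA-Diamond (zero-shot) with the
# EleutherAI lm-evaluation-harness (pip install lm-eval). The checkpoint path is a
# placeholder, not the actual model name.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",  # Hugging Face transformers backend
    model_args="pretrained=/path/to/merge-candidate,dtype=bfloat16",
    tasks=["gpqa_diamond_zeroshot"],
    batch_size=8,
)

# Per-task metrics are keyed by task name under "results".
print(results["results"]["gpqa_diamond_zeroshot"])
```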