Lambent/qwen2.5-reinstruct-alternate-lumen-14B

Lambent committed on Sep 24, 2024

Commit 41f5bd9 (verified) · 1 Parent(s): 53b3986

Update README.md

Files changed (1)

README.md +10 -0

@@ -13,6 +13,16 @@ tags:
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 ## Merge Details
+Extracted an approximate rank-128 LoRA of v000000/Qwen2.5-Lumen-14B (the difference between that model and Instruct),
+and first applied it to Lambent/qwen2.5-14B-alternate-instruct-slerp, which had no issues with EQ-Bench.
+Then, here, re-applied the original Instruct at a density and weight that gave me no EQ-Bench issues in previous merges.
+This one scores 77.6713 on EQ-Bench with no "emotions don't match reference" error (though possibly still one answer not parsed).
+That is similar to Lumen and the original Instruct, and slightly exceeds both (within margin of error).
+My hope is that this has healed Instruct somewhat and regained its intelligence.
 ### Merge Method
 This model was merged using the della merge method using [Lambent/qwen2.5-lumen-rebased-14B](https://huggingface.co/Lambent/qwen2.5-lumen-rebased-14B) as a base.
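
For readers unfamiliar with the "approximate LoRA, rank 128" step in the added README lines, here is a minimal sketch of the general idea: take the difference between a fine-tuned weight matrix and its base, then keep only the top singular directions. This is an illustration only and not the tooling used for this commit. The function names `extract_lora` and `apply_lora` are hypothetical, the matrices are small random stand-ins, and NumPy is used in place of real model weights.

```python
import numpy as np


def extract_lora(w_tuned, w_base, rank):
    """Hypothetical helper: factor (w_tuned - w_base) into low-rank B @ A.

    A truncated SVD gives the best rank-`rank` approximation of the
    weight difference in the Frobenius norm.
    """
    delta = w_tuned - w_base
    u, s, vt = np.linalg.svd(delta, full_matrices=False)
    r = min(rank, s.shape[0])
    sqrt_s = np.sqrt(s[:r])
    b = u[:, :r] * sqrt_s          # shape (out_features, r)
    a = sqrt_s[:, None] * vt[:r]   # shape (r, in_features)
    return b, a


def apply_lora(w_target, b, a, scale=1.0):
    """Hypothetical helper: add the low-rank update onto another weight matrix."""
    return w_target + scale * (b @ a)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    base = rng.normal(size=(64, 48))            # stand-in for an Instruct weight
    tuned = base + rng.normal(size=(64, 48))    # stand-in for a fine-tuned weight
    b, a = extract_lora(tuned, base, rank=8)
    rel_err = np.linalg.norm((tuned - base) - b @ a) / np.linalg.norm(tuned - base)
    print(f"relative error of the rank-8 approximation: {rel_err:.3f}")
    other = rng.normal(size=(64, 48))           # stand-in for a different target model
    patched = apply_lora(other, b, a)
    print(patched.shape)
```

The extraction is "approximate" because a full fine-tune difference is generally not low rank, so whatever lies outside the kept singular directions is discarded when the update is re-applied to a different model.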