Update README.md

This is an experimental model.

The idea is:

- Calculate the difference in weights between a donor model (meta-math/MetaMath-Mistral-7B) and the base model (mistralai/Mistral-7B-v0.1). This difference represents how much each parameter needs to be adjusted to go from the base state to the donor state.

```
vector = math_model.state_dict()[k] - base_model.state_dict()[k]
```
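Applied over the whole state dict, step one could look roughly like the sketch below. The loading code, `torch_dtype=torch.float16`, and the `task_vector` name are illustrative assumptions rather than the exact script used here, and the two checkpoints are assumed to share identical parameter names and shapes.

```
# Sketch of step one: build a "math" task vector for every parameter.
# Assumes both checkpoints fit in memory and have identical keys/shapes.
import torch
from transformers import AutoModelForCausalLM

base_model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1", torch_dtype=torch.float16
)
math_model = AutoModelForCausalLM.from_pretrained(
    "meta-math/MetaMath-Mistral-7B", torch_dtype=torch.float16
)

base_sd = base_model.state_dict()
math_sd = math_model.state_dict()

# One delta tensor per parameter: how far the donor moved away from the base.
task_vector = {k: math_sd[k] - base_sd[k] for k in base_sd}
```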
- The vector retrieved in step one is added to a third model (lex-hue/Delexa-7b). This should transfer **math** *skills* to our third model.

```
vector = math_model.state_dict()[k] - base_model.state_dict()[k]
new_v = v + vector.to(v.device)
v.copy_(new_v)
```
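Putting the two steps together, a minimal end-to-end sketch might look like the following. The loading code, dtype, and the `Delexa-7b-math` output directory are placeholders, not the exact script or repository name used for this model, and all three checkpoints are assumed to share identical parameter names and shapes.

```
# Sketch of the full merge: apply the donor-minus-base delta onto Delexa-7b.
# Assumes identical keys/shapes across all three checkpoints.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

base_model = AutoModelForCausalLM.from_pretrained(
    "mistralai/Mistral-7B-v0.1", torch_dtype=torch.float16
)
math_model = AutoModelForCausalLM.from_pretrained(
    "meta-math/MetaMath-Mistral-7B", torch_dtype=torch.float16
)
model = AutoModelForCausalLM.from_pretrained(
    "lex-hue/Delexa-7b", torch_dtype=torch.float16
)

base_sd = base_model.state_dict()
math_sd = math_model.state_dict()

with torch.no_grad():
    for k, v in model.state_dict().items():
        # Step one: the math task vector for this parameter.
        vector = math_sd[k] - base_sd[k]
        # Step two: shift Delexa's weights by that vector, in place.
        v.copy_(v + vector.to(v.device))

# Placeholder output directory for the merged checkpoint.
model.save_pretrained("Delexa-7b-math")
AutoTokenizer.from_pretrained("lex-hue/Delexa-7b").save_pretrained("Delexa-7b-math")
```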
### Example:
```
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch