KBlueLeaf committed on
Commit
16f6966
1 Parent(s): bbafa38

Update README.md

Files changed (1)
  1. README.md +16 -1
README.md CHANGED
@@ -101,4 +101,19 @@ User: 「今天天氣真好」は日本語で何ですか
 
 Response:
 「今天天氣真好」は、日本語で「今日の天気が良好だ」と言われています。
-```
+```
+
+
+## Some more information
+
+### Why use lora+embed+head
+First, I think it is obvious that when an LLM isn't good at some language and you want to fine-tune it for that language, you should train the embed and head parts (a configuration sketch is given at the end of this section).<br>
+But the question is: "Why not just do a native finetune?"<br>
+If you have looked at alpaca-style models or their training runs, you may notice that a lot of them share one problem: memorization.<br>
+The loss drops sharply at the beginning of every epoch, like some kind of overfitting.<br>
+In my opinion, this is because the parameter count of LLaMA is so large that the model simply memorizes all of the training data.
+
+But if I use LoRA on the attention part only (ignoring the MLP part), the trainable parameter count is not large enough to memorize the training data, so the model is much less likely to memorize everything.
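+
+As a rough back-of-envelope check of that claim (assuming the LLaMA-7B shape of 32 layers, hidden size 4096, and a 32000-token vocab, with an illustrative LoRA rank of 8 that may not match the exact training config):
+
+```python
+# Rough trainable-parameter counts; the shapes and rank here are assumptions.
+layers, hidden, vocab, rank = 32, 4096, 32000, 8
+
+# LoRA on the q/k/v/o attention projections: two rank-r matrices per module.
+lora_params = layers * 4 * 2 * hidden * rank   # ~8.4M
+# Fully trained token embedding and LM head.
+embed_head_params = 2 * vocab * hidden         # ~262M
+
+print(f"LoRA only: {lora_params / 1e6:.1f}M params")
+print(f"LoRA + embed + head: {(lora_params + embed_head_params) / 1e6:.1f}M params")
+print("vs ~6.7B params for a full finetune of LLaMA-7B")
+```
+So the LoRA part stays around 8M parameters, far below the 6.7B touched by a full finetune, which is the intuition behind the reduced memorization.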
+
+And here is the loss graph of this 2-epoch model:
+![Image](https://i.imgur.com/Z1ilyCm.png)
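+
+For reference, here is a minimal sketch of how this lora+embed+head setup can be expressed with the PEFT library. The module names follow the Hugging Face LLaMA implementation, and the model path, rank, and alpha are placeholders rather than the exact values used for this checkpoint:
+
+```python
+from transformers import LlamaForCausalLM
+from peft import LoraConfig, get_peft_model
+
+model = LlamaForCausalLM.from_pretrained("path/to/llama-7b")  # placeholder path
+
+config = LoraConfig(
+    r=8,            # illustrative rank
+    lora_alpha=16,  # illustrative scaling
+    lora_dropout=0.05,
+    # LoRA only on the attention projections; the MLP part stays frozen.
+    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
+    # Train the token embedding and LM head fully, alongside the LoRA weights.
+    modules_to_save=["embed_tokens", "lm_head"],
+    task_type="CAUSAL_LM",
+)
+
+model = get_peft_model(model, config)
+model.print_trainable_parameters()
+```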