Update README.md
Browse files
README.md
CHANGED
@@ -1,7 +1,7 @@
|
|
1 |
---
|
2 |
license: cc-by-4.0
|
3 |
datasets:
|
4 |
-
- cyberagent/
|
5 |
language:
|
6 |
- ja
|
7 |
- en
|
@@ -9,7 +9,7 @@ language:
|
|
9 |
|
10 |
# Model Card for "calm2-7b-chat-dpo-experimental"
|
11 |
|
12 |
-
[cyberagent/calm2-7b-chat](https://huggingface.co/cyberagent/calm2-7b-chat)γ«[cyberagent/
|
13 |
DPOγ«γ―[Low-Rank Adaptation (LoRA)](https://huggingface.co/docs/peft/conceptual_guides/lora)γη¨γγΎγγγ
|
14 |
|
15 |
## Requirements, Usage, Chat Template
|
|
|
1 |
---
|
2 |
license: cc-by-4.0
|
3 |
datasets:
|
4 |
+
- cyberagent/chatbot-arena-ja-calm2-7b-chat-experimental
|
5 |
language:
|
6 |
- ja
|
7 |
- en
|
|
|
9 |
|
10 |
# Model Card for "calm2-7b-chat-dpo-experimental"
|
11 |
|
12 |
+
[cyberagent/calm2-7b-chat](https://huggingface.co/cyberagent/calm2-7b-chat)γ«[cyberagent/chatbot-arena-ja-calm2-7b-chat-experimental](https://huggingface.co/datasets/cyberagent/chatbot-arena-ja-calm2-7b-chat-experimental)γγΌγΏγ»γγγη¨γγ¦[Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290)γγγγ’γγ«γ§γγ
|
13 |
DPOγ«γ―[Low-Rank Adaptation (LoRA)](https://huggingface.co/docs/peft/conceptual_guides/lora)γη¨γγΎγγγ
|
14 |
|
15 |
## Requirements, Usage, Chat Template
|