J-LAB commited on
Commit
4c14389
1 Parent(s): ab3b6fd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +24 -30
README.md CHANGED
@@ -1,37 +1,31 @@
1
  ---
2
  license: apache-2.0
3
  tags:
4
- - merge
5
- - mergekit
6
- - lazymergekit
7
  - JJhooww/Mistral-7B-v0.2-Base_ptbr
8
  - J-LAB/BRisa
9
  ---
10
 
11
- # BRisa-7B-Instruct-v0.2
12
-
13
- BRisa-7B-Instruct-v0.2 is a merge of the following models using [mergekit](https://github.com/cg123/mergekit):
14
- * [JJhooww/Mistral-7B-v0.2-Base_ptbr](https://huggingface.co/JJhooww/Mistral-7B-v0.2-Base_ptbr)
15
- * [J-LAB/BRisa](https://huggingface.co/J-LAB/BRisa)
16
-
17
- ## 🧩 Configuration
18
-
19
- ```yaml
20
- slices:
21
- - sources:
22
- - model: JJhooww/Mistral-7B-v0.2-Base_ptbr
23
- layer_range: [0, 32]
24
- - model: J-LAB/BRisa
25
- layer_range: [0, 32]
26
- merge_method: slerp
27
- base_model: JJhooww/Mistral-7B-v0.2-Base_ptbr
28
- parameters:
29
- t:
30
- - filter: self_attn
31
- value: [0, 0.5, 0.3, 0.7, 1]
32
- - filter: mlp
33
- value: [1, 0.5, 0.7, 0.3, 0]
34
- - value: 0.5
35
- dtype: bfloat16
36
-
37
- ```
 
1
  ---
2
  license: apache-2.0
3
  tags:
 
 
 
4
  - JJhooww/Mistral-7B-v0.2-Base_ptbr
5
  - J-LAB/BRisa
6
  ---
7
 
8
+ # Model Card for Model ID
9
+
10
+ <!-- Provide a quick summary of what the model is/does. -->
11
+
12
+ This is an instruction model trained for good performance in Portuguese. The initial base is the Mistral 7B v2 Model ([source](https://huggingface.co/mistral-community/Mistral-7B-v0.2)). We utilized the JJhooww/Mistral-7B-v0.2-Base_ptbr version pre-trained on 1 billion tokens in Portuguese ([source](https://huggingface.co/JJhooww/Mistral-7B-v0.2-Base_ptbr)).
13
+
14
+
15
+ ## Model Details
16
+
17
+ ### Model Description
18
+
19
+ <!-- Provide a longer summary of what this model is. -->
20
+
21
+ The base model has good performance in Portuguese but faces significant challenges following instructions. We therefore used the version mistralai/Mistral-7B-Instruct-v0.2 and fine-tuned it for responses in Portuguese, then merged it with the base JJhooww/Mistral-7B-v0.2-Base_ptbr (https://huggingface.co/JJhooww/Mistral-7B-v0.2-Base_ptbr).
22
+
23
+ - **Developed by:** ([J-LAB](https://huggingface.co/J-LAB/))
24
+ - **Language(s) (NLP):** Portuguese
25
+ - **License:** *APACHE*
26
+ - **Finetuned from model [optional]:** ([source](https://huggingface.co/JJhooww/Mistral-7B-v0.2-Base_ptbr))
27
+
28
+ ### Model Sources [optional]
29
+
30
+ - **Demo [optional]:** ([Spaces](https://huggingface.co/J-LAB/))
31
+