Róger Nascimento Santos committed
Commit 1df3f80 · Parent(s): 1fb1a13
Update README.md
README.md CHANGED
library_name: peft
---
This adapter model, built using PEFT, was made on top of openlm-research/open_llama_3b_v2 (https://huggingface.co/openlm-research/open_llama_3b_v2).
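A minimal sketch of how an adapter like this is typically loaded with PEFT on top of the base model; the adapter repository id below is a placeholder for illustration, not a name stated in this card:

```python
# Sketch: load the open_llama_3b_v2 base model and apply a PEFT adapter on top of it.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "openlm-research/open_llama_3b_v2"
adapter_id = "your-username/open-llama-3b-v2-pt-adapter"  # hypothetical placeholder id

tokenizer = AutoTokenizer.from_pretrained(base_id)
base_model = AutoModelForCausalLM.from_pretrained(
    base_id,
    torch_dtype=torch.float16,
    device_map="auto",
)

# PeftModel.from_pretrained wraps the base model with the adapter weights.
model = PeftModel.from_pretrained(base_model, adapter_id)
model.eval()
```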
It is not perfect in Portuguese, but it is at a good point to be trained a bit further for a specific task in this language.
Consider checking the Jupyter notebooks in the files section for more information.
These notebooks were taken from the web and are very similar to those of the "cabrita" model, which was built on top of LLaMA 1.
The adapter was trained for only 120 steps, with results very similar to VMware/open-llama-13b-open-instruct.
It may be necessary to adjust the inference parameters to make it work better.
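For example, continuing from the loading sketch above, sampling parameters such as temperature, top_p and repetition_penalty are the usual knobs to tune; the values below are only illustrative starting points, not settings confirmed for this adapter:

```python
# Illustrative generation call; adjust these parameters to improve the Portuguese output.
prompt = "Explique em poucas palavras o que é aprendizado de máquina."
inputs = tokenizer(prompt, return_tensors="pt").to(base_model.device)

output_ids = model.generate(
    **inputs,
    max_new_tokens=256,
    do_sample=True,
    temperature=0.7,        # lower values give more deterministic answers
    top_p=0.9,              # nucleus sampling cutoff
    repetition_penalty=1.1, # discourages repeated phrases
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```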