shisa-ai
/

shisa-v1-llama3-70b

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

leonardlin commited on May 25

Commit

fec208d

•

1 Parent(s): 093800e

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -12,6 +12,8 @@ datasets:
 # shisa-v2 Base Model ablation
 This is a fine-tune Llama 3 70B Instruct with the primary `shisa-v1` dataset to improve Japanese language capabilities.
 This model uses a LR of 8e-6 that slightly improves performance vs the original 2e-5 tune (based on and validating predictive power of the the

 # shisa-v2 Base Model ablation
+*Per the  Llama 3 Community License Agreement, the official name of this model is "LLama 3 shisa-v1-llama3-70b"*
 This is a fine-tune Llama 3 70B Instruct with the primary `shisa-v1` dataset to improve Japanese language capabilities.
 This model uses a LR of 8e-6 that slightly improves performance vs the original 2e-5 tune (based on and validating predictive power of the the