postitive666 committed
Commit 5c1e0c5
1 Parent(s): 3da5d7d

Update README.md

Files changed (1): README.md (+6, -0)
README.md CHANGED
@@ -8,4 +8,10 @@ library_name: adapter-transformers
  ---
  ---
  license: apache-2.0
+ --- Model Architecture
+ Llama 3 is an auto-regressive language model that uses an optimized transformer architecture. The tuned versions use supervised fine-tuning (SFT) and reinforcement learning with human feedback (RLHF) to align with human preferences for helpfulness and safety.
+
+ |         | Training Data | Params | Context length | GQA | Token count | Knowledge cutoff |
+ |---------|---------------|--------|----------------|-----|-------------|------------------|
+ | Llama 3 | A new mix of publicly available online data. | 8B  | 8k | Yes | 15T+ | March, 2023    |
+ |         |                                              | 70B | 8k | Yes |      | December, 2023 |
+
+ Llama 3 family of models. Token counts refer to pretraining data only. Both the 8B and 70B versions use Grouped-Query Attention (GQA) for improved inference scalability.
  --- sft 1700 llama3 test, 25 EPOCH
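
The added card text credits Grouped-Query Attention (GQA) for the inference scalability of both model sizes but does not show what the mechanism looks like. Below is a minimal PyTorch sketch of the GQA idea for illustration only; the head counts and dimensions are assumed placeholders, not values read from this checkpoint's configuration.

```python
# Minimal sketch of Grouped-Query Attention (GQA), assuming PyTorch >= 2.0.
# Head counts and dimensions are illustrative; they are NOT this checkpoint's config.
import torch
import torch.nn.functional as F

def grouped_query_attention(q, k, v, n_q_heads=32, n_kv_heads=8):
    """q: (batch, seq, n_q_heads*head_dim); k, v: (batch, seq, n_kv_heads*head_dim)."""
    b, s, _ = q.shape
    head_dim = q.shape[-1] // n_q_heads
    group = n_q_heads // n_kv_heads  # query heads that share one K/V head

    # Split into heads: (batch, heads, seq, head_dim).
    q = q.view(b, s, n_q_heads, head_dim).transpose(1, 2)
    k = k.view(b, s, n_kv_heads, head_dim).transpose(1, 2)
    v = v.view(b, s, n_kv_heads, head_dim).transpose(1, 2)

    # Broadcast each K/V head to every query head in its group.
    k = k.repeat_interleave(group, dim=1)
    v = v.repeat_interleave(group, dim=1)

    # Standard causal scaled-dot-product attention over the expanded heads.
    out = F.scaled_dot_product_attention(q, k, v, is_causal=True)
    return out.transpose(1, 2).reshape(b, s, n_q_heads * head_dim)

# Example: 4x fewer K/V heads than query heads, so the KV cache is ~4x smaller.
q = torch.randn(1, 16, 32 * 64)
k = torch.randn(1, 16, 8 * 64)
v = torch.randn(1, 16, 8 * 64)
print(grouped_query_attention(q, k, v).shape)  # torch.Size([1, 16, 2048])
```

Because several query heads share a single key/value head, the key/value cache shrinks by the group factor (4x with the placeholder head counts above), which is the inference-scalability benefit the card attributes to GQA.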