h2oai
/

h2ogpt-4096-llama2-7b-chat

Text Generation

text-generation-inference

Model card Files Files and versions Community

arnocandel commited on Aug 10, 2023

Commit

f55dcbf

·

1 Parent(s): 8d8266c

Update README.md

Files changed (1) hide show

README.md +32 -1

README.md CHANGED Viewed

@@ -20,4 +20,35 @@ Try it live on our [h2oGPT demo](https://gpt.h2o.ai) with side-by-side LLM compa
 See how it compares to other models on our [LLM Leaderboard](https://evalgpt.ai/)!
-See more at [H2O.ai](https://h2o.ai/)

 See how it compares to other models on our [LLM Leaderboard](https://evalgpt.ai/)!
+See more at [H2O.ai](https://h2o.ai/)
+## Model Architecture
+```
+LlamaForCausalLM(
+  (model): LlamaModel(
+    (embed_tokens): Embedding(32000, 4096, padding_idx=0)
+    (layers): ModuleList(
+      (0-31): 32 x LlamaDecoderLayer(
+        (self_attn): LlamaAttention(
+          (q_proj): Linear(in_features=4096, out_features=4096, bias=False)
+          (k_proj): Linear(in_features=4096, out_features=4096, bias=False)
+          (v_proj): Linear(in_features=4096, out_features=4096, bias=False)
+          (o_proj): Linear(in_features=4096, out_features=4096, bias=False)
+          (rotary_emb): LlamaRotaryEmbedding()
+        )
+        (mlp): LlamaMLP(
+          (gate_proj): Linear(in_features=4096, out_features=11008, bias=False)
+          (up_proj): Linear(in_features=4096, out_features=11008, bias=False)
+          (down_proj): Linear(in_features=11008, out_features=4096, bias=False)
+          (act_fn): SiLUActivation()
+        )
+        (input_layernorm): LlamaRMSNorm()
+        (post_attention_layernorm): LlamaRMSNorm()
+      )
+    )
+    (norm): LlamaRMSNorm()
+  )
+  (lm_head): Linear(in_features=4096, out_features=32000, bias=False)
+)
+```