Teja-Gollapudi committed
Commit b06b002
1 Parent(s): ea64011

Update README.md

Files changed (1)
  1. README.md +6 -4
README.md CHANGED
@@ -70,7 +70,10 @@ output = tokenizer.decode(output1[0])
 
 print(output)
 
-'''
+```
+### Output
+
+
 Sure, I can help you with that!
 
 Attention mechanisms in transformer models are typically implemented using the attention mechanism in the self-attention layer. Self-attention allows the model to focus on different parts of the input sequence when processing it. This is achieved by computing a set of attention weights, which are used to weigh the contribution of each input element to the output.
@@ -118,9 +121,8 @@ The `query`, `key`, and `value` tensors represent the input sequence to the tran
 The output of the `attention_weights` function is a NumPy tensor that represents the attention weights for the input sequence. These weights are used by the transformer model to weigh the contribution of each input element to the output.
 
 I hope this helps!</s>
-'''
-```
-
+<hr>
+
 ## Finetuning details
 The finetuning scripts will be available in our [RAIL Github Repository](https://github.com/vmware-labs/research-and-development-artificial-intelligence-lab/tree/main/instruction-tuning)
 ## Evaluation
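
The model output quoted in the README describes computing attention weights and an `attention_weights` function that returns a NumPy tensor whose entries weigh each input element's contribution to the output. The commit does not include that function's definition, so the sketch below is only an illustration of the idea under the usual scaled dot-product assumption; the function name, signature, and internals are assumptions, not the README's actual code.

```python
import numpy as np

def attention_weights(query, key):
    """Return a (seq_len, seq_len) NumPy array of attention weights.

    Entry (i, j) indicates how strongly input position i attends to
    position j; each row sums to 1. Hypothetical sketch, assuming
    standard scaled dot-product attention.
    """
    d_k = query.shape[-1]
    scores = query @ key.T / np.sqrt(d_k)         # similarity scores
    scores -= scores.max(axis=-1, keepdims=True)  # softmax numerical stability
    weights = np.exp(scores)
    return weights / weights.sum(axis=-1, keepdims=True)

# Usage: weigh the contribution of each input element to the output,
# as the quoted explanation describes.
rng = np.random.default_rng(0)
seq_len, d_k = 4, 8
query, key, value = (rng.standard_normal((seq_len, d_k)) for _ in range(3))
weights = attention_weights(query, key)
output = weights @ value  # each output row is a weighted sum of the value rows
```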