awacke1 committed on
Commit
94063a2
•
1 Parent(s): 49151d9

Update app.py

Files changed (1)
  1. app.py +42 -0
app.py CHANGED
@@ -94,6 +94,48 @@ with gr.Blocks(css=css) as demo:
  gr.ChatInterface(
  generate,
  additional_inputs=additional_inputs,
+ markdown="""
+ | Feature | Description | Byline |
+ |---------|-------------|--------|
+ | 🪟 Sliding Window Attention with 128K tokens span | Enables the model to have a larger context for each token. | Increases model's understanding of context, resulting in more coherent and contextually relevant outputs. |
+ | 🚀 GQA for faster inference | Grouped-Query Attention shares key/value heads across groups of query heads, allowing faster computation during inference. | Speeds up the model inference time without sacrificing too much on accuracy. |
+ | 📝 Byte-fallback BPE tokenizer | Uses Byte Pair Encoding but can fall back to byte-level encoding. | Allows the tokenizer to handle a wider variety of input text while keeping token size manageable. |
+ | 📜 License | Released under Apache 2.0 License | Gives you a permissive free software license, allowing you freedom to use, modify, and distribute the code. |
+ | 📦 Usage | | |
+ | 📚 Available on Huggingface Hub | The model can be easily downloaded and set up from Huggingface. | Makes it easier to integrate the model into various projects. |
+ | 🐍 Python code snippets for easy setup | Provides Python code snippets for quick and easy model setup. | Facilitates rapid development and deployment, especially useful for prototyping. |
+ | 📈 Expected speedups with Flash Attention 2 | Upcoming update expected to bring speed improvements. | Keep an eye out for this update to benefit from performance gains. |
+ # 🛠 Model Features and More 🛠
+
+ ## Features
+
+ - 🪟 Sliding Window Attention with 128K tokens span
+ - **Byline**: Increases model's understanding of context, resulting in more coherent and contextually relevant outputs.
+
+ - 🚀 GQA for faster inference
+ - **Byline**: Speeds up the model inference time without sacrificing too much on accuracy.
+
+ - 📝 Byte-fallback BPE tokenizer
+ - **Byline**: Allows the tokenizer to handle a wider variety of input text while keeping token size manageable.
+
+ - 📜 License: Released under Apache 2.0 License
+ - **Byline**: Gives you a permissive free software license, allowing you freedom to use, modify, and distribute the code.
+
+ ## Usage 📦
+
+ - 📚 Available on Huggingface Hub
+ - **Byline**: Makes it easier to integrate the model into various projects.
+
+ - 🐍 Python code snippets for easy setup
+ - **Byline**: Facilitates rapid development and deployment, especially useful for prototyping.
+
+ - 📈 Expected speedups with Flash Attention 2
+ - **Byline**: Keep an eye out for this update to benefit from performance gains.
+
+ """
+
+
+ gr.Markdown(markdown)
  examples = [
  ["🏰 Welcome to the Kingdom of Elandria! You are Jim and Tim, two bumbling bros with a knack for mischief. 🤴🤴 [Action: Introduce yourselves, Equipment: Scepters of Foolishness]"],
  ["🌲 You find yourselves in a forest filled with magical creatures and oddly specific 'Do Not Disturb' signs. 🦄 [Action: Proceed cautiously, Equipment: Map of Social Etiquette]"],
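
Note on the "Sliding Window Attention with 128K tokens span" entry in the text added by this commit: the figure is easier to read as a product of two numbers. The sketch below is a minimal, standard-library illustration, not code from this commit or from any model implementation. It assumes a per-layer window of 4096 tokens and 32 layers (the values published for Mistral 7B, which this feature list appears to describe; neither number is stated in the diff), and the helper names `sliding_window_mask` and `theoretical_span` are hypothetical.

```python
def sliding_window_mask(seq_len: int, window: int) -> list[list[bool]]:
    """Causal sliding-window mask: mask[i][j] is True when query
    position i may attend to key position j (the last `window`
    positions up to and including i)."""
    return [
        [i - window < j <= i for j in range(seq_len)]
        for i in range(seq_len)
    ]


def theoretical_span(window: int, num_layers: int) -> int:
    """Upper bound on how far back information can propagate when
    each stacked layer can look `window` tokens further back."""
    return window * num_layers


# Tiny example so the banded structure is visible.
mask = sliding_window_mask(seq_len=6, window=3)
for row in mask:
    print("".join("x" if allowed else "." for allowed in row))

# Assumed values (not stated in the diff): window 4096, 32 layers.
print(theoretical_span(window=4096, num_layers=32))  # 131072, i.e. 128 * 1024
```

Under these assumptions each token attends directly to at most 4096 earlier positions, and the "128K" span is only reached indirectly through stacked layers, which is why the per-token context is best described as a theoretical upper bound.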