Vezora/Mistral-14b-Merge-Base

Vezora committed on Nov 3, 2023

Commit 78d2be8 • 1 Parent(s): d40b8de

Update README.md

Files changed (1)

README.md +2 -0

@@ -2,6 +2,8 @@
 license: apache-2.0
 ---
+<img src="https://imgur.com/a/3HUIVxJ" width="300" alt="Description of the image">
 # Mistral 14b: A New Base Model
 The objective of this model is to serve as a new base model for Mistral 14b. It has been enhanced with a LoRa adapter attached to all 62 layers of the merged model. The model is capable of generating outputs and responding accurately to inputs. However, it tends to over-respond with unasked questions when asked to process more than 512 tokens, which is its training limit using QLoRa.