Add Flax checkpoints
#95 opened 6 months ago
by
ksmcg
Update README.md
#93 opened 6 months ago
by
AzerOuerghi
can i use mistral as embedding model?
8
#92 opened 6 months ago
by
raynWest
Adding `safetensors` variant of this model
2
#91 opened 7 months ago
by
lcahill
Adding Evaluation Results
#90 opened 7 months ago
by
leaderboard-pr-bot
Embeddings API
3
#88 opened 7 months ago
by
priamai
Update config.json
#86 opened 7 months ago
by
PlanetDOGE
Create README.md
#80 opened 7 months ago
by
joey1895
Keyerror "Mistral"
7
#79 opened 7 months ago
by
lakshmiu
Korean data rate in pretraining datasets.
3
#78 opened 7 months ago
by
Korabbit
Model outputs only <unk> tokens after training on my data
#77 opened 7 months ago
by
Fico
MemGPT, Function Calling and Mistral-7b-v0.1
#76 opened 7 months ago
by
Joseph717171
I create a site for someone want full guide of this model
#72 opened 7 months ago
by
gstarwd
Can you give an example of a good prompt template?
3
#70 opened 7 months ago
by
iplayfast
Hosting Mistral 7B API
2
#69 opened 7 months ago
by
wahab12
ImportError: Using `load_in_8bit=True` requires Accelerate
4
#68 opened 7 months ago
by
ubermenchh
Update README.md
#67 opened 7 months ago
by
Enoughking
Suggested Architecture for Small Mistral Model
#66 opened 7 months ago
by
mnitin73
Does Mistral support accelerate library?
4
#65 opened 7 months ago
by
Sp1der
The attention mask and the pad token id were not set.
2
#64 opened 8 months ago
by
victor314159
[AUTOMATED] Model Memory Requirements
#63 opened 8 months ago
by
model-sizer-bot
If I trained a model on mistral already, do I need to start from scratch due to difficulties of fine-tuning?
2
#62 opened 8 months ago
by
brando
Best french model embedder for retriever LangChain?
2
#61 opened 8 months ago
by
cfrancois7
token limit exceeded
3
#60 opened 8 months ago
by
nidabijapure
a=2, b=3, n=a+b, n=?
3
#59 opened 8 months ago
by
marc47marc47
Request: Please Make a LLAVA-Like Model from Mistral-7B - It Would be Amazing 🤩
6
#57 opened 8 months ago
by
Joseph717171
Open-Ko-LLM Leaderboard - Thanks for Uploading!
#55 opened 8 months ago
by
hunkim
Can't load tokenizer for 'bert-base-uncased'.
2
#54 opened 8 months ago
by
Momoxiao111
A decoder-only architecture is being used, but right-padding was detected! For correct generation results, please set `padding_side='left'` when initializing the tokenizer.
5
#51 opened 8 months ago
by
Ayush8120
Unrecognized configuration class <class 'transformers.models.mistral.configuration_mistral.MistralConfig'>
2
#50 opened 8 months ago
by
zeio
requests.exceptions.JSONDecodeError: Expecting value: line 1 column 1 (char 0)
6
#49 opened 8 months ago
by
Jenad1kr
Problems with tokenizer
1
#48 opened 8 months ago
by
abdurnawaz
QLORA fine tuning with longer length of sequence (max_length=2048, padding=True) cause RuntimeError: CUDA error: device-side assert triggered; shorten length to 512 works !
#46 opened 8 months ago
by
nps798
MCQ Question Answering
#45 opened 8 months ago
by
Ayush8120
Is `added_tokens.json` intended to be here?
4
#43 opened 8 months ago
by
xzuyn
Adding `safetensors` variant of this model
4
#42 opened 8 months ago
by
nth-attempt
Adding `safetensors` variant of this model
#41 opened 8 months ago
by
nth-attempt
Mistral en français ?
6
#40 opened 8 months ago
by
Giroud
Question answering
11
#39 opened 8 months ago
by
codegood
Tensorflow-variant coming?
1
#37 opened 8 months ago
by
areinh
Default template and configuration for local run with GPU
#33 opened 8 months ago
by
brunoedcf
still throws refusals
1
#31 opened 8 months ago
by
Phoenixalight
Has a massive repetition problem
14
#29 opened 8 months ago
by
Delcos
Which Mistral datacenter was used for training ?
2
#25 opened 8 months ago
by
niko32
ValueError: Please specify `target_modules` in `peft_config`
3
#23 opened 8 months ago
by
Tapendra
13b in the future?
9
#21 opened 8 months ago
by
deleted
Architectural difference with Llama
1
#20 opened 8 months ago
by
imone
How to deploy the model to local?
4
#19 opened 8 months ago
by
chao0524