mistralai/Mistral-7B-v0.1

#91 opened 7 months ago by

lcahill

Adding Evaluation Results

#90 opened 7 months ago by

leaderboard-pr-bot

Embeddings API

#88 opened 7 months ago by

priamai

Update config.json

#86 opened 7 months ago by

PlanetDOGE

Create xx

#83 opened 7 months ago by

joey1895

Create README.md

#80 opened 7 months ago by

joey1895

Keyerror "Mistral"

7

#79 opened 7 months ago by

lakshmiu

Korean data rate in pretraining datasets.

#78 opened 7 months ago by

Korabbit

Model outputs only <unk> tokens after training on my data

#77 opened 7 months ago by

Fico

MemGPT, Function Calling and Mistral-7b-v0.1

#76 opened 7 months ago by

Joseph717171

I create a site for someone want full guide of this model

#72 opened 7 months ago by

gstarwd

Can you give an example of a good prompt template?

#70 opened 7 months ago by

iplayfast

Hosting Mistral 7B API

#69 opened 7 months ago by

wahab12

ImportError: Using `load_in_8bit=True` requires Accelerate

#68 opened 7 months ago by

ubermenchh

Update README.md

#67 opened 7 months ago by

Enoughking

Suggested Architecture for Small Mistral Model

#66 opened 7 months ago by

mnitin73

Does Mistral support accelerate library?

#65 opened 7 months ago by

Sp1der

The attention mask and the pad token id were not set.

#64 opened 8 months ago by

victor314159

[AUTOMATED] Model Memory Requirements

#63 opened 8 months ago by

model-sizer-bot

If I trained a model on mistral already, do I need to start from scratch due to difficulties of fine-tuning?

#62 opened 8 months ago by

brando

Best french model embedder for retriever LangChain?

#61 opened 8 months ago by

cfrancois7

token limit exceeded

#60 opened 8 months ago by

nidabijapure

a=2, b=3, n=a+b, n=?

#59 opened 8 months ago by

marc47marc47

AI专家

#58 opened 8 months ago by

sun95

Request: Please Make a LLAVA-Like Model from Mistral-7B - It Would be Amazing 🤩

6

#57 opened 8 months ago by

Joseph717171

Open-Ko-LLM Leaderboard - Thanks for Uploading!

#55 opened 8 months ago by

hunkim

Can't load tokenizer for 'bert-base-uncased'.

#54 opened 8 months ago by

Momoxiao111

A decoder-only architecture is being used, but right-padding was detected! For correct generation results, please set `padding_side='left'` when initializing the tokenizer.

5

#51 opened 8 months ago by

Ayush8120

Unrecognized configuration class <class 'transformers.models.mistral.configuration_mistral.MistralConfig'>

#50 opened 8 months ago by

zeio

requests.exceptions.JSONDecodeError: Expecting value: line 1 column 1 (char 0)

6

#49 opened 8 months ago by

Jenad1kr

Problems with tokenizer

#48 opened 8 months ago by

abdurnawaz

QLORA fine tuning with longer length of sequence (max_length=2048, padding=True) cause RuntimeError: CUDA error: device-side assert triggered; shorten length to 512 works !

#46 opened 8 months ago by

nps798

MCQ Question Answering

#45 opened 8 months ago by

Ayush8120

Is `added_tokens.json` intended to be here?

#43 opened 8 months ago by

xzuyn

Adding `safetensors` variant of this model

#42 opened 8 months ago by

nth-attempt

Adding `safetensors` variant of this model

#41 opened 8 months ago by

nth-attempt

Mistral en français ?

6

#40 opened 8 months ago by

Giroud

Question answering

11

#39 opened 8 months ago by

codegood

Tensorflow-variant coming?

#37 opened 8 months ago by

areinh

Default template and configuration for local run with GPU

#33 opened 8 months ago by

brunoedcf

still throws refusals

#31 opened 8 months ago by

Phoenixalight

Has a massive repetition problem

14

#29 opened 8 months ago by

Delcos

Which Mistral datacenter was used for training ?

#25 opened 8 months ago by

niko32

ValueError: Please specify `target_modules` in `peft_config`

#23 opened 8 months ago by

Tapendra

13b in the future?

9

#21 opened 8 months ago by deleted

Architectural difference with Llama

#20 opened 8 months ago by

imone

How to deploy the model to local?