Has the tokenizer of the base model(Mistral-7B-v0.1) been retrained?
#37 opened 3 days ago
by
LH0521
How did you trained your LatentAttentionLayer?
1
#36 opened 12 days ago
by
juneonetwothree
Why do we need to hardcode self._attn_implementation = "eager"
1
#35 opened 12 days ago
by
shantanuagarwal
Error to load model with HuggingFace API
1
#34 opened 13 days ago
by
hhcloud
Regarding max seq length
1
#33 opened 13 days ago
by
sandeep456
How to fine-tune this model?
#32 opened 14 days ago
by
caochengchen
error with module datasets
2
#31 opened 15 days ago
by
claraadam
Distant resource does not have a Content-Length
#30 opened 15 days ago
by
caochengchen
Best instructions for clustering and semantic similarity
2
#29 opened 16 days ago
by
rmilliere
Dataloader multiprocessing error
1
#28 opened 19 days ago
by
Atsunori
Fixing "KeyError: 'NVEmbedConfig'"
8
#27 opened 20 days ago
by
Th3l
Error using multi-gpu support
4
#26 opened 21 days ago
by
bobwhiterabbit
Access to model nvidia/NV-Embed-v1 is restricted. You must be authenticated to access it
6
#25 opened 22 days ago
by
yijiu
Matryoshka Embedding
1
#24 opened 22 days ago
by
XingyanZhang
nvidia/NV-Embed-v1 is not the path to a directory containing a file named config.json.
3
#23 opened 22 days ago
by
XuehangCang
Finetuning guidelines
#21 opened 24 days ago
by
mali404
How much VRAM is needed to run this model? Like for the bare minimum length etc?
3
#20 opened 25 days ago
by
smpa239
Ollama Version
1
#19 opened 25 days ago
by
yangwang825
Weights are in FP16 (loaded in FP32) but paper mentions BF16
#17 opened 27 days ago
by
AdrienC
ONNX version
1
#16 opened 27 days ago
by
michaelfeil
Sentence Transformer compatibility
4
#15 opened 27 days ago
by
michaelfeil
Please provide a 8bit quantified version
#14 opened 27 days ago
by
fukai
How to use for AutoModelForSequenceClassification?
#13 opened 28 days ago
by
deshwalmahesh
Possible to implement `_no_split_modules` attribute?
1
#12 opened 28 days ago
by
ronnybehrens
missing citation
3
#11 opened 28 days ago
by
SeanLee97
Multi-Lingual?
2
#10 opened 28 days ago
by
dejanseo
Getting "KeyError" when loading model
5
#8 opened 29 days ago
by
tsakaiba
TypeError: MistralDecoderLayer.forward() got an unexpected keyword argument 'is_causal'
3
#7 opened 29 days ago
by
yxzwayne
Is this model active?
1
#5 opened 29 days ago
by
gsnic
Sharing training data & reproducing training
1
#4 opened 30 days ago
by
xhluca