gradient_checkpointing_enable这个方法并没有开启梯度检查点的作用
#56 opened 10 days ago
by
Qleon
在哪里能看到模型的分词逻辑?
#55 opened 12 days ago
by
Loki1996
chatglm3系列仍未完全支持vllm
2
#52 opened about 2 months ago
by
zzhdbw
请问chatglm3-6b支持Flash Attention 2.0吗
#51 opened about 2 months ago
by
chentao111
chatglm-6b-128k模型输入长文本只能输入32k
3
#50 opened about 2 months ago
by
deqiuqiuzhang
怎么测试mmlu数据集精度呢?
3
#48 opened 3 months ago
by
StriveForC
GGUF model needed
1
#40 opened 3 months ago
by
ehmy
🚩 Report: Legal issue(s)
#31 opened 5 months ago
by
xcong
HF transformers integration
2
#28 opened 5 months ago
by
ybelkada
Add `ChatGLMTokenizerFast` and `ChatGLMTokenizerConverter`
10
#12 opened 7 months ago
by
chielo
[AUTOMATED] Model Memory Requirements
#1 opened 8 months ago
by
model-sizer-bot