LagPixelLOL
v2ray
AI & ML interests
Looking for compute sponsors, please contact me through my email 2282688304@qq.com!
Recent Activity
updated
a model
about 19 hours ago
x2ray/wheels
updated
a dataset
4 days ago
v2ray/r-chatgpt-general-dump
new activity
4 days ago
cognitivecomputations/DeepSeek-R1-AWQ:About the group size
Organizations
v2ray's activity
About the group size
1
#26 opened 4 days ago
by
Skyeaee

The awq quantization model may encounter garbled characters when performing inference on long texts.
9
#24 opened 12 days ago
by
wx111
How can I quantify my BF16 format model into AWQ?
1
#25 opened 11 days ago
by
AlipaySimon

Support for inference with MTP module?
1
#23 opened 13 days ago
by
yhh001
poor performance for DeepSeek-V3-AWQ
2
#9 opened 18 days ago
by
fridayl
The V3-AWQ model's response seems not as expected
12
#8 opened 20 days ago
by
juxing
Can't get 48 TPS on 8x H800
1
#21 opened 22 days ago
by
Light4Bear

gpt-4chan Neo-J
1
#1 opened 22 days ago
by
gman402
Pipeline Parallellism
1
#20 opened 22 days ago
by
leo98xh
8*a100 OUT OF MEMORY
1
#19 opened 22 days ago
by
Jaren
requests get stuck when sending long prompts (already solved, but still don't know why?)
1
#18 opened 22 days ago
by
uv0xab
Significant Speed Drop with Increasing Input Length on H800 GPUs
2
#17 opened 23 days ago
by
wangkkk956
Docker start with vllm failed. Official vllm docker image 0.7.3
1
#7 opened 23 days ago
by
kuliev-vitaly
when i use vllm v0.7.2 to deploy r1 awq, i got empty content
13
#10 opened about 1 month ago
by
bupalinyu
why "MLA is not supported with awq_marlin quantization. Disabling MLA." with 4090 * 32 (4 node / vllm 0.7.2)
3
#14 opened 24 days ago
by
FightLLM
when i run command ,it didnot work. ( via vllm 0.7.3)
2
#16 opened 23 days ago
by
xueshuai
skips the thinking process
11
#5 opened about 1 month ago
by
muzizon
Any one can run this model with SGlang framework?
3
#13 opened 24 days ago
by
muziyongshixin
Any thresholds recommendation for this model?
3
#1 opened 29 days ago
by
narugo

GPTQ Support
2
#1 opened 2 months ago
by
warlock-edward