Hugging Face
90.5
TFLOPS
Hu
Moses25
0 followers · 2 following
moseshu
AI & ML interests
None yet
Recent Activity
new
activity
13 days ago
deepseek-ai/DeepSeek-R1:
Garbled output
new
activity
20 days ago
deepseek-ai/DeepSeek-R1:
Adding <think>\n after chat template will cause vllm to not return reasoning_content (null) when reasoning
new
activity
about 1 month ago
Qwen/Qwen2.5-7B-Instruct:
Error when accelerating the qwen2.5-7B model with vllm 0.6.6
Organizations
Moses25's activity
New activity in
deepseek-ai/DeepSeek-R1
13 days ago
Garbled output
1
#163 opened 14 days ago by
cell22
New activity in
deepseek-ai/DeepSeek-R1
20 days ago
Adding <think>\n after chat template will cause vllm to not return reasoning_content (null) when reasoning
6
#144 opened 22 days ago by
kebeliu
New activity in
Qwen/Qwen2.5-7B-Instruct
about 1 month ago
Error when accelerating the qwen2.5-7B model with vllm 0.6.6
#15 opened about 1 month ago by
Moses25
New activity in
mistralai/Mistral-Nemo-Instruct-2407
2 months ago
why is the system prompt missing?
1
#87 opened 2 months ago by
Moses25
New activity in
stepfun-ai/GOT-OCR2_0
3 months ago
Is there a problem with input_ids = torch.as_tensor(inputs.input_ids).cuda()?
#37 opened 3 months ago by
Moses25
New activity in
OpenGVLab/InternVid
4 months ago
where is the video dataset?
1
#5 opened 4 months ago by
Moses25
updated
a model
4 months ago
Moses25/Llama-3-8B-chat-32K
Text Generation
•
Updated
Oct 25, 2024
•
34
•
3
updated
a collection
7 months ago
Llama3-8B
Collection
4 items
•
Updated
Jul 28, 2024
updated
a model
7 months ago
Moses25/LlaMA-3-8B-32K-INT8
Text Generation
•
Updated
Jul 28, 2024
•
10
updated
a dataset
7 months ago
Moses25/generate-Instruction-output-190k
Viewer
•
Updated
Jul 28, 2024
•
197k
•
82
updated
2 models
8 months ago
Moses25/Mistral-7B-Instruct-32K-GPTQ-INT8
Text Generation
•
Updated
Jul 13, 2024
•
17
•
1
Moses25/Mistral-7B-Base-V1
Text Generation
•
Updated
Jul 12, 2024
•
23
•
1
liked
a model
8 months ago
Moses25/Mistral-7B-Base-V1
Text Generation
•
Updated
Jul 12, 2024
•
23
•
1
updated
a model
8 months ago
Moses25/Llama-3-8B-Instruct-V1.0
Text Generation
•
Updated
Jul 3, 2024
•
34
•
1
New activity in
Moses25/Instruct-dataset11M
9 months ago
[bot] Conversion to Parquet
#1 opened 9 months ago by
parquet-converter
updated
a dataset
9 months ago
Moses25/Instruct-dataset11M
Viewer
•
Updated
Jun 21, 2024
•
1.69M
•
103
liked
a model
9 months ago
Moses25/Llama-3-8B-Instruct-V1.0
Text Generation
•
Updated
Jul 3, 2024
•
34
•
1
updated
a dataset
9 months ago
Moses25/llava_instruction_80k
Preview
•
Updated
Jun 21, 2024
•
92
•
3
New activity in
Moses25/llava_instruction_80k
9 months ago
[bot] Conversion to Parquet
#1 opened 9 months ago by
parquet-converter
liked
a dataset
9 months ago
Moses25/llava_instruction_80k
Preview
•
Updated
Jun 21, 2024
•
92
•
3