michael jefferson
520jefferson
AI & ML interests
None yet
Recent Activity
new activity
about 12 hours ago
deepseek-ai/DeepSeek-V3-Base: Model differences
liked
a model
4 days ago
RLHFlow/Llama3.1-8B-PRM-Deepseek-Data
liked
a dataset
4 days ago
SimpleBerry/OpenLongCoT-SFT
Organizations
520jefferson's activity
Model differences
#22 opened about 12 hours ago
by
520jefferson
Run on iOS
4
#1 opened 8 months ago
by
evgeniymikholap
Could you clarify the differences between the models?
2
#2 opened 4 months ago
by
520jefferson
Is it possible to train to 100k on top of this?
#6 opened 6 months ago
by
520jefferson
How much memory is needed for a 128k context length?
1
#13 opened 7 months ago
by
ggbondcxk
multiturn roleplay
2
#2 opened 11 months ago
by
520jefferson
How can I limit the generation length? I haven't seen the new max length in chat().
2
#6 opened about 1 year ago
by
520jefferson
May I use DeepSpeed to accelerate the model inference process?
1
#5 opened about 1 year ago
by
520jefferson