Steven Goldfeather
treehugg3
·
AI & ML interests
None yet
Recent Activity
new activity
about 1 month ago
infly/INF-ORM-Llama3.1-70B:Basic question: How would you reproduce the training of this model?
new activity
about 1 month ago
Skywork/Skywork-Reward-Gemma-2-27B-v0.2:Model reproducability
new activity
about 2 months ago
nvidia/Llama-3_1-Nemotron-51B-Instruct:What is the context size this model was trained on?
Organizations
None yet
treehugg3's activity
Basic question: How would you reproduce the training of this model?
2
#1 opened about 1 month ago
by
treehugg3
Model reproducability
#2 opened about 1 month ago
by
treehugg3
What is the context size this model was trained on?
2
#23 opened 2 months ago
by
treehugg3
Model will need to be requantized, rope issues for long context
3
#2 opened about 2 months ago
by
treehugg3
Model will need to be re-quantized, rope issues
#4 opened about 2 months ago
by
treehugg3
Poor long-context performance?
6
#2 opened 2 months ago
by
treehugg3
Compatible small models for speculative decoding?
#9 opened 2 months ago
by
treehugg3
Chat template
3
#3 opened 7 months ago
by
sydneyfong