Ivan Stepanov

ivanstepanovftw
·

AI & ML interests

None yet

Recent Activity

Organizations

None yet

ivanstepanovftw's activity

reacted to bartowski's post with 👍 5 days ago
view post
Post
54123
Switching to author_model-name

I posted a poll on twitter, and others have mentioned the interest in me using the convention of including the author name in the model path when I upload.

It has a couple advantages, first and foremost of course is ensuring clarity of who uploaded the original model (did Qwen upload Qwen2.6? Or did someone fine tune Qwen2.5 and named it 2.6 for fun?)

The second thing is that it avoids collisions, so if multiple people upload the same model and I try to quant them both, I would normally end up colliding and being unable to upload both

I'll be implementing the change next week, there are just two final details I'm unsure about:

First, should the files also inherit the author's name?

Second, what to do in the case that the author name + model name pushes us past the character limit?

Haven't yet decided how to handle either case, so feedback is welcome, but also just providing this as a "heads up"
  • 3 replies
·
New activity in ai-sage/Giga-Embeddings-instruct 3 months ago

Exception

3
#2 opened 3 months ago by
ivanstepanovftw
New activity in hs-hf/jina-embeddings-v3-distilled 4 months ago
New activity in Lightricks/LTX-Video 4 months ago
New activity in jinaai/jina-embeddings-v3 4 months ago

Disable tqdm

2
#66 opened 4 months ago by
ivanstepanovftw
New activity in tenyx/Llama3-TenyxChat-70B 10 months ago

Prompt template?

1
#4 opened 10 months ago by
ivanstepanovftw