Consumer sized versions? (26B A3B Versions)

#5
by CYISNOTHERE - opened

I was wondering if it would be possible for moonshootai to create consumer GPU versions of the kimi model? This model is so good, it is sad that we don't get smaller versions for consumers to be able to run themselves. Right now this class of model seems to be only usable if you have enterprise funding or are wealthy with disposable income.

I know this isn't the typical pattern of releases you guys do, but it would be really appreciated. πŸ™

Even a version a quarter of the size would at least allow running it on small enterprises setups. I don't have dozens of H200 lying arround, even in our small DC.

Even a version a quarter of the size would at least allow running it on small enterprises setups. I don't have dozens of H200 lying arround, even in our small DC.

Lebron James Heaven
If there's a will, there's a way

I was wondering if it would be possible for moonshootai to create consumer GPU versions of the kimi model? This model is so good, it is sad that we don't get smaller versions for consumers to be able to run themselves. Right now this class of model seems to be only usable if you have enterprise funding or are wealthy with disposable income.

I know this isn't the typical pattern of releases you guys do, but it would be really appreciated. πŸ™

You already have Qwen 3.6 models.

@daniel-dona to be fair, no 135b or 397b sized Qwen models were released last time, which I'd love to have. This is kind of a forgotten size right now. You either have 40b-ish models (and lower), or 750b and above.

The middle is empty.

Sign up or log in to comment