I think it is a very interesting model.

#1
by suitup91 - opened

If possible, could you please upload the unquantized weights? I would like to check the performance of this model.

The uncompressed weights are too heavy and can only really be used by T5 in their current form; that is why I had to do something abstract like use shaping, they are not 1:1 analogs of something Gemma can natively use.

But you'll find if you DL T5-small for example, you can outright replace non-decoder layers with weights generated by ContourFuse and it will function as it should, there is no breakage, if you're curious whether they are valid or not.

The method I used to construct them relies on RELU boundary, and I was only able to reason two boundaries so the rest of the 'weights' are interpolation that hold the reasoning but not the actual training data

With just two boundaries, it is a lossy image of the reasoning

Sign up or log in to comment