Commit History

Update Sonoma model with faster 8x8 conv and split einsum attention
dba673f

smpanaro commited on

Update sequoia mode with transposed value cache and 4:508 input:cache length
722eedf
verified

smpanaro commited on

Upload Sequoia model
f554427
verified

smpanaro commited on

Update README.md
3764204
verified

smpanaro commited on

Add model
a76a14d

smpanaro commited on

initial commit
6fcd72b
verified

smpanaro commited on