Commit History

Update Mistral Window_Size for GQA
3d5ce13
verified

petermcaughan commited on

Update README with usage
08cd86f

petermcaughan commited on

Remove duplicate models
fcff7bc

petermcaughan commited on

Update newest interface for Mistral (remove seq_len input, inferred instead)
6e7127a

petermcaughan commited on

Upload .ONNX model files
007544b

petermcaughan commited on