Commit History

try except flash-attn
f48478c

Andrei Panferov commited on

inference lib
03ea233

Andrei Panferov commited on

slightly faster inference
f1a2023

Andrei Panferov commited on

newer inference
115e749

Andrei Panferov commited on

new code
dfb8eb3

Andrei Panferov commited on

removed init
161c13a

Andrei Panferov commited on

tokenizer
8abdf20

Andrei Panferov commited on

deleted leftovers
0110580

Andrei Panferov commited on

depth 1
5edaefc

Andrei Panferov commited on

flat
7e4a8ff

Andrei Panferov commited on

correct import
c0d7cc2

Andrei Panferov commited on

Custom config in modeling
c43662f

Andrei Panferov commited on

inference and autoloading
5c0d7ef

Andrei Panferov commited on

model
cc25d01

Andrei Panferov commited on

config
d1f8951

Andrei Panferov commited on