Commit History

Trying to stop OOMs on MMLU and GSM8K by halving seq len
c13858c

Yeyito commited on

Update .gitignore
4996c9a

Yeyito commited on

Avoiding re-loading already loaded models. Stated unload functionality as not-implemented.
b28ad14

Yeyito commited on

Upload 16 files
2a135fe

Yeyito commited on

Caching model1 responses for 2x speedup
5b3849a

Yeyito commited on

Functional
8ea42fc

Yeyito commited on

Modulized subprocess
d649c17

Yeyito commited on

First Test
0217809

Yeyito commited on

Test
eb5eda7

Yeyito commited on

Add application file
90a03a3

Yeyito commited on

initial commit
86af809

Yeyito commited on