Commit History

Added Llama-70b batch_size 4 to inference cache
593822e
verified

dacorvo HF staff commited on

Create inference-cache-config/llama.json
1960ccb
verified

philschmid HF staff commited on