Commit History
Added Llama-70b batch_size 4 to inference cache
593822e
verified
Create inference-cache-config/llama.json
1960ccb
verified
philschmid
commited on