optimum-neuron-cache / inference-cache-config

dacorvo HF staff

Update inference-cache-config/Llama3.1-70b.json

7b0370b verified 2 months ago