[Cache Request] aws-neuron/Llama-2-7b-hf-neuron-budget

#59
by Gerald001 - opened

Please add the following model to the neuron cache

AWS Inferentia and Trainium org

Llama 7b is already present in the cache: please go to the model card, select deploy and look at the Inferentia code snippet.

dacorvo changed discussion status to closed

Sign up or log in to comment