Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
15
Follow
AWS Inferentia and Trainium
92
License:
apache-2.0
Model card
Files
Files and versions
Community
360
98bd4ca
optimum-neuron-cache
/
neuronxcc-2.13.66.0+6dfecc895
/
0_REGISTRY
/
0.0.21.dev0
/
inference
/
llama
/
princeton-nlp
/
Sheared-LLaMA-1.3B
4 contributors
History:
4 commits
dacorvo
HF staff
Synchronizing local compiler cache.
c5b6dab
verified
10 months ago
059827c299e8d9043f57.json
881 Bytes
Synchronizing local compiler cache.
10 months ago
ece87a51a12bdc2169c6.json
881 Bytes
Synchronizing local compiler cache.
10 months ago