aws-neuron/optimum-neuron-cache
License: apache-2.0
0e97e8e
optimum-neuron-cache/neuronxcc-2.13.66.0+6dfecc895/0_REGISTRY/0.0.21.dev0/inference/llama/princeton-nlp/Sheared-LLaMA-1.3B
3 contributors
History: 4 commits
dacorvo (HF staff): Synchronizing local compiler cache. (c5b6dab, verified, 10 months ago)
059827c299e8d9043f57.json    881 Bytes    Synchronizing local compiler cache.    10 months ago
ece87a51a12bdc2169c6.json    881 Bytes    Synchronizing local compiler cache.    10 months ago