aws-neuron/optimum-neuron-cache
License: apache-2.0
bb6e6ba
optimum-neuron-cache/neuronxcc-2.14.227.0+2d4f85be/0_REGISTRY/0.0.24.dev0/inference/llama/princeton-nlp
Commit History
Synchronizing local compiler cache.
492a248 (verified)
dacorvo (HF staff) committed on Jul 19, 2024
Synchronizing local compiler cache.
485e40c (verified)
dacorvo (HF staff) committed on Jul 19, 2024