Converting OPT-175B to NumPy weights took a lot of resources, as I did it all on my own hardware: roughly 600 GB of swap in use, on spinning rust. For the license, please read LICENSE.md.
Use https://github.com/FMInference/FlexGen to run this model.
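
Since the weights are plain NumPy arrays, a single file can be inspected without pulling it into RAM by memory-mapping it. The snippet below is only a minimal sketch: the file name `demo_weight.npy`, the helper names, and the tiny float16 array are all made up for illustration and say nothing about how the files in this repository are actually named or laid out.

```python
import os
import tempfile

import numpy as np


def save_demo_weight(directory):
    """Write a tiny stand-in weight file (hypothetical name and contents)."""
    path = os.path.join(directory, "demo_weight.npy")
    np.save(path, np.arange(12, dtype=np.float16).reshape(3, 4))
    return path


def describe_weight(path):
    """Return (shape, dtype, one sample value) via a read-only memory map.

    mmap_mode="r" lets NumPy read only the pages that are touched,
    so the whole array is never loaded into memory at once.
    """
    arr = np.load(path, mmap_mode="r")
    return arr.shape, str(arr.dtype), float(arr[0, 1])


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        demo_path = save_demo_weight(tmp)
        print(describe_weight(demo_path))
```

The same `np.load(..., mmap_mode="r")` call should work on any array written with `np.save`, which makes it a cheap way to sanity-check shapes and dtypes on a machine with far less RAM than the model needs.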