philschmid HF staff
added custom handler for sharded loading
9327b57
metadata
tags:
  - endpoints-template
library_name: generic

Sharded fp16 copy of EleutherAI/gpt-j-6B

This is a fork of EleutherAI/gpt-j-6B with sharded fp16 weights. It implements a custom handler.py as an example of how to use GPT-J with inference-endpoints.
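
For orientation, here is a minimal sketch of what a custom handler.py for a sharded fp16 checkpoint could look like. It is not the handler shipped in this repository. It assumes the usual custom-handler convention of an `EndpointHandler` class with `__init__(path)` and `__call__(data)`, a request body with an `inputs` field and an optional `parameters` field, and that `transformers`, `torch`, and `accelerate` are installed. The `pipe` argument is a hypothetical addition so the request logic can be tried without loading the 6B model.

```python
from typing import Any, Callable, Dict, Optional


class EndpointHandler:
    """Illustrative handler sketch; not the repository's actual handler.py."""

    def __init__(self, path: str = "", pipe: Optional[Callable[..., Any]] = None):
        # `pipe` is a hypothetical hook for local testing; a deployed
        # handler would normally only receive `path`.
        if pipe is None:
            # Imported lazily so the class can be exercised without the
            # heavy dependencies when a stub pipeline is injected.
            import torch
            from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

            tokenizer = AutoTokenizer.from_pretrained(path)
            # Load the sharded checkpoint in half precision.
            # device_map="auto" requires the `accelerate` package.
            model = AutoModelForCausalLM.from_pretrained(
                path,
                torch_dtype=torch.float16,
                device_map="auto",
                low_cpu_mem_usage=True,
            )
            pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)
        self.pipeline = pipe

    def __call__(self, data: Dict[str, Any]) -> Any:
        # Assumed payload shape: {"inputs": "...", "parameters": {...}}
        inputs = data.pop("inputs", data)
        parameters = data.pop("parameters", None)
        if parameters is not None:
            # Forward generation arguments such as max_new_tokens.
            return self.pipeline(inputs, **parameters)
        return self.pipeline(inputs)
```

With a real checkpoint directory, `EndpointHandler(path="<model dir>")({"inputs": "Hello"})` would run text generation. Passing a stub callable as `pipe` exercises only the request parsing.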