upload custom handler and requirements.txt for direct compatibility with HF inference endpoints
#3 opened about 17 hours ago
by
MoritzLaurer
Quantization Options for Faster Inference and Lower VRAM Usage
#2 opened 19 days ago
by
1sarim
GPU requirements for real time response?
2
#1 opened 2 months ago
by
lukiggs