abacaj's picture
Update README.md
f1b6814
|
raw
history blame contribute delete
No virus
447 Bytes
metadata
license: other

This is a ggml quantized version of Replit-v2-CodeInstruct-3B. Quantized to 4bit -> q4_1. To run inference you can use ggml directly or ctransformers.