How to use from the
Use from the
llama-cpp-python library
# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="deepforce/deepforce-coder-v1",
	filename="deepforce-coder-v1-q4_k_m.gguf",
)
llm.create_chat_completion(
	messages = [
		{
			"role": "user",
			"content": "What is the capital of France?"
		}
	]
)

DeepForce Coder v1

⚠️ Note: v1 has known issues with simple Apex generation due to adapter weight loss during training. v2 is currently in development with full retraining and will be significantly improved. Use v2 when available: coming soon.

Known Limitations in v1

  • Over-engineers simple Apex requests
  • Occasionally generates non-existent Apex APIs
  • Best used for complex generation, debug, review, and refactor tasks

v2 Coming Soon

Full retraining with verified adapter weights across all adapters.

Downloads last month
42
GGUF
Model size
3B params
Architecture
qwen2
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for deepforce/deepforce-coder-v1

Base model

Qwen/Qwen2.5-3B
Quantized
(106)
this model