Instructions to use nileshsarkar-ai/erdos-forge-7b-qlora with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- PEFT
How to use nileshsarkar-ai/erdos-forge-7b-qlora with PEFT:
from peft import PeftModel from transformers import AutoModelForCausalLM base_model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-Coder-7B-Instruct") model = PeftModel.from_pretrained(base_model, "nileshsarkar-ai/erdos-forge-7b-qlora") - Notebooks
- Google Colab
- Kaggle
erdos-forge-7b-qlora β FORGE: Fabrication-Oriented Reasoning Engine (Qwen2.5-Coder-7B QLoRA)
QLoRA adapters and intermediate checkpoints for FORGE β a constraint-graphβdriven engineering system that turns natural-language intent into manufacturable CAD parts (source on GitHub).
What FORGE does
Traditional CAD stores geometry. FORGE stores engineering intent as a live constraint graph, with every dimension traceable to the constraint that produced it. Change the manufacturing process and the part re-derives.
Adapter / training config
| Base model | Qwen/Qwen2.5-Coder-7B-Instruct |
| Method | QLoRA (4-bit base + LoRA adapters) |
LoRA rank r |
64 |
LoRA alpha |
128 |
| LoRA dropout | 0.05 |
| PEFT version | 0.18.1 |
| Tokenizer | included (tokenizer.json, vocab.json, merges.txt, etc.) |
Repository layout
.
βββ adapter_config.json # final adapter config
βββ adapter_model.safetensors # final LoRA weights
βββ tokenizer.json # tokenizer artifacts
βββ tokenizer_config.json
βββ added_tokens.json
βββ special_tokens_map.json
βββ vocab.json
βββ merges.txt
βββ chat_template.jinja
βββ training_args.bin # HF TrainingArguments at final
βββ checkpoint-500/ # intermediate checkpoints from training
βββ checkpoint-1000/
βββ checkpoint-1500/
βββ checkpoint-2000/
βββ checkpoint-2184/ # latest step (effectively final)
Each checkpoint-{step}/ mirrors the top-level adapter layout for resuming.
Loading the adapter
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer
base = AutoModelForCausalLM.from_pretrained(
"Qwen/Qwen2.5-Coder-7B-Instruct",
torch_dtype="auto",
device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("nileshsarkar-ai/erdos-forge-7b-qlora")
model = PeftModel.from_pretrained(base, "nileshsarkar-ai/erdos-forge-7b-qlora")
prompt = "Design a mounting bracket for an M8 bolt, injection molded in ABS."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(out[0], skip_special_tokens=True))
To load an intermediate checkpoint instead of the final one:
model = PeftModel.from_pretrained(
base,
"nileshsarkar-ai/erdos-forge-7b-qlora",
subfolder="checkpoint-2000",
)
Reproducing
Training pipeline (data generation β training β eval β serving) is at
ERDOS/forge-training/ on GitHub.
Axolotl config: 03_training/axolotl_config.yaml.
cd ERDOS/forge-training
bash 06_scripts/run_full_pipeline.sh
Related
- Project README:
ERDOS/forge-training/README.md - Forge core (constraint graph + OCC kernel):
ERDOS/forge-core/ - Backup-restore doc:
COLM_BACKUP_RESTORE.md
- Downloads last month
- 49