GGUF
conversational

curious about the working mechanism

#1
by jimzlf - opened

Is this a completely new model or a fine-tune based on any existing model(seem to be qwen2.5-7b regarding to the size)? And how does it work in the software(how does AI assist in the creative process)?
(oh sorry I only realized that after posting that it has been noted that the architecture is qwen2)

jimzlf changed discussion title from curious about the base model to curious about the working mechanism

Hey, no worries. Yes, it’s a Qwen2.5-Coder 7B Instruct model LoRA fine-tuned on MAGDA tasks, including MAGDA DSL generation, then merged/exported as a Q4_K_M GGUF for local inference

Sign up or log in to comment