Bug: Missing modeling_iquestpltcoder.py file prevents model from loading

#3
by JohnMolotov - opened

Hi there,

Thank you for releasing the LoopCoder-V2 model!

It looks like the repository is currently missing the core modeling file required to initialize the custom architecture. While configuration_iquestpltcoder.py and tokenization_iquestcoder.py are present, modeling_iquestpltcoder.py appears to have been omitted during the upload.

When attempting to load the model via standard HF transformers or vLLM with trust_remote_code=True, it fails at architecture resolution because the AutoModelForCausalLM hook has no blueprint to construct the layer weights.

Could you please upload modeling_iquestpltcoder.py to the repository? Thank you!

I ran into the same problem. There's a YouTube video where the model is being tested, but I think he's running the model in V1, not V2. https://www.youtube.com/watch?v=ruxvS5Bd3mU

Multilingual-Multimodal-NLP org

Sry for inconvenience, we will release the codebase for inference of v2 version

Multilingual-Multimodal-NLP org

Hi, we updated the readme, and users can use the vLLM branch for inference.

For vLLM inference, install vLLM from yxing-bj/vllm and use transformers==4.57.1, then start the server with the following command:

vllm serve $MODEL --port 8080 \
    --max-num-batched-tokens 8192 --max-num-seqs 512 -tp 1 -dp 1 --trust-remote-code \
    --cudagraph-capture-sizes 1 2 4 8 12 16 24 32

Sign up or log in to comment