Parameters not affecting model output when using lmdeploy
I'm currently experimenting with InternVL2 using lmdeploy as the inference engine and have run into a few issues with parameter settings during inference.
Parameters like `temperature` and `top_p` are not affecting results: despite changing these parameters across a range of values, I don't observe any noticeable differences in the outputs. Has anyone else experienced this issue? Are there any known solutions or workarounds to ensure these parameters influence generation as expected? A rough sketch of the kind of call I'm making is below.
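For reference, a minimal sketch assuming lmdeploy's `pipeline` / `GenerationConfig` API; the model path, image URL, and parameter values are placeholders, not my exact setup:

```python
from lmdeploy import pipeline, GenerationConfig
from lmdeploy.vl import load_image

# Placeholder model path -- substitute the actual InternVL2 checkpoint.
pipe = pipeline('OpenGVLab/InternVL2-8B')

# Placeholder image.
image = load_image('https://example.com/sample.jpg')

# Sampling parameters passed explicitly via GenerationConfig.
gen_config = GenerationConfig(
    temperature=0.8,
    top_p=0.95,
    max_new_tokens=256,
)

response = pipe(('Describe this image.', image), gen_config=gen_config)
print(response.text)
```

If outputs still look identical across runs, it may also be worth checking whether a fixed `random_seed` is being set in `GenerationConfig`, since a constant seed makes sampling deterministic and can look like the parameters have no effect.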
`max_input_tiles` parameter: I couldn't find the `max_input_tiles` parameter in the lmdeploy engine documentation. Could someone provide guidance on how to use this parameter during inference? Is there a specific syntax or method for including it?
Any help or insights on these issues would be greatly appreciated. Thanks in advance!
1. For the first question, could you provide specific test code and samples?
2. `max_input_tiles` (named `max_dynamic_patch` in lmdeploy) has been added to lmdeploy. See https://github.com/InternLM/lmdeploy/pull/2245 and https://github.com/InternLM/lmdeploy/pull/2292 for details; a sketch of one way to set it is below.
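As a sketch, under the assumption that the InternVL2 checkpoint exposes `max_dynamic_patch` in its `config.json` and that lmdeploy picks it up from there (the linked PRs describe the exact interface; the path and value here are illustrative):

```python
import json
from pathlib import Path

from lmdeploy import pipeline

# Assumption: lmdeploy reads `max_dynamic_patch` from the checkpoint's
# config.json; see the PRs linked above for the supported interface.
model_dir = Path('/path/to/InternVL2-8B')  # placeholder path
cfg_path = model_dir / 'config.json'

cfg = json.loads(cfg_path.read_text())
cfg['max_dynamic_patch'] = 6  # counterpart of max_input_tiles; illustrative value
cfg_path.write_text(json.dumps(cfg, indent=2))

pipe = pipeline(str(model_dir))
```

Roughly speaking, lower values should reduce the number of image tiles (and thus vision tokens and latency), while higher values preserve more detail on large images.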