Some weights of Videollama2Qwen2ForCausalLM were not initialized from the model checkpoint at ./VideoLLaMA2.1-7B-AV and are newly initialized:
I have the following question, can you help me with that? Why can't I load the parameters of the audio tower successfully? Which file is the model parameter of Audio Tower and which file should I place it in?
Some weights of Videollama2Qwen2ForCausalLM were not initialized from the model checkpoint at ./VideoLLaMA2.1-7B-AV and are newly initialized: ['model.audio_tower.encoder.pos_conv.0.weight_g', 'model.audio_tower.encoder.pos_conv.0.weight_v']
do you deal with that problem? i meet the same problem
After infering, there no results, only "Some weights of Videollama2Qwen2ForCausalLM were not initialized from the model checkpoint at DAMO-NLP-SG/VideoLLaMA2.1-7B-AV and are newly initialized: ['model.audio_tower.encoder.pos_conv.0.weight_g', 'model.audio_tower.encoder.pos_conv.0.weight_v']
You should probably TRAIN this model on a down-stream task to be able to use it for predictions and inference.
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!"
Thanks very much! i tried to rebuild the environment.
The solution is pip install transformers==4.42.3
. This issue is mainly caused by transformer version.
do you deal with that problem? i meet the same problem