使用so-vits-svc-fork,使用G_7000.pth报错

#1
by runchina - opened

Traceback (most recent call last):
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/bin/svc", line 8, in
sys.exit(cli())
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/click/core.py", line 1157, in call
return self.main(*args, **kwargs)
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/click/core.py", line 1078, in main
rv = self.invoke(ctx)
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/click/core.py", line 1688, in invoke
return _process_result(sub_ctx.command.invoke(sub_ctx))
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/click/core.py", line 1434, in invoke
return ctx.invoke(self.callback, **ctx.params)
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/click/core.py", line 783, in invoke
return __callback(*args, **kwargs)
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/so_vits_svc_fork/main.py", line 277, in infer
infer(
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/so_vits_svc_fork/inference/main.py", line 95, in infer
audio = svc_model.infer_silence(
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/so_vits_svc_fork/inference/core.py", line 300, in infer_silence
audio_chunk_pad_infer_tensor, _ = self.infer(
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/so_vits_svc_fork/inference/core.py", line 232, in infer
audio = self.net_g.infer(
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/so_vits_svc_fork/modules/synthesizers.py", line 213, in infer
x = self.pre(c) * x_mask + self.emb_uv(uv.long()).transpose(1, 2)
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1511, in _wrapped_call_impl
return self._call_impl(*args, **kwargs)
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1520, in _call_impl
return forward_call(*args, **kwargs)
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/torch/nn/modules/conv.py", line 310, in forward
return self._conv_forward(input, self.weight, self.bias)
File "/Users/run/Workspaces/Envs/so-vits-svc-fork/lib/python3.10/site-packages/torch/nn/modules/conv.py", line 306, in _conv_forward
return F.conv1d(input, weight, bias, self.stride,
RuntimeError: Given groups=1, weight of size [192, 768, 5], expected input[1, 256, 2756] to have 768 channels, but got 256 channels instead

似乎是因为256和768的问题。换一下config里的encoder试一试,麻烦了。

Sign up or log in to comment