Unsupported architecture T5ForConditionalGeneration

#1
by Renee0v0 - opened

Hi Chiyuri,
Thank you for sharing your merged models!
Since Flan-T5-base is an encoder-decoder model, I was wondering whether you have encountered the following error and, if so, how you solved it:

Traceback (most recent call last):
  File "/usr/local/bin/mergekit-yaml", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 1157, in __call__
    return self.main(*args, **kwargs)
  File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 1078, in main
    rv = self.invoke(ctx)
  File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 1434, in invoke
    return ctx.invoke(self.callback, **ctx.params)
  File "/usr/local/lib/python3.10/dist-packages/click/core.py", line 783, in invoke
    return __callback(*args, **kwargs)
  File "/content/mergekit/mergekit/options.py", line 82, in wrapper
    f(*args, **kwargs)
  File "/content/mergekit/mergekit/scripts/run_yaml.py", line 47, in main
    run_merge(
  File "/content/mergekit/mergekit/merge.py", line 46, in run_merge
    model_arch_info = [
  File "/content/mergekit/mergekit/merge.py", line 47, in <listcomp>
    get_architecture_info(m.config(trust_remote_code=options.trust_remote_code))
  File "/content/mergekit/mergekit/architecture.py", line 366, in get_architecture_info
    raise RuntimeError(f"Unsupported architecture {arch_name}")
RuntimeError: Unsupported architecture T5ForConditionalGeneration

When I run your scripts in LazyMergekit, I get stuck on the error above.
Thank you so much, and I'm looking forward to hearing from you!
