"RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu!"
RuntimeError: [enforce fail at C:\cb\pytorch_1000000000000\work\c10\core\impl\alloc_cpu.cpp:72] data. DefaultCPUAllocator: not enough memory: you tried to allocate 1048576 bytes.
Running an rtx3060 with 12GBvram - managed to get this model working on method in link in description
RuntimeError: Internal: D:\a\sentencepiece\sentencepiece\src\sentencepiece_processor.cc(1102) [model_proto->ParseFromArray(serialized.data(), serialized.size())]
ValueError: Tokenizer class LlamaTokenizer does not exist or is not currently imported. (while using Transformers)
Temporal fix for "DefaultCPUAIIocator: not enough memory: you tried to allocate 13107200 bytes" error