Transformers without Normalization
Paper: arXiv:2503.10622
•
Published
•
70
•
3
Thanks for taking the time to clarify, super interesting!
Hi, thanks for sharing! Curious if you could say more about why it's hard to fine-tune or modify a model saved in GGUF format?