GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection Paper โข 2403.03507 โข Published Mar 6 โข 172 โข 12
SaulLM-7B: A pioneering Large Language Model for Law Paper โข 2403.03883 โข Published Mar 6 โข 65 โข 4
PersianMind: A Cross-Lingual Persian-English Large Language Model Paper โข 2401.06466 โข Published Jan 12 โข 2 โข 2
GRATH: Gradual Self-Truthifying for Large Language Models Paper โข 2401.12292 โข Published Jan 22 โข 2 โข 2