view post Post 1223 Hi folks,Tenyx announced its latest model Llama3-TenyxChat-70B, which outperforms a GPT-4 variant on several MT-Bench measurements.By post-training Llama-3 70B in 15 hours, our model improves reasoning capabilities leveraging the relationship between geometry and LLM task complexity (Take a look at our paper: https://arxiv.org/abs/2312.01648, to be presented at ICML 2024)Model: tenyx/Llama3-TenyxChat-70B, HuggingFace Space: tenyx/Llama3-TenyxChat-70B π₯ 7 7 π 5 5 π 2 2 π€ 2 2 π§ 1 1 π€― 1 1 + Reply
A Rank Stabilization Scaling Factor for Fine-Tuning with LoRA Paper β’ 2312.03732 β’ Published Nov 28, 2023 β’ 8