77 CulturaX: A Cleaned, Enormous, and Multilingual Dataset for Large Language Models in 167 Languages · 8 authors 4
22 Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference Using Sorted Fine-Tuning (SoFT) · 6 authors 1
9 Struc-Bench: Are Large Language Models Really Good at Generating Complex Structured Data? · 5 authors 1
4 A Distributed Data-Parallel PyTorch Implementation of the Distributed Shampoo Optimizer for Training Neural Networks At-Scale · 8 authors
3 S3-DST: Structured Open-Domain Dialogue Segmentation and State Tracking in the Era of LLMs · 8 authors
2 Enhance audio generation controllability through representation similarity regularization · 9 authors