-
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w4g128
Text Generation • Updated • 8 -
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g64
Text Generation • Updated • 4 -
ChenMnZ/Llama-3-8b-instruct-EfficientQAT-w2g128
Text Generation • Updated • 3 -
ChenMnZ/Llama-3-8b-EfficientQAT-w4g128
Text Generation • Updated • 4
Collections
Discover the best community collections!
Collections including paper arxiv:2407.11062
-
QLoRA: Efficient Finetuning of Quantized LLMs
Paper • 2305.14314 • Published • 44 -
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
Paper • 2407.11062 • Published • 3 -
Spectra: A Comprehensive Study of Ternary, Quantized, and FP16 Language Models
Paper • 2407.12327 • Published • 62
-
CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data
Paper • 2404.15653 • Published • 25 -
MoDE: CLIP Data Experts via Clustering
Paper • 2404.16030 • Published • 11 -
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
Paper • 2405.12130 • Published • 44 -
Reducing Transformer Key-Value Cache Size with Cross-Layer Attention
Paper • 2405.12981 • Published • 26
-
FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models
Paper • 2402.10986 • Published • 75 -
bigcode/starcoder2-15b
Text Generation • Updated • 22.2k • 539 -
Zephyr: Direct Distillation of LM Alignment
Paper • 2310.16944 • Published • 120 -
mixedbread-ai/mxbai-rerank-large-v1
Text Classification • Updated • 40.7k • 83