-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 581 -
GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Paper • 2403.03507 • Published • 180 -
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Paper • 2402.19427 • Published • 50 -
ResLoRA: Identity Residual Mapping in Low-Rank Adaption
Paper • 2402.18039 • Published • 11
RachidAR
RachidAR
AI & ML interests
1.58 bit LLM
Organizations
Collections
3
-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 581 -
Layer Skip: Enabling Early Exit Inference and Self-Speculative Decoding
Paper • 2404.16710 • Published • 56 -
Beyond Scaling Laws: Understanding Transformer Performance with Associative Memory
Paper • 2405.08707 • Published • 27 -
Token-Scaled Logit Distillation for Ternary Weight Generative Language Models
Paper • 2308.06744 • Published • 1
models
19
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642692742f16fd599d9eb4da/6IfYBIlNdngWw4WrN08zo.png)
RachidAR/Phi-3-mini-4k-ins-June2024-Q5_K_M-imat-GGUF
Text Generation
•
Updated
•
302
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642692742f16fd599d9eb4da/6IfYBIlNdngWw4WrN08zo.png)
RachidAR/Phi-3-mini-4k-instruct-June2024-Q6_K-GGUF
Text Generation
•
Updated
•
321
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642692742f16fd599d9eb4da/6IfYBIlNdngWw4WrN08zo.png)
RachidAR/saiga_llama3_8b-Q6_K-GGUF
Updated
•
1.12k
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642692742f16fd599d9eb4da/6IfYBIlNdngWw4WrN08zo.png)
RachidAR/Llama-3-8B-Instruct-DPO-v0.3-Q6_K-GGUF
Text Generation
•
Updated
•
6
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642692742f16fd599d9eb4da/6IfYBIlNdngWw4WrN08zo.png)
RachidAR/Waktaverse-Llama-3-KO-8B-Instruct-Q6_K-GGUF
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642692742f16fd599d9eb4da/6IfYBIlNdngWw4WrN08zo.png)
RachidAR/llama-3-indotuned-v0-Q6_K-GGUF
Updated
•
4
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642692742f16fd599d9eb4da/6IfYBIlNdngWw4WrN08zo.png)
RachidAR/Llama-3-8B-saiga-suzume-ties-Q6_K-GGUF-OLD
Text Generation
•
Updated
•
1.26k
•
2
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642692742f16fd599d9eb4da/6IfYBIlNdngWw4WrN08zo.png)
RachidAR/ablation-model-fineweb-v1-Q6_K-GGUF
Updated
•
4
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642692742f16fd599d9eb4da/6IfYBIlNdngWw4WrN08zo.png)
RachidAR/Llama-3-8B-Instruct-Physics-5k-Scar-Q6_K-GGUF
Updated
•
5
![](https://cdn-avatars.huggingface.co/v1/production/uploads/642692742f16fd599d9eb4da/6IfYBIlNdngWw4WrN08zo.png)
RachidAR/AFlow-SegMoe-1Bx3-v0.1
Text-to-Image
•
Updated
•
74
datasets
None public yet