view article Article 🤗 PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware Feb 10, 2023 • 14
view article Article A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes Aug 17, 2022 • 19
view article Article Assisted Generation: a new direction toward low-latency text generation May 11, 2023 • 8
view article Article Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA May 24, 2023 • 31
view article Article How to train a new language model from scratch using Transformers and Tokenizers Feb 14, 2020 • 9
Malaysian Llama Collection Llama family on Malaysian context. • 12 items • Updated 27 days ago • 1
Multimodal Malaysian LLM Collection Multimodal Malaysian LLM. • 11 items • Updated 27 days ago • 1
view article Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 27
view article Article Generating Human-level Text with Contrastive Search in Transformers 🤗 Nov 8, 2022 • 3
ChatQA: Building GPT-4 Level Conversational QA Models Paper • 2401.10225 • Published Jan 18 • 32
view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 26