-
Large Language Model Alignment: A Survey
Paper • 2309.15025 • Published • 2 -
Aligning Large Language Models with Human: A Survey
Paper • 2307.12966 • Published • 1 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 37 -
SteerLM: Attribute Conditioned SFT as an (User-Steerable) Alternative to RLHF
Paper • 2310.05344 • Published • 1
Collections
Discover the best community collections!
Collections including paper arxiv:2106.09685
-
SMOTE: Synthetic Minority Over-sampling Technique
Paper • 1106.1813 • Published • 1 -
Scikit-learn: Machine Learning in Python
Paper • 1201.0490 • Published • 1 -
Identity Mappings in Deep Residual Networks
Paper • 1603.05027 • Published • 2 -
Deep Residual Learning for Image Recognition
Paper • 1512.03385 • Published • 5
-
LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 24 -
Attention Is All You Need
Paper • 1706.03762 • Published • 34 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 37 -
Lost in the Middle: How Language Models Use Long Contexts
Paper • 2307.03172 • Published • 31
-
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
Paper • 2312.15166 • Published • 55 -
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 235 -
LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 24 -
QLoRA: Efficient Finetuning of Quantized LLMs
Paper • 2305.14314 • Published • 41
-
LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 24 -
Instruct-Imagen: Image Generation with Multi-modal Instruction
Paper • 2401.01952 • Published • 29 -
mistralai/Mixtral-8x7B-Instruct-v0.1
Text Generation • Updated • 534k • 3.83k -
Gemma: Open Models Based on Gemini Research and Technology
Paper • 2403.08295 • Published • 43
-
Attention Is All You Need
Paper • 1706.03762 • Published • 34 -
LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 24 -
Direct Preference Optimization: Your Language Model is Secretly a Reward Model
Paper • 2305.18290 • Published • 37 -
Lost in the Middle: How Language Models Use Long Contexts
Paper • 2307.03172 • Published • 31