DiveR-CT: Diversity-enhanced Red Teaming with Relaxing Constraints Paper • 2405.19026 • Published 4 days ago • 5
ConvLLaVA: Hierarchical Backbones as Visual Encoder for Large Multimodal Models Paper • 2405.15738 • Published 9 days ago • 41
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models Paper • 2403.13372 • Published Mar 20 • 57
Avalon's Game of Thoughts: Battle Against Deception through Recursive Contemplation Paper • 2310.01320 • Published Oct 2, 2023 • 9
Boosting Offline Reinforcement Learning with Action Preference Query Paper • 2306.03362 • Published Jun 6, 2023 • 2