-
Jamba: A Hybrid Transformer-Mamba Language Model
Paper • 2403.19887 • Published • 97 -
sDPO: Don't Use Your Data All at Once
Paper • 2403.19270 • Published • 30 -
ViTAR: Vision Transformer with Any Resolution
Paper • 2403.18361 • Published • 48 -
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Paper • 2403.18814 • Published • 37
Phuong Pham
mp1704
AI & ML interests
None yet
Organizations
Collections
1
models
9
mp1704/gemma_2b_sft
Text Generation
•
Updated
•
2
mp1704/gemma_2b_pt
Text Generation
•
Updated
mp1704/qwen_1.8b_sft_full_3
Text Generation
•
Updated
•
4
mp1704/qwen_1.8b_sft_full_2
Feature Extraction
•
Updated
•
8
mp1704/qwen_1.8b_sft_full_1
Feature Extraction
•
Updated
•
3
mp1704/qwen_1.8b_sft_full
Text Generation
•
Updated
•
11
mp1704/qwen_1.8b_stage_2
Text Generation
•
Updated
•
6
mp1704/demo
Updated
mp1704/qwen_1.8b_stage_1
Text Generation
•
Updated
•
16