-
Jamba: A Hybrid Transformer-Mamba Language Model
Paper • 2403.19887 • Published • 110 -
sDPO: Don't Use Your Data All at Once
Paper • 2403.19270 • Published • 41 -
ViTAR: Vision Transformer with Any Resolution
Paper • 2403.18361 • Published • 55 -
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models
Paper • 2403.18814 • Published • 47
Phuong Pham
mp1704
AI & ML interests
None yet
Recent Activity
reacted
to
burtenshaw's
post
with ❤️
4 days ago
NEW UNIT in the Hugging Face Reasoning course. We dive deep into the algorithm behind DeepSeek R1 with an advanced and hands-on guide to interpreting GRPO.
🔗 https://huggingface.co/reasoning-course
This unit is super useful if you’re tuning models with reinforcement learning. It will help with:
- interpreting loss and reward progression during training runs
- selecting effective parameters for training
- reviewing and defining effective reward functions
This unit also works up smoothly toward the existing practical exercises form @mlabonne and Unsloth.
📣 Shout out to @ShirinYamani who wrote the unit. Follow for more great content.
liked
a dataset
4 days ago
leduckhai/VietMed
upvoted
a
collection
4 days ago
Vietnamese speech dataset
Organizations
Collections
1
models
15
mp1704/tora_7b_sft_ckpt_200
Text Generation
•
Updated
•
3
mp1704/tora_7b_pt
Text Generation
•
Updated
•
2
mp1704/gpt-neo-sft-v2.1
Text Generation
•
Updated
•
7
mp1704/gpt-neo-sft-v2
Text Generation
•
Updated
•
4
mp1704/gpt-neo-sft
Text Generation
•
Updated
•
3
mp1704/gpt-neo-pt
Text Generation
•
Updated
•
4
mp1704/gemma_2b_sft
Text Generation
•
Updated
•
3
mp1704/gemma_2b_pt
Text Generation
•
Updated
•
4
mp1704/qwen_1.8b_sft_full_3
Text Generation
•
Updated
•
3
mp1704/qwen_1.8b_sft_full_2
Feature Extraction
•
Updated
•
3