Dong Hai Phuong Nguyen
phuong-d-h-nguyen
AI & ML interests
LLM, RL, CV
Recent Activity
updated
a collection
15 days ago
CoT
upvoted
a
paper
15 days ago
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth
Approach
liked
a model
16 days ago
codellama/CodeLlama-70b-Instruct-hf
Organizations
Collections
9
-
PERL: Parameter Efficient Reinforcement Learning from Human Feedback
Paper • 2403.10704 • Published • 58 -
HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models
Paper • 2403.13447 • Published • 18 -
Self-Discover: Large Language Models Self-Compose Reasoning Structures
Paper • 2402.03620 • Published • 115 -
RAFT: Adapting Language Model to Domain Specific RAG
Paper • 2403.10131 • Published • 69
spaces
1
models
None public yet
datasets
None public yet