1 40 12

Le Huy Hoang

splendor1811

huyhoang18112k2

AI & ML interests

Computer Vision

Recent Activity

updated a Space 11 days ago

splendor1811/AlfredAgent

published a Space 11 days ago

splendor1811/AlfredAgent

new activity 20 days ago

BarraHome/llama3.2-1b-mla:Question About MLA Usage?

View all activity

Organizations

None yet

splendor1811's activity

upvoted a paper about 1 month ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 47

upvoted 2 articles about 2 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.19k

Article

SmolVLM - small yet mighty Vision Language Model

Nov 26, 2024

• 227

upvoted a paper 2 months ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published Jan 11 • 87

upvoted a collection 2 months ago

Phi-3

Collection

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 565

upvoted a collection 5 months ago

MIT Talk 31/10 Papers

Collection

14 items • Updated Oct 28, 2024 • 31

upvoted a paper 10 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 88

upvoted an article 11 months ago

Article

Mixture of Experts Explained

Dec 11, 2023

• 486

upvoted 4 papers 11 months ago

FLAME: Factuality-Aware Alignment for Large Language Models

Paper • 2405.01525 • Published May 2, 2024 • 28

StoryDiffusion: Consistent Self-Attention for Long-Range Image and Video Generation

Paper • 2405.01434 • Published May 2, 2024 • 56

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29, 2024 • 121

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2, 2024 • 122

upvoted an article 11 months ago

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15, 2024

• 176

upvoted a collection 11 months ago

LLaVa-NeXT

Collection

LLaVa-NeXT (also known as LLaVa-1.6) improves upon the 1.5 series by incorporating higher image resolutions and more reasoning/OCR datasets. • 8 items • Updated Jul 19, 2024 • 29

upvoted a paper 11 months ago

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Paper • 2402.19427 • Published Feb 29, 2024 • 56

upvoted 4 collections 11 months ago