Huu-Hiep Nguyen's picture

19 364

Huu-Hiep Nguyen

hiepnh

·

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

OpenGVLab/InternVL2_5-78B

liked a model 6 days ago

answerdotai/ModernBERT-base

liked a Space 8 days ago

TencentARC/BrushEdit

View all activity

Organizations

None yet

hiepnh's activity

upvoted 2 collections 3 months ago

Llama 3.2

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 19 days ago • 548

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 12 days ago • 81

upvoted a collection 4 months ago

Yi-Coder

4 items • Updated Sep 4 • 31

upvoted a collection 6 months ago

DeepSeekCoder-V2

6 items • Updated Sep 5 • 81

upvoted a paper 7 months ago

Your Transformer is Secretly Linear

Paper • 2405.12250 • Published May 19 • 149

upvoted 2 articles 7 months ago

Article

Open-source LLMs as LangChain Agents

Jan 24

• 39

Article

License to Call: Introducing Transformers Agents 2.0

May 13

• 119

upvoted a paper 8 months ago

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 253

upvoted a paper 10 months ago

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Paper • 2403.05530 • Published Mar 8 • 61

upvoted 4 papers about 1 year ago

AMSP: Super-Scaling LLM Training via Advanced Model States Partitioning

Paper • 2311.00257 • Published Nov 1, 2023 • 8

FlashDecoding++: Faster Large Language Model Inference on GPUs

Paper • 2311.01282 • Published Nov 2, 2023 • 35

Sparse Finetuning for Inference Acceleration of Large Language Models

Paper • 2310.06927 • Published Oct 10, 2023 • 14

Finite Scalar Quantization: VQ-VAE Made Simple

Paper • 2309.15505 • Published Sep 27, 2023 • 21

upvoted 6 papers over 1 year ago

LMDX: Language Model-based Document Information Extraction and Localization

Paper • 2309.10952 • Published Sep 19, 2023 • 65

InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation

Paper • 2309.06380 • Published Sep 12, 2023 • 32

Efficient Memory Management for Large Language Model Serving with PagedAttention

Paper • 2309.06180 • Published Sep 12, 2023 • 25

Large Language Model for Science: A Study on P vs. NP

Paper • 2309.05689 • Published Sep 11, 2023 • 20

OctoPack: Instruction Tuning Code Large Language Models

Paper • 2308.07124 • Published Aug 14, 2023 • 28

PolyLM: An Open Source Polyglot Large Language Model

Paper • 2307.06018 • Published Jul 12, 2023 • 25