Pham Minh Tuan's picture

Pham Minh Tuan

1TuanPham

·

vTuanPham

AI & ML interests

None yet

Recent Activity

liked a dataset about 13 hours ago

PepitoTheCat2007/pepito-images-light

liked a dataset 3 days ago

glaiveai/reasoning-v1-20m

upvoted a paper 5 days ago

Gemini Robotics: Bringing AI into the Physical World

View all activity

Organizations

1TuanPham's activity

upvoted a paper 5 days ago

Gemini Robotics: Bringing AI into the Physical World

Paper • 2503.20020 • Published 8 days ago • 21

upvoted a collection 12 days ago

💫StarVector Models

StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated 13 days ago • 88

upvoted a collection 21 days ago

Gemma 3 Release

17 items • Updated 6 days ago • 303

upvoted a collection about 1 month ago

PaliGemma 2 Mix

13 items • Updated 6 days ago • 60

upvoted a paper about 1 month ago

Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model

Paper • 2502.10248 • Published Feb 14 • 54

upvoted a collection 2 months ago

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Feb 26 • 111

upvoted 2 collections 3 months ago

Cosmos

The collection of Cosmos models • 31 items • Updated 7 days ago • 279

TimesFM Release

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. • 4 items • Updated 6 days ago • 14

upvoted a paper 3 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 363

upvoted 5 collections 4 months ago

[MASK] is All You Need

Code, dataset, and pretrained model • 6 items • Updated Feb 6 • 9

EXAONE-3.5

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B • 10 items • Updated 16 days ago • 107

Llama 3.3

This collection hosts the transformers and original repos of the Llama 3.3 • 1 item • Updated Dec 6, 2024 • 150

PaliGemma 2 Release

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 32 items • Updated 6 days ago • 146

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated 17 days ago • 90

upvoted a paper 4 months ago

One Diffusion to Generate Them All

Paper • 2411.16318 • Published Nov 25, 2024 • 29

upvoted a collection 4 months ago

AIMv2

A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 74

upvoted 4 collections 5 months ago

Athene-V2

2 items • Updated Nov 14, 2024 • 7

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Feb 26 • 578

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated Feb 20 • 251

MobileLLM

Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 111