蓋瑞王

gary109

AI & ML interests

GAN, Music

Recent Activity

liked a dataset 6 days ago

openai/gsm8k

liked a dataset 6 days ago

miulab/tmlu

liked a Space 6 days ago

yentinglin/open-tw-llm-leaderboard

Organizations

None yet

gary109's activity

upvoted 6 articles 2 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 172

Article

The AI tools for Art Newsletter - Issue 1

Jan 31

• 76

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

• 211

Article

From Files to Chunks: Improving Hugging Face Storage Efficiency

Nov 20, 2024

• 59

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.22k

Article

Hugging Face x LangChain : A new partner package in LangChain

May 14, 2024

• 141

upvoted a collection 2 months ago

Breeze 2 Family

Collection

Llama-Breeze2 is a multi-modal language model family specifically intended for Traditional Chinese use. BreezyVoice is a Taiwan Mandarin TTS • 6 items • Updated Feb 26 • 18

upvoted an article 3 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 845

upvoted a collection 3 months ago

high-quality Chinese training datasets

Collection

A suite of high-quality Chinese datasets for pretraining, fine-tuning, or preference alignment, along with the models trained on them. • 13 items • Updated Mar 11 • 12

upvoted a paper 5 months ago

TAPTRv3: Spatial and Temporal Context Foster Robust Tracking of Any Point in Long Video

Paper • 2411.18671 • Published Nov 27, 2024 • 20

upvoted a collection 5 months ago

LLM2CLIP

Collection

LLM2CLIP makes SOTA pretrained CLIP models even more SOTA. • 11 items • Updated 5 days ago • 60

upvoted a paper 5 months ago

StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

Paper • 2411.05738 • Published Nov 8, 2024 • 15

upvoted 3 papers 7 months ago

Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale

Paper • 2409.17115 • Published Sep 25, 2024 • 63

Seeing Faces in Things: A Model and Dataset for Pareidolia

Paper • 2409.16143 • Published Sep 24, 2024 • 17

RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning

Paper • 2409.14674 • Published Sep 23, 2024 • 44

upvoted 3 papers 8 months ago

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 51

Scaling Up Diffusion and Flow-based XGBoost Models

Paper • 2408.16046 • Published Aug 28, 2024 • 10

Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17, 2024 • 23