Unchun Yang

ucyang

https://ucyang.com/

AI & ML interests

None yet

Recent Activity

liked a model about 6 hours ago

unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth

liked a model about 6 hours ago

unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-dynamic-bnb-4bit

liked a model about 6 hours ago

unsloth/Llama-4-Scout-17B-16E-Instruct-unsloth-bnb-4bit

ucyang's activity

upvoted a collection about 9 hours ago

Llama 4

Meta's new Llama 4 multimodal models, Scout & Maverick. Includes 16-bit, 8-bit and Dynamic 4-bit uploads. Fine-tune them with Unsloth! • 13 items • Updated about 2 hours ago • 21

upvoted an article about 15 hours ago

Article

Xet is on the Hub

20 days ago

• 43

upvoted an article about 18 hours ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

2 days ago

• 84

upvoted a collection 1 day ago

Llama 4

Llama 4 release • 10 items • Updated 1 day ago • 347

upvoted a paper 1 day ago

ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning

Paper • 2503.19470 • Published 13 days ago • 15

upvoted an article 3 days ago

Article

You could have designed state of the art positional encoding

Nov 25, 2024

• 211

upvoted a collection 3 days ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The models preserve quality similar to half precision while using 3x less memory • 8 items • Updated 4 days ago • 94

upvoted an article 5 days ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

12 days ago

• 99

upvoted a paper 6 days ago

MoCha: Towards Movie-Grade Talking Character Synthesis

Paper • 2503.23307 • Published 8 days ago • 92

upvoted an article 6 days ago

Article

Bamba: Inference-Efficient Hybrid Mamba2 Model

Dec 18, 2024

• 51

upvoted a paper 7 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 12 days ago • 121

upvoted a collection 10 days ago

CoRNStack

State-of-the-art code retrieval and re-ranking models and datasets • 9 items • Updated 11 days ago • 15

upvoted 3 collections 11 days ago

Qwen2.5-Omni

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based on Qwen2.5 • 3 items • Updated 11 days ago • 79

Ling

6 items • Updated 28 days ago • 8

TxGemma Release

Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 4 days ago • 43

upvoted a paper 15 days ago

Why Do Multi-Agent LLM Systems Fail?

Paper • 2503.13657 • Published 20 days ago • 42

upvoted a paper 16 days ago

SynCity: Training-Free Generation of 3D Worlds

Paper • 2503.16420 • Published 17 days ago • 25

upvoted a collection 16 days ago

💫StarVector Models

StarVector is a multimodal LLM for Scalable Vector Graphics (SVG) generation, producing structured SVG code directly from images and text. • 2 items • Updated 17 days ago • 90