Leon Tsou's picture

Leon Tsou

xxrjun

·

AI & ML interests

None yet

Recent Activity

liked a Space 10 days ago

nanotron/ultrascale-playbook

liked a model 22 days ago

MediaTek-Research/BreezyVoice

new activity 26 days ago

MediaTek-Research/Llama-Breeze2-8B-Instruct:Inference with vLLM

View all activity

Organizations

xxrjun's activity

upvoted a collection 27 days ago

InternVL2.5

Better than InternVL 2.0 • 19 items • Updated 11 days ago • 88

upvoted 2 collections about 1 month ago

Taiwan LLM

Try out at twllm.com ! • 28 items • Updated 20 days ago • 40

DeepSeek-R1

8 items • Updated Jan 21 • 576

upvoted a paper 5 months ago

BGE M3-Embedding: Multi-Lingual, Multi-Functionality, Multi-Granularity Text Embeddings Through Self-Knowledge Distillation

Paper • 2402.03216 • Published Feb 5, 2024 • 5

upvoted a collection 6 months ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024 • 40

upvoted an article 6 months ago

Article

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

By

•

Aug 26, 2024

• 51

upvoted a collection 6 months ago

Jamba 1.5

The AI21 Jamba family of models are state-of-the-art, hybrid SSM-Transformer instruction following foundation models • 2 items • Updated 8 days ago • 85

upvoted 3 papers 6 months ago

Proximal Policy Optimization Algorithms

Paper • 1707.06347 • Published Jul 20, 2017 • 8

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22, 2024 • 256

Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment

Paper • 2408.06266 • Published Aug 12, 2024 • 10

upvoted a collection 7 months ago

Preference Datasets for KTO

This collection contains a list of curated preference datasets for KTO fine-tuning for intent alignment of LLMs through signals. • 5 items • Updated Dec 11, 2024 • 15

upvoted a paper 7 months ago

Direct Preference Optimization: Your Language Model is Secretly a Reward Model

Paper • 2305.18290 • Published May 29, 2023 • 54

upvoted 3 collections 7 months ago

Video

Stability AI's suite of image-to-video models • 5 items • Updated Jan 9 • 78

Transformers compatible Mamba

This release includes the `mamba` repositories compatible with the `transformers` library • 5 items • Updated Mar 6, 2024 • 37

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated Jan 8 • 565

upvoted 3 collections 8 months ago

Llama 3.1

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 653

DCLM

DCLM Models + Datasets • 7 items • Updated Jul 22, 2024 • 43

Meta Llama 3

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Dec 6, 2024 • 720

upvoted an article 8 months ago

Article

Everything About Long Context Fine-tuning

By

•

May 10, 2024

• 41

upvoted a paper 8 months ago

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Paper • 2401.06066 • Published Jan 11, 2024 • 52