6 55 75

Xingye

PlanetMoon

PlanetMoon

AI & ML interests

Time series, Foundation Model, Machine Learning, Artificial Intelligence.

Recent Activity

liked a Space 7 days ago

jamesliu1217/EasyControl_Ghibli

upvoted an article 14 days ago

Training and Finetuning Embedding Models with Sentence Transformers v3

liked a dataset 16 days ago

mlabonne/FineTome-100k

View all activity

Organizations

PlanetMoon's activity

upvoted an article 14 days ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

May 28, 2024

• 206

upvoted 2 papers about 2 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 179

Agency Is Frame-Dependent

Paper • 2502.04403 • Published Feb 6 • 23

upvoted a paper 8 months ago

Language Model Can Listen While Speaking

Paper • 2408.02622 • Published Aug 5, 2024 • 42

upvoted a paper 9 months ago

Stable Audio Open

Paper • 2407.14358 • Published Jul 19, 2024 • 27

upvoted a collection 9 months ago

BigVGAN

Collection

BigVGAN is a universal neural vocoder that generates audio waveform using mel spectrogram as input. • 11 items • Updated 5 days ago • 12

upvoted a paper 9 months ago

FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

Paper • 2407.04051 • Published Jul 4, 2024 • 40

upvoted a collection 11 months ago

Standard-format-preference-dataset

Collection

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8, 2024 • 24

upvoted a paper 12 months ago

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23, 2024 • 33

upvoted an article 12 months ago

Article

Introducing the Open Chain of Thought Leaderboard

Apr 23, 2024

• 31

upvoted 2 papers about 1 year ago

NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models

Paper • 2403.03100 • Published Mar 5, 2024 • 38

Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition

Paper • 2402.15504 • Published Feb 23, 2024 • 23

upvoted a collection over 1 year ago

Seamless Communication

Collection

A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16, 2024 • 154

upvoted 6 papers over 1 year ago

Vision Transformers Need Registers

Paper • 2309.16588 • Published Sep 28, 2023 • 79

Low-rank Adaptation of Large Language Model Rescoring for Parameter-Efficient Speech Recognition

Paper • 2309.15223 • Published Sep 26, 2023 • 19

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

Paper • 2309.15807 • Published Sep 27, 2023 • 32

LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models

Paper • 2309.15103 • Published Sep 26, 2023 • 42

A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models

Paper • 2309.11674 • Published Sep 20, 2023 • 31

LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent

Paper • 2309.12311 • Published Sep 21, 2023 • 17