Li-Wei Chen

txya900619

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

kenkone/kenkone-whisper-large-v3-ja-ct2-756

txya900619's activity

upvoted a paper 1 day ago

RWKV-7 "Goose" with Expressive Dynamic State Evolution

Paper • 2503.14456 • Published 2 days ago • 102

upvoted a paper 2 days ago

PLADIS: Pushing the Limits of Attention in Diffusion Models at Inference Time by Leveraging Sparsity

Paper • 2503.07677 • Published 10 days ago • 78

upvoted 2 papers 3 days ago

Transformers without Normalization

Paper • 2503.10622 • Published 7 days ago • 129

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published 8 days ago • 58

upvoted a paper 7 days ago

YuE: Scaling Open Foundation Models for Long-Form Music Generation

Paper • 2503.08638 • Published 9 days ago • 57

upvoted 4 papers 9 days ago

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published 15 days ago • 212

S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information

Paper • 2503.05085 • Published 13 days ago • 45

EuroBERT: Scaling Multilingual Encoders for European Languages

Paper • 2503.05500 • Published 13 days ago • 73

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 13 days ago • 106

upvoted 4 papers 10 days ago

Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities

Paper • 2503.03983 • Published 15 days ago • 22

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published 14 days ago • 64

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 14 days ago • 96

Benchmarking Large Language Models for Multi-Language Software Vulnerability Detection

Paper • 2503.01449 • Published 17 days ago • 4

upvoted 7 papers 13 days ago

DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Paper • 2503.01183 • Published 17 days ago • 26

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 17 days ago • 75

LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation

Paper • 2502.20583 • Published 21 days ago • 11

Predictive Data Selection: The Data That Predicts Is the Data That Teaches

Paper • 2503.00808 • Published 18 days ago • 54

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published 22 days ago • 38

Towards an AI co-scientist

Paper • 2502.18864 • Published 22 days ago • 43

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 28 days ago • 164