5 409

Literate Goggles

literate-goggles

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

DDT: Decoupled Diffusion Transformer

upvoted an article 2 days ago

Hugging Face and Cloudflare Partner to Make Real-Time Speech and Video Seamless with FastRTC

upvoted an article 7 days ago

The NLP Course is becoming the LLM Course!

View all activity

Organizations

None yet

literate-goggles's activity

upvoted a paper 1 day ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 3 days ago • 57

upvoted an article 2 days ago

Article

Hugging Face and Cloudflare Partner to Make Real-Time Speech and Video Seamless with FastRTC

3 days ago

• 14

upvoted an article 7 days ago

Article

The NLP Course is becoming the LLM Course!

9 days ago

• 66

upvoted a paper 8 days ago

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published 10 days ago • 78

upvoted a paper 15 days ago

Gemini Robotics: Bringing AI into the Physical World

Paper • 2503.20020 • Published 17 days ago • 23

upvoted a paper 18 days ago

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

Paper • 2503.16430 • Published 22 days ago • 34

upvoted a paper 22 days ago

Scaling Rich Style-Prompted Text-to-Speech Datasets

Paper • 2503.04713 • Published Mar 6 • 1

upvoted 2 papers 26 days ago

Transformers without Normalization

Paper • 2503.10622 • Published 29 days ago • 155

WildIFEval: Instruction Following in the Wild

Paper • 2503.06573 • Published Mar 9 • 11

upvoted 2 papers about 1 month ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 116

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 73

upvoted an article about 2 months ago

Article

SigLIP 2: A better multilingual vision language encoder

Feb 21

• 149

upvoted 8 papers about 2 months ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 190

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

Paper • 2502.05139 • Published Feb 7 • 1

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 151

Region-Adaptive Sampling for Diffusion Transformers

Paper • 2502.10389 • Published Feb 14 • 53

Language Models Use Trigonometry to Do Addition

Paper • 2502.00873 • Published Feb 2 • 1