-
Addition is All You Need for Energy-efficient Language Models
Paper • 2410.00907 • Published • 144 -
Emu3: Next-Token Prediction is All You Need
Paper • 2409.18869 • Published • 91 -
An accurate detection is not all you need to combat label noise in web-noisy datasets
Paper • 2407.05528 • Published • 3 -
Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP
Paper • 2407.00402 • Published • 22
meng shao
meng-shao
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
1 day ago
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs
liked
a model
4 days ago
THUDM/glm-edge-1.5b-chat
upvoted
a
paper
6 days ago
SketchAgent: Language-Driven Sequential Sketch Generation
Organizations
Collections
2
-
DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation
Paper • 2410.00201 • Published -
Does RAG Introduce Unfairness in LLMs? Evaluating Fairness in Retrieval-Augmented Generation Systems
Paper • 2409.19804 • Published -
Rethinking Conventional Wisdom in Machine Learning: From Generalization to Scaling
Paper • 2409.15156 • Published -
Just ASR + LLM? A Study on Speech Large Language Models' Ability to Identify and Understand Speaker in Spoken Dialogue
Paper • 2409.04927 • Published
models
None public yet
datasets
None public yet