Low-Bit Quantization Favors Undertrained LLMs: Scaling Laws for Quantized LLMs with 100T Training Tokens Paper • 2411.17691 • Published Nov 26, 2024 • 9
Relaxed Recursive Transformers: Effective Parameter Sharing with Layer-wise LoRA Paper • 2410.20672 • Published Oct 28, 2024 • 6 • 3
Scaling Synthetic Data Creation with 1,000,000,000 Personas Paper • 2406.20094 • Published Jun 28, 2024 • 95
BERT-of-Theseus: Compressing BERT by Progressive Module Replacing Paper • 2002.02925 • Published Feb 7, 2020
Improving Sequence-to-Sequence Pre-training via Sequence Span Rewriting Paper • 2101.00416 • Published Jan 2, 2021
Unleashing Cognitive Synergy in Large Language Models: A Task-Solving Agent through Multi-Persona Self-Collaboration Paper • 2307.05300 • Published Jul 11, 2023 • 18
In-context Autoencoder for Context Compression in a Large Language Model Paper • 2307.06945 • Published Jul 13, 2023 • 27
SCALE: Synergized Collaboration of Asymmetric Language Translation Engines Paper • 2309.17061 • Published Sep 29, 2023 • 1
Unlocking Efficiency in Large Language Model Inference: A Comprehensive Survey of Speculative Decoding Paper • 2401.07851 • Published Jan 15, 2024 • 1
Pay Attention to Your Tone: Introducing a New Dataset for Polite Language Rewrite Paper • 2212.10190 • Published Dec 20, 2022