On the Multi-turn Instruction Following for Conversational Web Agents Paper • 2402.15057 • Published Feb 23, 2024 • 1
Ask-before-Plan: Proactive Language Agents for Real-World Planning Paper • 2406.12639 • Published Jun 18, 2024 • 1
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation Paper • 2503.19950 • Published 14 days ago • 10
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation Paper • 2503.19950 • Published 14 days ago • 10
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation Paper • 2503.19950 • Published 14 days ago • 10 • 2
Feather-SQL: A Lightweight NL2SQL Framework with Dual-Model Collaboration Paradigm for Small Language Models Paper • 2503.17811 • Published 17 days ago • 13
Feather-SQL: A Lightweight NL2SQL Framework with Dual-Model Collaboration Paradigm for Small Language Models Paper • 2503.17811 • Published 17 days ago • 13