Human-like Episodic Memory for Infinite Context LLMs — arXiv:2407.09450 — Published Jul 12, 2024
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone — arXiv:2404.14219 — Published Apr 22, 2024
TransformerFAM: Feedback Attention is Working Memory — arXiv:2404.09173 — Published Apr 14, 2024
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length — arXiv:2404.08801 — Published Apr 12, 2024
Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention — arXiv:2404.07143 — Published Apr 10, 2024
PERL: Parameter Efficient Reinforcement Learning from Human Feedback — arXiv:2403.10704 — Published Mar 15, 2024
MoAI: Mixture of All Intelligence for Large Language and Vision Models — arXiv:2403.07508 — Published Mar 12, 2024
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits — arXiv:2402.17764 — Published Feb 27, 2024
MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation — arXiv:2401.04468 — Published Jan 9, 2024
Understanding LLMs: A Comprehensive Overview from Training to Inference — arXiv:2401.02038 — Published Jan 4, 2024
DocLLM: A Layout-Aware Generative Language Model for Multimodal Document Understanding — arXiv:2401.00908 — Published Dec 31, 2023
VCoder: Versatile Vision Encoders for Multimodal Large Language Models — arXiv:2312.14233 — Published Dec 21, 2023