Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment Paper • 2405.03594 • Published May 6, 2024
Uncovering mesa-optimization algorithms in Transformers Paper • 2309.05858 • Published Sep 11, 2023