Shivanand Guness

shivam11

AI & ML interests

None yet

Recent Activity

updated a collection about 2 months ago

Interesting papers

Organizations

None yet

shivam11's activity

upvoted an article 2 months ago

Open-R1: a fully open reproduction of DeepSeek-R1

Article • Jan 28 • 834

upvoted a paper 3 months ago

ReMoE: Fully Differentiable Mixture-of-Experts with ReLU Routing

Paper • 2412.14711 • Published Dec 19, 2024 • 16

upvoted 2 papers 4 months ago

Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents

Paper • 2411.16740 • Published Nov 23, 2024 • 2

VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation

Paper • 2412.10704 • Published Dec 14, 2024 • 15

upvoted a collection 4 months ago

Frugal AI Challenge Tasks

Find the 3 datasets for the Frugal AI Challenge in this Collection! 🌎 Find all the details of the challenge at https://frugalaichallenge.org/ • 7 items • Updated Jan 6 • 21

upvoted a collection 5 months ago

WildBench

4 items • Updated 21 days ago • 6

upvoted a paper 6 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 112

upvoted a paper 7 months ago

MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct

Paper • 2409.05840 • Published Sep 9, 2024 • 48

upvoted a paper 11 months ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published May 13, 2024 • 70

upvoted a paper about 1 year ago

RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation

Paper • 2403.05313 • Published Mar 8, 2024 • 9