Raja Biswas

rbiswasfc

AI & ML interests

NLP, Generative AI

rbiswasfc's activity

upvoted a collection about 2 hours ago

SigLIP2

36 items • Updated 3 days ago • 67

upvoted 3 papers 4 days ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 112

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 179

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published 19 days ago • 113

upvoted a paper 5 days ago

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 12 days ago • 120

upvoted a collection 5 days ago

RLVR

Model and data for 'Expanding RL with Verifiable Rewards Across Diverse Domains' • 3 items • Updated 6 days ago • 10

upvoted a collection 6 days ago

ReSearch

Trained models as described in the paper "ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning" • 5 items • Updated 10 days ago • 4

updated a dataset 7 days ago

rbiswasfc/r1-7b

Viewer • Updated 7 days ago • 64 • 151

upvoted a paper 15 days ago

Transformers without Normalization

Paper • 2503.10622 • Published 24 days ago • 153

liked a model 19 days ago

nvidia/GR00T-N1-2B

Robotics • Updated 19 days ago • 2.53k • 257

upvoted a collection 22 days ago

Model Merging

Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in the space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 236

upvoted an article 23 days ago

The N Implementation Details of RLHF with PPO

Article • Oct 24, 2023 • 48

liked a dataset 24 days ago

qihoo360/Light-R1-SFTData

Viewer • Updated 21 days ago • 79.4k • 2.39k • 33

upvoted a paper 25 days ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published 27 days ago • 96

upvoted an article 25 days ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Article • 26 days ago • 372

upvoted a collection 25 days ago

Gemma 3 Release

17 items • Updated 3 days ago • 311

liked a dataset 26 days ago

open-r1/codeforces

Viewer • Updated 2 days ago • 10k • 1.85k • 28

upvoted 2 articles 26 days ago

HuggingFace, IISc partner to supercharge model building on India's diverse languages

Article • Feb 27 • 18

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Article • Mar 4 • 73