Gagan Bhatia's picture

Gagan Bhatia

gagan3012

·

AI & ML interests

None yet

Recent Activity

updated a model 10 days ago

gagan3012/Qwen-2.5-reasoning-verifier

published a model 10 days ago

gagan3012/Qwen-2.5-reasoning-verifier

updated a dataset 11 days ago

gagan3012/Sky-T1_preference_data_10k_reward_templated

View all activity

Organizations

gagan3012's activity

upvoted 2 papers about 1 month ago

Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization

Paper • 2410.09302 • Published Oct 11, 2024 • 1

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

upvoted a paper about 2 months ago

ProcessBench: Identifying Process Errors in Mathematical Reasoning

Paper • 2412.06559 • Published Dec 9, 2024 • 79

upvoted a paper 2 months ago

Large Multi-modal Models Can Interpret Features in Large Multi-modal Models

Paper • 2411.14982 • Published Nov 22, 2024 • 16

upvoted a collection 2 months ago

🧠 Reasoning Models

9 items • Updated 15 days ago • 37

upvoted a paper 3 months ago

Swan and ArabicMTEB: Dialect-Aware, Arabic-Centric, Cross-Lingual, and Cross-Cultural Embedding Models and Benchmarks

Paper • 2411.01192 • Published Nov 2, 2024 • 3

upvoted 2 papers 4 months ago

General Preference Modeling with Preference Representations for Aligning Language Models

Paper • 2410.02197 • Published Oct 3, 2024 • 8

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 76

upvoted a paper 6 months ago

Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic

Paper • 2407.18129 • Published Jul 25, 2024 • 12

upvoted a paper 7 months ago

Qalam : A Multimodal LLM for Arabic Optical Character and Handwriting Recognition

Paper • 2407.13559 • Published Jul 18, 2024 • 14

upvoted a paper 8 months ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 87

upvoted an article 9 months ago

Article

Introducing the Open Arabic LLM Leaderboard

May 14, 2024

• 78

upvoted an article 10 months ago

Article

Custom architectures with HuggingFace 🤗

By

•

Apr 22, 2024

• 25

upvoted a paper 11 months ago

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5, 2024 • 94

upvoted a collection 11 months ago

Finance

12 items • Updated Jun 8, 2024 • 4

upvoted 2 papers 11 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29, 2024 • 137

FuseChat: Knowledge Fusion of Chat Models

Paper • 2402.16107 • Published Feb 25, 2024 • 37

upvoted 3 papers 12 months ago

The FinBen: An Holistic Financial Benchmark for Large Language Models

Paper • 2402.12659 • Published Feb 20, 2024 • 21

FinTral: A Family of GPT-4 Level Multimodal Financial Large Language Models

Paper • 2402.10986 • Published Feb 16, 2024 • 78

SPAR: Personalized Content-Based Recommendation via Long Engagement Attention

Paper • 2402.10555 • Published Feb 16, 2024 • 35