Muhammad Khalifa

mkhalifa

https://mukhal.github.io/

AI & ML interests

natural language generation, reinforcement learning

Recent Activity

upvoted a paper 4 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

updated a model 10 days ago

mkhalifa/r1_14b_discriminative_prm

published a model 10 days ago

mkhalifa/r1_14b_discriminative_prm

Organizations

None yet

mkhalifa's activity

upvoted a paper 4 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 5 days ago • 56

upvoted a paper 4 months ago

If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs

Paper • 2412.04144 • Published Dec 5, 2024 • 5

upvoted 2 papers 6 months ago

On Leakage of Code Generation Evaluation Datasets

Paper • 2407.07565 • Published Jul 10, 2024 • 6

Contextual Document Embeddings

Paper • 2410.02525 • Published Oct 3, 2024 • 22

upvoted a paper 12 months ago

Source-Aware Training Enables Knowledge Attribution in Language Models

Paper • 2404.01019 • Published Apr 1, 2024 • 1

upvoted a paper over 1 year ago

Discriminator-Guided Multi-step Reasoning with Language Models

Paper • 2305.14934 • Published May 24, 2023 • 1