J C's picture

J C

dark-pen

·

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

google/gemma-3-27b-it

liked a model 2 days ago

Blancy/Qwen-2.5-7B-Simple-RL

liked a dataset 2 days ago

librarian-bots/paper-recommendations-v2

View all activity

Organizations

None yet

dark-pen's activity

upvoted 2 collections 8 days ago

TheoremExplain

2 items • Updated 15 days ago • 2

Process Reward Models

Model and Datasets for Qwen 2.5 Math PRM 7B • 6 items • Updated 23 days ago • 2

upvoted a paper 11 days ago

From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline

Paper • 2406.11939 • Published Jun 17, 2024 • 7

upvoted a collection 11 days ago

Prompt-to-Leaderboard

19 items • Updated 16 days ago • 8

upvoted a collection 12 days ago

Slam

All resources for SpeechLMs from "Slamming: Training a Speech Language Model on One GPU in a Day". We provide tokeniser, lm, and datasets • 6 items • Updated 17 days ago • 13

upvoted 3 collections 13 days ago

INF-Retriever-v1

LLM-based dense retrieval models for EN & ZH (also effective in other languages) • 2 items • Updated 17 days ago • 1

olmOCR

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated about 2 hours ago • 92

EgoLife

CVPR 2025 - EgoLife: Towards Egocentric Life Assistant. Homepage: https://egolife-ai.github.io/ • 10 items • Updated 7 days ago • 13

upvoted a collection 15 days ago

UGround

UGround: Universal GUI Visual Grounding for GUI Agents (ICLR'25 Oral) • 9 items • Updated 25 days ago • 4

upvoted a paper 16 days ago

FantasyID: Face Knowledge Enhanced ID-Preserving Video Generation

Paper • 2502.13995 • Published 22 days ago • 8

upvoted a collection 18 days ago

Step-Audio

Step-Audio model family, including Audio-Tokenizer, Audio-Chat and TTS • 3 items • Updated 24 days ago • 30

upvoted a collection 23 days ago

Deepseek Papers

Deepseek papers collection • 18 items • Updated 23 days ago • 168

upvoted a collection 24 days ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 14 items • Updated 2 days ago • 100

upvoted a collection 29 days ago

AceCoder

13 items • Updated 29 days ago • 6

upvoted a paper about 1 month ago

MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning

Paper • 2410.18035 • Published Oct 23, 2024 • 1

upvoted a collection about 1 month ago

Gemma-2-9B-it-Advanced

Merges of the advanced research fine tunes of gemma-2 9B it • 3 items • Updated Oct 20, 2024 • 3

upvoted a paper about 1 month ago

TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models

Paper • 2410.10818 • Published Oct 14, 2024 • 17

upvoted a collection about 1 month ago

VideoChat-Flash

Faster and more powerful VideoChat. • 15 items • Updated 5 days ago • 10

upvoted a paper about 1 month ago

Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes

Paper • 2410.16930 • Published Oct 22, 2024 • 8

upvoted a collection about 1 month ago

TinySQL

"Convert English query to a SQL command" models and training data. • 26 items • Updated Jan 29 • 2