16 12

山田一郎

basic99

AI & ML interests

None yet

Recent Activity

liked a dataset about 8 hours ago

electricsheepasia/asia-owid-aquaculture-farmed-fish-production

upvoted a paper 3 days ago

Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases

upvoted a paper 6 days ago

"I didn't Make the Micro Decisions": Measuring, Inducing, and Exposing Goal-Level AI Contributions in Collaboration

View all activity

Organizations

None yet

liked a dataset about 8 hours ago

electricsheepasia/asia-owid-aquaculture-farmed-fish-production

Viewer • Updated about 8 hours ago • 3.07k • 1

upvoted a paper 3 days ago

Alignment Tampering: How Reinforcement Learning from Human Feedback Is Exploited to Optimize Misaligned Biases

Paper • 2605.27355 • Published 8 days ago • 5

upvoted a paper 6 days ago

"I didn't Make the Micro Decisions": Measuring, Inducing, and Exposing Goal-Level AI Contributions in Collaboration

Paper • 2605.21363 • Published 14 days ago • 6

upvoted a paper 10 days ago

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

Paper • 2605.21467 • Published 14 days ago • 204

liked a dataset 11 days ago

liuhaotian/LLaVA-Instruct-150K

Preview • Updated Jan 3, 2024 • 6.62k • 609

liked a model 12 days ago

Nonene/sdxl_models

Updated 1 day ago • 8

liked a dataset 15 days ago

stefanocarrera/autophagycode_D_he_train-mercury_Qwen3-4B_strategy_trust_t0.75_g1_run2

Viewer • Updated 15 days ago • 164 • 73 • 1

upvoted a paper 19 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 27 days ago • 232

liked a model 19 days ago

Qwen/Qwen3-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated Oct 23, 2025 • 2.22M • 418

liked a dataset 22 days ago

yalhessi/lemexp-commercial-llm-experiment-results

Viewer • Updated 22 days ago • 3.89k • 225 • 1

liked a model about 1 month ago

realml/ocr-gemma-3-4b-it

Text Generation • Updated May 2 • 10 • 1

liked a dataset about 1 month ago

GaryYang123/zh-meme-sft-8k

Viewer • Updated Apr 20 • 8.68k • 319 • 80

liked a dataset 2 months ago

OpenMOSS-Team/OmniAction

Updated Mar 27 • 158k • 282

upvoted a paper 2 months ago

MinerU-Diffusion: Rethinking Document OCR as Inverse Rendering via Diffusion Decoding

Paper • 2603.22458 • Published Mar 23 • 136

upvoted 4 papers 3 months ago

InCoder-32B: Code Foundation Model for Industrial Scenarios

Paper • 2603.16790 • Published Mar 17 • 311

HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions

Paper • 2603.15612 • Published Mar 16 • 153

Bootstrapping Exploration with Group-Level Natural Language Feedback in Reinforcement Learning

Paper • 2603.04597 • Published Mar 4 • 211

Heterogeneous Agent Collaborative Reinforcement Learning

Paper • 2603.02604 • Published Mar 3 • 197

liked 2 models 3 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 5.36M • • 13.4k

zai-org/GLM-5

Text Generation • 754B • Updated Apr 5 • 116k • • 2.09k

山田一郎

AI & ML interests

Recent Activity

Organizations

basic99's activity