ishidalab

university

https://takashiishida.github.io

AI & ML interests

None defined yet.

Recent Activity

tksii authored a paper about 4 hours ago

Mitigating Reward Hacking in RLHF via Advantage Sign Robustness

tksii authored a paper about 4 hours ago

LLM Routing with Dueling Feedback

tksii authored a paper about 4 hours ago

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

View all activity

Papers

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

View all Papers

ishidalab 's papers 2

Submitted by

Thanawat Lodkaew

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

ishidalab

2

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

ishidalab