Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Buckets new
  • Docs
  • Enterprise
  • Pricing
    • Website
      • Tasks
      • HuggingChat
      • Collections
      • Languages
      • Organizations
    • Community
      • Blog
      • Posts
      • Daily Papers
      • Learn
      • Discord
      • Forum
      • GitHub
    • Solutions
      • Team & Enterprise
      • Hugging Face PRO
      • Enterprise Support
      • Inference Providers
      • Inference Endpoints
      • Storage Buckets

  • Log In
  • Sign Up

ishidalab

university
https://takashiishida.github.io
Activity Feed

AI & ML interests

None defined yet.

Recent Activity

tksii  authored a paper about 4 hours ago
Mitigating Reward Hacking in RLHF via Advantage Sign Robustness
tksii  authored a paper about 4 hours ago
LLM Routing with Dueling Feedback
tksii  authored a paper about 4 hours ago
Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests
View all activity

Papers

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

View all Papers

Takashi Ishida's profile pictureThanawat Lodkaew's profile pictureTANG's profile pictureBY's profile picture
ishidalab 's papers 2
Submitted by
Thanawat Lodkaew
5

Do Coding Agents Deceive Us? Detecting and Preventing Cheating via Capped Evaluation with Randomized Tests

ishidalab ishidalab
2
2

How Can I Publish My LLM Benchmark Without Giving the True Answers Away?

ishidalab ishidalab
8
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs