kolerk's Collections
TON

updated May 23

Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models.

  • kolerk/TON-3B-AITZ

    Image-Text-to-Text • 4B • Updated May 23 • 8

  • kolerk/TON-3B-CLEVR

    Image-Text-to-Text • 4B • Updated May 23 • 5

  • kolerk/TON-3B-Math

    Image-Text-to-Text • 4B • Updated May 23 • 6

  • kolerk/TON-7B-Math

    Image-Text-to-Text • 8B • Updated May 23 • 10

  • kolerk/TON-AITZ-SFT

    Preview • Updated May 23 • 33

  • kolerk/TON-Math-SFT

    Viewer • Updated May 23 • 8.03k • 78

  • Think or Not? Selective Reasoning via Reinforcement Learning for Vision-Language Models

    Paper • 2505.16854 • Published May 22 • 11