---
title: Tiny Hanabi
emoji: 🎆
colorFrom: red
colorTo: green
sdk: gradio
sdk_version: 5.9.1
app_file: app.py
pinned: false
python_version: 3.11
---

Tiny Hanabi

Play a simplified version of Hanabi with a trained AI model!

Play: P0 or P1 - Play the card at that position
Discard: D0 or D1 - Discard the card at that position (gain 1 info token)
Hint: 1HR, 1HG, 1H1, 1H2, 1H3 - Tell the AI which of its cards are Red or Green, or which are 1s, 2s, or 3s
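
To make the command format above concrete, here is a minimal sketch of how these strings could be validated and parsed. It is not the Space's actual `app.py` code: the `Action` type and the `parse_action` function are hypothetical names introduced for illustration, and the leading `1` in hint commands is treated as a literal part of the command because its meaning is not stated here. Accepting lowercase input and surrounding whitespace is also an assumption.

```python
import re
from typing import NamedTuple, Optional


class Action(NamedTuple):
    """Hypothetical parsed form of one command string."""
    kind: str                 # "play", "discard", or "hint"
    position: Optional[int]   # hand position (0 or 1) for play/discard
    hint: Optional[str]       # "R", "G", "1", "2", or "3" for hints


# P0/P1 and D0/D1: a letter followed by a hand position.
_PLAY_OR_DISCARD = re.compile(r"([PD])([01])")
# 1HR, 1HG, 1H1, 1H2, 1H3: the leading "1" is matched literally.
_HINT = re.compile(r"1H([RG123])")


def parse_action(text: str) -> Action:
    """Parse a command such as "P0", "D1", or "1HG" into an Action."""
    token = text.strip().upper()  # assumption: input is case-insensitive

    match = _PLAY_OR_DISCARD.fullmatch(token)
    if match:
        kind = "play" if match.group(1) == "P" else "discard"
        return Action(kind=kind, position=int(match.group(2)), hint=None)

    match = _HINT.fullmatch(token)
    if match:
        return Action(kind="hint", position=None, hint=match.group(1))

    raise ValueError(f"Unrecognized action: {text!r}")
```

With this sketch, `parse_action("P0")` yields a play action for position 0, while an input outside the listed commands, such as `"P2"`, raises `ValueError`.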

The AI uses nph4rd/Qwen3-1.7B-Tiny-Hanabi-XML-RL-12-2, a Qwen3-1.7B model fine-tuned with reinforcement learning on this Tiny Hanabi environment.