ChatBench Datasets and Simulators (same prompt + fine-tuning set-up) from the ChatBench paper.
AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
GUI-360: A Comprehensive Dataset and Benchmark for Computer-Using Agents
Latent Sketchpad: Sketching Visual Thoughts to Elicit Multimodal Reasoning in MLLMs
spaces
24
pinned
Running
13
MageBench Leaderboard
🥇
This is a leaderboard for magebench
Running
46
Phi 4 Mini
🌍
Demos for Phi-4-mini-instruct model
Running
37
ThoughtsOrganizer
🔥
Transform your spoken thoughts into organized insights
Runtime error
4.78k
TRELLIS
🏢
Scalable and Versatile 3D Generation from images
Running
34
PhineSpeechTranslator
👀
Break the language barrier
Build error
9
StoriesComeAlive
🏆
Transform handwritten moments into spoken memories
models
426
microsoft/skala
Updated
•
2
microsoft/bioemu
Updated
•
19
microsoft/BiomedParse
Updated
•
3.27k
•
96
microsoft/UserLM-8b
Text Generation
•
8B
•
Updated
•
2.8k
•
346
microsoft/rad-dino
Image Feature Extraction
•
86.6M
•
Updated
•
39.9k
•
68
microsoft/Phi-4-mini-flash-reasoning
Text Generation
•
4B
•
Updated
•
2.47k
•
249
microsoft/Phi-4-mini-instruct-onnx
Updated
•
184
•
40
microsoft/latent-zoning-networks
Updated
•
16
microsoft/Phi-Ground
Updated
•
218
•
19
microsoft/VibeVoice-1.5B
Text-to-Speech
•
3B
•
Updated
•
169k
•
1.97k
datasets
78
microsoft/SWE-Sharp-Bench
Viewer
•
Updated
•
150
•
302
microsoft/sigmacollab
Updated
•
73
•
1
microsoft/SYNUR
Preview
•
Updated
•
59
•
3
microsoft/SIMORD
Updated
•
25
•
3
microsoft/PatientSafetyBench
Viewer
•
Updated
•
466
•
77
•
2
microsoft/claimify-dataset
Viewer
•
Updated
•
6.49k
•
70
•
4
microsoft/LiveDRBench
Viewer
•
Updated
•
110
•
165
•
5
microsoft/CoSAlign-Train
Viewer
•
Updated
•
125k
•
75
•
1
microsoft/CoSApien
Viewer
•
Updated
•
200
•
114
•
1
microsoft/SynTrail
Updated
•
14
•
1