Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
onekq 
posted an update 4 days ago
Post
1523
I like to benchmark 💵o1-pro💵 but it is way too expensive for me 🤦‍♂️

Its expensive for everyone, just go with o3-mini, they just figured out that they are not the single llm provider and just doubled the cost of r1 for o3-mini.

·

I tested and ranked every model drop for my leaderboard https://huggingface.co/spaces/onekq-ai/WebApp1K-models-leaderboard but this time I gave up.

Whatever questions this model aims to solve, they are out of my league.

I've used it before when i used to be an OpenAI customer, Its good, not as good as you might think though. I assume LMArena, etc will do some benches too, as well as simple bench, etc.

Anyways, DeepSeek R1 is really good and cost effective, and they are allegedly making great progress on R2.

·

Yes! I'm looking forward to R2