Oguzhan Gencoglu

Ouz-G
·

AI & ML interests

LLM Evals

Recent Activity

updated a Space 15 days ago
root-signals/RootEvaluatorsDemo
updated a Space 15 days ago
root-signals/CustomJudgeDemo
View all activity

Organizations

Root Signals AI's profile picture

Ouz-G's activity

New activity in WildEval/ZebraLogic 28 days ago

Human baseline

#1 opened 28 days ago by
Ouz-G
New activity in WildEval/ZebraLogic 28 days ago

Human baseline for ZebraLogic

#3 opened 28 days ago by
Ouz-G
New activity in 6cf/liveideabench about 1 month ago

Do you have human baselines?

2
#2 opened about 1 month ago by
Ouz-G
New activity in marianna13/AIW-responses about 1 month ago

Is human baseline available?

#1 opened about 1 month ago by
Ouz-G
New activity in cais/hle about 1 month ago

Do you have human baseline?

#3 opened about 1 month ago by
Ouz-G
New activity in Salesforce/GIFT-Eval about 1 month ago
New activity in IGNF/FLAIR-INC_rgb_15cl_resnet34-unet about 2 months ago
New activity in Salesforce/GIFT-Eval 2 months ago
New activity in justin-zk/Personalize-SAM 2 months ago
New activity in bigcode/bigcodebench-leaderboard 2 months ago

Human baseline

1
#8 opened 2 months ago by
Ouz-G
New activity in open-llm-leaderboard/open_llm_leaderboard 2 months ago

Human Performance row

1
#1050 opened 2 months ago by
Ouz-G
New activity in shenyunhang/APE_demo 4 months ago

Demo is down

#2 opened 4 months ago by
Ouz-G
New activity in huawei-noah/human_rank_eval 4 months ago
New activity in jadechoghari/OmniParser 4 months ago
New activity in gabrielvaz/microsoft-OmniParser 4 months ago

Space seems to be down

#1 opened 4 months ago by
Ouz-G
New activity in IGNF/FLAIR-INC_rgbie_12cl_resnet34-unet 4 months ago

config file is missing

1
#1 opened 4 months ago by
Ouz-G
New activity in google/frames-benchmark 5 months ago