HAE-RAE

non-profit

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

seungone authored a paper 5 days ago

M-Prometheus: A Suite of Open Multilingual LLM Judges

seungone authored a paper 5 days ago

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators

Albertmade authored a paper 9 days ago

HRET: A Self-Evolving LLM Evaluation Toolkit for Korean

View all activity

HAERAE-HUB's activity

seungone

authored 2 papers 5 days ago

M-Prometheus: A Suite of Open Multilingual LLM Judges

Paper • 2504.04953 • Published 7 days ago

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators

Paper • 2503.19877 • Published 20 days ago

Albertmade

authored a paper 9 days ago

HRET: A Self-Evolving LLM Evaluation Toolkit for Korean

Paper • 2503.22968 • Published 17 days ago

amphora

authored a paper about 2 months ago

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published Feb 24 • 25

Cartinoe5930

authored 2 papers about 2 months ago

Multi-Step Reasoning in Korean and the Emergent Mirage

Paper • 2501.05712 • Published Jan 10

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning

Paper • 2502.17407 • Published Feb 24 • 25

Cartinoe5930

updated a dataset about 2 months ago

HAERAE-HUB/HRM8K

Viewer • Updated Feb 14 • 8.01k • 2.02k • 17

Albertmade

authored 5 papers 2 months ago

TWICE: What Advantages Can Low-Resource Domain-Specific Embedding Model Bring? - A Case Study on Korea Financial Texts

Paper • 2502.07131 • Published Feb 10

HAE-RAE Bench: Evaluation of Korean Knowledge in Language Models

Paper • 2309.02706 • Published Sep 6, 2023 • 2

KMMLU: Measuring Massive Multitask Language Understanding in Korean

Paper • 2402.11548 • Published Feb 18, 2024

Removing Non-Stationary Knowledge From Pre-Trained Language Models for Entity-Level Sentiment Classification in Finance

Paper • 2301.03136 • Published Jan 9, 2023

EaSyGuide : ESG Issue Identification Framework leveraging Abilities of Generative Large Language Models

Paper • 2306.06662 • Published Jun 11, 2023

Muennighoff

authored a paper 2 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 117

amphora

in HAERAE-HUB/HAE_RAE_BENCH_1.1 2 months ago

date understanding query 이슈

#3 opened 3 months ago by

kimcando

Dasool

updated a dataset 3 months ago

HAERAE-HUB/butterflies_and_moths_vqa

Viewer • Updated Jan 21 • 400 • 34

Dasool

published a dataset 3 months ago

HAERAE-HUB/butterflies_and_moths_vqa

Viewer • Updated Jan 21 • 400 • 34

amphora

updated a dataset 3 months ago

HAERAE-HUB/hret_agent_idavidrein_gpqa_diamond_translated

Viewer • Updated Jan 20 • 5 • 30

amphora

published a dataset 3 months ago

HAERAE-HUB/hret_agent_idavidrein_gpqa_diamond_translated

Viewer • Updated Jan 20 • 5 • 30

Cartinoe5930

authored a paper 3 months ago

LLM-as-a-Judge & Reward Model: What They Can and Cannot Do

Paper • 2409.11239 • Published Sep 17, 2024 • 2

amphora

updated a dataset 3 months ago

HAERAE-HUB/HRMCR

Viewer • Updated Jan 13 • 100 • 85 • 2

AI & ML interests

Recent Activity

Team members 18

HAERAE-HUB's activity

date understanding query 이슈