Heegyu Kim PRO

heegyu

https://sites.google.com/view/heegyu-kim/

AI & ML interests

NLP

Recent Activity

liked a dataset about 6 hours ago

glaiveai/reasoning-v1-20m

new activity 1 day ago

iknow-lab/open-materials-guide-2024:Improve dataset card: Add task categories, Github link, clarify license

updated a model 9 days ago

heegyu/EXAONE-3.5-32B-Instruct-FP16

heegyu's activity

upvoted a paper 29 days ago

EgoSpeak: Learning When to Speak for Egocentric Conversational Agents in the Wild

Paper • 2502.14892 • Published Feb 17 • 6

upvoted an article about 1 month ago

Article

DABStep: Data Agent Benchmark for Multi-step Reasoning

Feb 4

• 67

upvoted a paper about 1 month ago

Towards Fully-Automated Materials Discovery via Large-Scale Synthesis Dataset and Expert-Level LLM-as-a-Judge

Paper • 2502.16457 • Published Feb 23 • 11

upvoted a collection about 1 month ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 18 items • Updated about 21 hours ago • 115

upvoted a collection 4 months ago

EXAONE-3.5

EXAONE 3.5 language model series including instruction-tuned models of 2.4B, 7.8B, and 32B • 10 items • Updated 9 days ago • 107

upvoted 6 collections 5 months ago

Cosmos Tokenizer

A suite of image and video tokenizers • 13 items • Updated about 18 hours ago • 40

Granite 3.0 Language Models

A series of language models trained by IBM licensed under Apache 2.0 license. We release both the base pretrained and instruct models. • 8 items • Updated about 1 month ago • 96

Arch-Function

6 items • Updated Oct 29, 2024 • 7

LLM Safety Datasets

Korean safety and ethics datasets • 9 items • Updated Nov 23, 2024 • 3

En Ko Translate

Datasets translated from English into Korean. • 4 items • Updated Nov 6, 2024 • 1

Magpie Conversation Ko

Korean translation of the Magpie dataset (translated with @nayohan's translation model) • 10 items • Updated Nov 6, 2024 • 1

upvoted a paper 5 months ago

MixEval: Deriving Wisdom of the Crowd from LLM Benchmark Mixtures

Paper • 2406.06565 • Published Jun 3, 2024 • 9

upvoted 3 collections 7 months ago

3D

Stability AI's suite of models for 3D generation • 6 items • Updated Jan 9 • 37

4bit Instruct Models

18 items • Updated 1 day ago • 28

DeepSeek-Prover

DeepSeek-V1-and-V1.5-Series • 7 items • Updated Aug 16, 2024 • 27

upvoted 2 collections 9 months ago

Magpie-Qwen2 Datasets

Dataset built with Qwen2 72B and Qwen2 7B. • 6 items • Updated Jan 13 • 10

Awesome feedback datasets

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12, 2024 • 68

upvoted a collection 10 months ago

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8, 2024 • 24

upvoted a paper 10 months ago

DoRA: Weight-Decomposed Low-Rank Adaptation

Paper • 2402.09353 • Published Feb 14, 2024 • 28

upvoted a collection 12 months ago

Eurus

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Oct 22, 2024 • 24