Cartinoe5930 (Hyunwoo Ko)

upvoted a paper 19 days ago

RLHF Workflow: From Reward Modeling to Online RLHF

Paper • 2405.07863 • Published 20 days ago • 57

upvoted an article 24 days ago

Article

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

By

•

26 days ago

• 25

upvoted a collection 27 days ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 29 items • Updated 3 days ago • 181

upvoted 2 papers about 1 month ago

Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models

Paper • 2405.01535 • Published May 2 • 103

Replacing Judges with Juries: Evaluating LLM Generations with a Panel of Diverse Models

Paper • 2404.18796 • Published Apr 29 • 63

upvoted 2 articles about 1 month ago

Article

Expanding Model Context and Creating Chat Models with a Single Click

By

•

Apr 28

• 33

Article

Can We Train Chat Models with Raw Data?

By

•

Apr 25

• 17

upvoted 2 papers about 1 month ago

OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework

Paper • 2404.14619 • Published Apr 22 • 122

Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone

Paper • 2404.14219 • Published Apr 22 • 239

upvoted a paper about 2 months ago

Learn Your Reference Model for Real Good Alignment

Paper • 2404.09656 • Published Apr 15 • 80

upvoted 2 articles about 2 months ago

Article

Mergoo: Efficiently Build Your Own MoE LLM

By

•

27 days ago

• 32

Article

Mixture of Experts Explained

Dec 11, 2023

• 81

upvoted 5 papers 2 months ago

upvoted a collection 2 months ago

ORPO

Collection

This is the official collection of "ORPO: Monolithic Preference Optimization without Reference Model". • 5 items • Updated Apr 12 • 10

upvoted 2 papers 3 months ago

RAFT: Adapting Language Model to Domain Specific RAG

Paper • 2403.10131 • Published Mar 15 • 64

Yi: Open Foundation Models by 01.AI

Paper • 2403.04652 • Published Mar 7 • 59

upvoted a collection 3 months ago

Matryoshka Embedding Models

Collection

https://huggingface.co/blog/matryoshka • 12 items • Updated 18 days ago • 10

upvoted 6 papers 4 months ago

Self-Discover: Large Language Models Self-Compose Reasoning Structures

Paper • 2402.03620 • Published Feb 6 • 102

More Agents Is All You Need

Paper • 2402.05120 • Published Feb 3 • 46

PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models

Paper • 2402.01118 • Published Feb 2 • 28

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31 • 55

OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1 • 76

Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling

Paper • 2401.16380 • Published Jan 29 • 46

upvoted a collection 4 months ago

MoEs papers reading list

Collection

43 items • Updated 4 days ago • 123

upvoted 5 papers 5 months ago

Contrastive Preference Optimization: Pushing the Boundaries of LLM Performance in Machine Translation

Paper • 2401.08417 • Published Jan 16 • 27

Mixtral of Experts

Paper • 2401.04088 • Published Jan 8 • 153

Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

Paper • 2401.01335 • Published Jan 2 • 61

SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling

Paper • 2312.15166 • Published Dec 23, 2023 • 55

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 131

upvoted 2 collections 5 months ago

Awesome feedback datasets

Collection

A curated list of datasets with human or AI feedback. Useful for training reward models or applying techniques like DPO. • 19 items • Updated Apr 12 • 53

Journal Club

Collection

Candidate papers to read in the H4 journal club • 54 items • Updated Apr 21 • 24

upvoted 2 papers 6 months ago

Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision

Paper • 2312.09390 • Published Dec 14, 2023 • 32

TinyGSM: achieving >80% on GSM8k with small language models

Paper • 2312.09241 • Published Dec 14, 2023 • 34

upvoted a collection 6 months ago

Awesome SFT datasets

Collection

A curated list of interesting datasets to fine-tune language models with. • 43 items • Updated Apr 12 • 94

upvoted a paper 6 months ago

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 173

upvoted a collection 6 months ago

GAIA release

Collection

Gather the items of the GAIA release • 4 items • Updated Nov 23, 2023 • 17

upvoted a paper 6 months ago

Orca 2: Teaching Small Language Models How to Reason

Paper • 2311.11045 • Published Nov 18, 2023 • 69

upvoted 3 papers 7 months ago

Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2

Paper • 2311.10702 • Published Nov 17, 2023 • 17

Contrastive Chain-of-Thought Prompting

Paper • 2311.09277 • Published Nov 15, 2023 • 31

Zephyr: Direct Distillation of LM Alignment

Paper • 2310.16944 • Published Oct 25, 2023 • 116

upvoted 3 papers 8 months ago

Prometheus: Inducing Fine-grained Evaluation Capability in Language Models

Paper • 2310.08491 • Published Oct 12, 2023 • 50

DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models

Paper • 2309.03883 • Published Sep 7, 2023 • 14

Large Language Models Cannot Self-Correct Reasoning Yet

Paper • 2310.01798 • Published Oct 3, 2023 • 30

upvoted a paper 9 months ago

Textbooks Are All You Need II: phi-1.5 technical report

Paper • 2309.05463 • Published Sep 11, 2023 • 84

upvoted 5 papers 10 months ago

Self-Alignment with Instruction Backtranslation

Paper • 2308.06259 • Published Aug 11, 2023 • 38

Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding

Paper • 2307.15337 • Published Jul 28, 2023 • 34

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback

Paper • 2307.15217 • Published Jul 27, 2023 • 34

Measuring Faithfulness in Chain-of-Thought Reasoning

Paper • 2307.13702 • Published Jul 17, 2023 • 26

FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets

Paper • 2307.10928 • Published Jul 20, 2023 • 11

upvoted 3 papers 11 months ago

Llama 2: Open Foundation and Fine-Tuned Chat Models

Paper • 2307.09288 • Published Jul 18, 2023 • 235

AlpaGasus: Training A Better Alpaca with Fewer Data

Paper • 2307.08701 • Published Jul 17, 2023 • 21

Flacuna: Unleashing the Problem Solving Power of Vicuna using FLAN Fine-Tuning

Paper • 2307.02053 • Published Jul 5, 2023 • 23

Hyunwoo Ko

AI & ML interests

Organizations

Cartinoe5930's activity

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

Expanding Model Context and Creating Chat Models with a Single Click

Can We Train Chat Models with Raw Data?

Mergoo: Efficiently Build Your Own MoE LLM

Mixture of Experts Explained