Sunyoung Hwang's picture

Sunyoung Hwang PRO

sosoai

·

https://sosohajalab.com

sosonagi

AI & ML interests

llm, vision, transformers, megabytes

Organizations

sosoai's activity

upvoted a paper 3 days ago

AutoCoder: Enhancing Code Large Language Model with AIEV-Instruct

Paper • 2405.14906 • Published 7 days ago • 18

upvoted a paper 24 days ago

Scaling Down to Scale Up: A Guide to Parameter-Efficient Fine-Tuning

Paper • 2303.15647 • Published Mar 28, 2023 • 5

upvoted a paper 26 days ago

A Multimodal Automated Interpretability Agent

Paper • 2404.14394 • Published Apr 22 • 19

upvoted an article 28 days ago

Article

Expanding Model Context and Creating Chat Models with a Single Click

By

•

Apr 28

• 33

upvoted 2 collections about 1 month ago

ablation-models

1.8B models trained on 350BT to compare different pretraining datasets • 8 items • Updated 23 minutes ago • 20

Idefics2 🐶

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated 24 days ago • 83

upvoted a paper about 1 month ago

PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training

Paper • 2309.10400 • Published Sep 19, 2023 • 22

upvoted a collection 3 months ago

Qwen1.5

Qwen1.5 is the improved version of Qwen, the large language model series developed by Alibaba Cloud. • 55 items • Updated 17 days ago • 178

upvoted 3 collections 4 months ago

SLIM Models

Structured Language Instruction Models (SLIMs) • 21 items • Updated 3 days ago • 25

zephyr-7b-sft-full-SPIN

Models fine-tuned with SPIN across iterations 0,1,2,3 • 4 items • Updated Feb 7 • 7

datasets-SPIN

Generated synthetic data used to finetune SPIN. • 8 items • Updated Feb 9 • 10