Sungwoo Oh

sackoh

AI & ML interests

None yet

Recent Activity

Organizations

KotterAI, KB-AI Research, KIFAI, Gradio-Blocks-Party

sackoh's activity

posted an update 7 months ago
🚀 Release of open-source Korean LLM: GECKO-7B

I am delighted to share my recent project, GECKO, a bilingual large language model for Korean and English 🇰🇷🇺🇸. This initiative was inspired by the lack of resources for Korean large language models.

@donggyukimc and I wrote the technical report to share our insights and experiences from developing the model. While our model may not achieve state-of-the-art performance on all benchmarks, it shows modest results with a relatively small number of pretraining tokens.

I hope GECKO contributes to the open-source community, offering resources that can be built upon and improved. I believe that through collaboration and shared knowledge, we can advance the capabilities and accessibility of large language models for Korean and other low-resource languages.

🤗 Model: kifai/GECKO-7B
📄 Technical Report: https://arxiv.org/pdf/2405.15640
New activity in openchat/openchat-3.6-8b-20240522 7 months ago

Thanks for sharing new model

#1 opened 7 months ago by sackoh
New activity in kifai/KoInFoBench 8 months ago
New activity in google-research-datasets/mbpp 8 months ago

This dataset is broken!

#5 opened 10 months ago by j3m
reacted to akhaliq's post with ❤️ 12 months ago
Here is my selection of papers for today (12 Jan)

https://huggingface.co/papers

PALP: Prompt Aligned Personalization of Text-to-Image Models

Object-Centric Diffusion for Efficient Video Editing

TRIPS: Trilinear Point Splatting for Real-Time Radiance Field Rendering

Diffusion Priors for Dynamic View Synthesis from Monocular Videos

Parrot: Pareto-optimal Multi-Reward Reinforcement Learning Framework for Text-to-Image Generation

TOFU: A Task of Fictitious Unlearning for LLMs

Patchscope: A Unifying Framework for Inspecting Hidden Representations of Language Models

Secrets of RLHF in Large Language Models Part II: Reward Modeling

LEGO: Language Enhanced Multi-modal Grounding Model

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Tuning LLMs with Contrastive Alignment Instructions for Machine Translation in Unseen, Low-resource Languages

A Shocking Amount of the Web is Machine Translated: Insights from Multi-Way Parallelism

Towards Conversational Diagnostic AI

Transformers are Multi-State RNNs

Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training

Distilling Vision-Language Models on Millions of Videos

Efficient LLM inference solution on Intel GPU

TrustLLM: Trustworthiness in Large Language Models