Jeonghwan Park's picture

Jeonghwan Park PRO

maywell

·

https://www.linkedin.com/in/jeonghwan-park-6b97b1245

AI & ML interests

None yet

Recent Activity

liked a model 15 days ago

Snowflake/snowflake-arctic-embed-m-v2.0

liked a model 18 days ago

deepseek-ai/DeepSeek-V3-0324

liked a model about 1 month ago

DevWorld/Gemago-2b

View all activity

Organizations

Posts 2

Post

9474

🔥 Transfer model's Chat feature, Context length and Knowledge to another under 1 minute without any train.

Imagine being able to create chat models, expand context, and transfer domain-specific knowledge to models, all within a matter of minutes. Our innovative approach, based on a combination of diff-based techniques and sigmoid ratio calculations, makes this possible.

By considering the diffs between the desired information model (long context or chat) and the base model, as well as the diffs between the base model and the target model, we can efficiently transfer features and expand context without the need for extensive training or resources.

Our method minimizes model degradation and ensures that only the desired information is captured, resulting in high-quality models that can be created with just a single click. Whether you need a chat model, expanded context, or domain-specific knowledge transfer, our approach offers a rapid and effective solution.

In blog post below, we will dive into the details of our method, provide code examples, and showcase the impressive results achieved using our approach. Get ready to revolutionize your model creation process and unlock new possibilities with this powerful technique.

Blog - https://huggingface.co/blog/maywell/llm-feature-transfer

Articles 2

Article

38

Expanding Model Context and Creating Chat Models with a Single Click

View all Articles

Collections 4

models 68

maywell/Synatra-7B-v0.3-dpo

Text Generation • Updated Aug 13, 2024 • 3.24k • 29

maywell/EXAONE-3.0-7.8B-Instruct-Llamafied

Updated Aug 8, 2024 • 2.53k • 39

maywell/Llama-3-Ko-Luxia-Instruct

Text Generation • Updated Jul 3, 2024 • 1 • 3

maywell/Llama-3-Ko-8B-Instruct

Text Generation • Updated Jun 25, 2024 • 1.66k • 31

maywell/Qwen2-7B-Multilingual-RP

Text Generation • Updated Jun 25, 2024 • 1.69k • 55

maywell/Yi-Ko-34B-Instruct

Text Generation • Updated May 18, 2024 • 1 • 3

maywell/l3-211m

Text Generation • Updated Apr 30, 2024 • 51 • 2

maywell/miqu-evil-dpo

Text Generation • Updated Apr 25, 2024 • 22

maywell/Llama-3-Synatra-11B-v1-20k

Text Generation • Updated Apr 21, 2024 • 2 • 9

maywell/Llama-3-Synatra-11B-v1

Text Generation • Updated Apr 21, 2024 • 60 • 12

datasets 26

maywell/koVast

Viewer • Updated Nov 20, 2024 • 685k • 157 • 24

maywell/NoReject_OpusShareGPT

Viewer • Updated Jun 22, 2024 • 720k • 9

maywell/LogicKor

Preview • Updated Jun 9, 2024 • 55 • 20

maywell/ko_youtube_transcription_sample

Viewer • Updated May 23, 2024 • 11.6k • 38 • 26

maywell/ko_youtube_transcription_v2

Viewer • Updated May 21, 2024 • 96.6k • 8

maywell/logickor_evaluators

Preview • Updated May 7, 2024 • 73 • 2

maywell/hh-rlhf-nosafe

Viewer • Updated Apr 3, 2024 • 125k • 56 • 4

maywell/kiqu_samples_filtered_2

Viewer • Updated Mar 20, 2024 • 23.6k • 25

maywell/test_kiqu

Viewer • Updated Mar 19, 2024 • 3k • 25 • 2

maywell/bge_portion

Viewer • Updated Mar 9, 2024 • 648k • 166