Here is my recipe (a rough, hedged sketch of each step follows below):

1. Expand the layers of NeuralBeagle to 10.7B à la frankenmerge.
2. DPO-tune the previous model with a high-quality preference dataset, argilla/distilabel-intel-orca-dpo-pairs.
3. Merge the previous model with CarbonVillain (needs --allow-crimes in mergekit! 🔪).
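Step 1 is a passthrough ("frankenmerge") run in mergekit. The sketch below assumes the base model is mlabonne/NeuralBeagle14-7B (the recipe only says "NeuralBeagle") and a SOLAR-style layer split of [0, 24] + [8, 32] to reach roughly 10.7B parameters; the exact slicing behind CarbonBeagle-11B may differ.

```python
# Step 1 (sketch): depth up-scale a 7B model to ~10.7B with a mergekit passthrough merge.
# Assumptions: mlabonne/NeuralBeagle14-7B as the base model and the SOLAR-style
# [0, 24] + [8, 32] layer split; both are illustrative, not confirmed by the post.
import subprocess
from pathlib import Path

FRANKENMERGE_CONFIG = """\
slices:
  - sources:
      - model: mlabonne/NeuralBeagle14-7B   # assumed base checkpoint
        layer_range: [0, 24]
  - sources:
      - model: mlabonne/NeuralBeagle14-7B
        layer_range: [8, 32]
merge_method: passthrough                   # stack the slices without mixing weights
dtype: bfloat16
"""

Path("frankenmerge.yaml").write_text(FRANKENMERGE_CONFIG)

# mergekit-yaml is mergekit's CLI entry point: config in, merged model directory out.
subprocess.run(
    ["mergekit-yaml", "frankenmerge.yaml", "./neuralbeagle-10.7b"],
    check=True,
)
```

The 24 + 24 layer stack mirrors the depth up-scaling used for SOLAR-10.7B, which is why the result lands at roughly 10.7B parameters.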
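Step 2 can be run with TRL's DPOTrainer. The sketch below assumes a recent trl release (where DPOConfig and the processing_class argument exist), a full fine-tune rather than LoRA, and illustrative hyperparameters; none of those details are given in the recipe. In this dataset the prompt lives in the input column, so it is renamed for the trainer, and the system column is simply dropped here.

```python
# Step 2 (sketch): DPO-tune the up-scaled model on the Orca DPO pairs with TRL.
# Assumptions: hyperparameters, full fine-tuning (no LoRA), and the column handling
# are illustrative; the original training setup is not documented in the post.
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_path = "./neuralbeagle-10.7b"  # output of the frankenmerge step above
model = AutoModelForCausalLM.from_pretrained(model_path)
tokenizer = AutoTokenizer.from_pretrained(model_path)

# DPOTrainer expects "prompt"/"chosen"/"rejected"; this dataset stores the prompt as "input".
dataset = load_dataset("argilla/distilabel-intel-orca-dpo-pairs", split="train")
dataset = dataset.rename_column("input", "prompt")
dataset = dataset.select_columns(["prompt", "chosen", "rejected"])

training_args = DPOConfig(
    output_dir="./neuralbeagle-10.7b-dpo",
    per_device_train_batch_size=1,
    gradient_accumulation_steps=8,
    learning_rate=5e-6,
    num_train_epochs=1,
    beta=0.1,          # standard DPO temperature
    bf16=True,
)

trainer = DPOTrainer(
    model=model,                   # a frozen copy is used internally as the reference model
    args=training_args,
    train_dataset=dataset,
    processing_class=tokenizer,    # named `tokenizer` in older trl releases
)
trainer.train()
trainer.save_model()
```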
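Step 3 is another mergekit run, and --allow-crimes tells mergekit to proceed even though the two models' configurations don't match exactly. The sketch assumes CarbonVillain refers to jeonsworld/CarbonVillain-en-10.7B-v4 and uses an equal-weight linear merge purely for illustration; the actual checkpoint, merge method, and weights behind CarbonBeagle-11B aren't stated here.

```python
# Step 3 (sketch): merge the DPO-tuned model with CarbonVillain via mergekit.
# Assumptions: jeonsworld/CarbonVillain-en-10.7B-v4 as the CarbonVillain checkpoint
# and an equal-weight linear merge; only the --allow-crimes flag is confirmed above.
import subprocess
from pathlib import Path

FINAL_MERGE_CONFIG = """\
models:
  - model: ./neuralbeagle-10.7b-dpo              # DPO-tuned frankenmerge from step 2
    parameters:
      weight: 0.5
  - model: jeonsworld/CarbonVillain-en-10.7B-v4  # assumed CarbonVillain checkpoint
    parameters:
      weight: 0.5
merge_method: linear                             # illustrative choice of merge method
dtype: bfloat16
"""

Path("carbonbeagle.yaml").write_text(FINAL_MERGE_CONFIG)

# --allow-crimes lets mergekit continue despite mismatched model configs/architectures.
subprocess.run(
    ["mergekit-yaml", "carbonbeagle.yaml", "./CarbonBeagle-11B", "--allow-crimes"],
    check=True,
)
```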
And here is the resulting model, CarbonBeagle-11B, which ranked at the top of the leaderboard for its size class: vicgalle/CarbonBeagle-11B