
reacted to gsarti's post with 👍 11 months ago
๐Ÿ” Today's pick in Interpretability & Analysis of LMs: Recovering the Pre-Fine-Tuning Weights of Generative Models by @eliahu , J. Kahana, Y. Hoshen

Using low-rank adapters (LoRA) is nowadays common practice for fine-tuning pre-trained generative models on specific tasks or aligning them to human preferences.

This work explores pre-fine-tuning weight recovery: given a set of LoRA models with merged weights, all fine-tuned from the same pre-trained system, the task is to recover the original (unknown) weights of that pre-trained model.
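
In symbols (notation mine, a standard merged-LoRA formulation rather than the paper's exact one):

```latex
W_i \;=\; W^{*} + B_i A_i ,
\qquad B_i \in \mathbb{R}^{d \times r},\;
A_i \in \mathbb{R}^{r \times k},\;
i = 1, \dots, n ,
```

where only the merged matrices $W_1, \dots, W_n$ are observed and the pre-trained $W^{*}$ is to be recovered.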

The authors propose SpectralDeTuning, a method framing the task as an optimisation problem that alternates between an SVD-based approximation step for all low-rank tuned matrices and a closed-form computation of the optimal pre-trained matrix given the approximated low-rank ones.
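
For intuition, here is a minimal NumPy sketch of such an alternating scheme, reconstructed from the description above rather than taken from the authors' released code; the function names and the toy setup are my own, and it assumes the attacker knows the LoRA rank r (the very assumption flagged as a limitation below):

```python
import numpy as np

def low_rank_approx(M, r):
    """Best rank-r approximation of M via truncated SVD (Eckart-Young)."""
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    return (U[:, :r] * s[:r]) @ Vt[:r]

def spectral_detuning(merged, r, n_iters=300):
    """Estimate the shared pre-trained matrix W* from merged LoRA
    weights W_i = W* + B_i A_i, assuming the rank r is known.
    (Hypothetical reimplementation, not the paper's code.)"""
    # Initialise the estimate of W* as the mean of the merged matrices.
    W = np.mean(merged, axis=0)
    for _ in range(n_iters):
        # Step 1: approximate each low-rank update as the best rank-r
        # fit (via SVD) of the residual W_i - W.
        deltas = [low_rank_approx(Wi - W, r) for Wi in merged]
        # Step 2: closed-form minimiser of sum_i ||W_i - W - D_i||_F^2
        # over W, with the D_i fixed: the mean of the W_i - D_i.
        W = np.mean([Wi - Di for Wi, Di in zip(merged, deltas)], axis=0)
    return W

# Toy check on synthetic weights (not the LoWRA benchmark):
rng = np.random.default_rng(0)
W_star = rng.standard_normal((64, 64))
merged = [W_star + rng.standard_normal((64, 4)) @ rng.standard_normal((4, 64))
          for _ in range(12)]
W_hat = spectral_detuning(merged, r=4)
print(np.linalg.norm(W_hat - W_star) / np.linalg.norm(W_star))
```

Each alternation cannot increase the Frobenius objective: step 1 is the optimal rank-r fit for a fixed estimate (Eckart-Young), and step 2, averaging the residual-corrected matrices, is the least-squares optimum for fixed low-rank terms.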

The LoRA Weight Recovery Attack (LoWRA) benchmark is introduced to evaluate pre-fine-tuning weight recovery across language and vision tasks, covering ViT, Mistral, and Stable Diffusion models.

The SpectralDeTuning method is shown to recover original models effectively, both intrinsically (small differences in weights) and behaviourally (similar outputs). The main limitations of the approach are the assumption that the attacker knows the rank used by the LoRAs, and the relatively high number of LoRAs needed for a good approximation.

📄 Paper: Recovering the Pre-Fine-Tuning Weights of Generative Models (2402.10208)

💻 LoWRA Bench: Eliahu/LoWRA-Bench

🔍 All daily picks in LM interpretability: gsarti/daily-picks-in-interpretability-and-analysis-of-lms-65ae3339949c5675d25de2f9