Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Sophie Xhonneux
sophiex
Follow
busycalibrating's profile picture
1 follower
·
3 following
AI & ML interests
LLM alignment and adversarial attacks/robustness
Recent Activity
updated
a model
about 1 month ago
sophiex/SFT-ATTACK
updated
a model
about 1 month ago
sophiex/SFT-ATTACK
updated
a model
4 months ago
sophiex/onlinedpo_pythia2.8b_tldr6.9
View all activity
Organizations
Papers
1
arxiv:
2405.15589
models
7
Sort: Recently updated
sophiex/SFT-ATTACK
Updated
Jan 7
sophiex/onlinedpo_pythia2.8b_tldr6.9
Updated
Sep 30, 2024
•
14
sophiex/dpo_pythia1b_hh_rlhf.yml_local_29-04-24_13-31-33_xxxxx
Updated
Apr 29, 2024
•
4
sophiex/dpo_pythia1b_hh_rlhf.yml_local_27-04-24_21-57-03_xxxxx
Updated
Apr 28, 2024
sophiex/config_name_xxxxx
Updated
Apr 27, 2024
sophiex/pythia-1b-sft_hh_rlhf
Text Generation
•
Updated
Apr 27, 2024
•
199
sophiex/pythia-410m-sft_hh_rlhf
Updated
Apr 26, 2024
datasets
1
sophiex/hh-rlhf
Viewer
•
Updated
Apr 12, 2024
•
169k
•
33