1 1 8

Xinyue Shen

vera365

AI & ML interests

None yet

Recent Activity

updated a dataset about 2 months ago

TrustAIRLab/HateBenchSet

liked a dataset 3 months ago

TrustAIRLab/HateBenchSet

published a dataset 3 months ago

TrustAIRLab/HateBenchSet

View all activity

Organizations

vera365's activity

updated a dataset about 2 months ago

TrustAIRLab/HateBenchSet

Viewer • Updated Mar 1 • 15.7k • 105 • 5

liked a dataset 3 months ago

TrustAIRLab/HateBenchSet

Viewer • Updated Mar 1 • 15.7k • 105 • 5

published a dataset 3 months ago

TrustAIRLab/HateBenchSet

Viewer • Updated Mar 1 • 15.7k • 105 • 5

New activity in TrustAIRLab/in-the-wild-jailbreak-prompts 5 months ago

Enormous amount of duplicates

#3 opened 5 months ago by

AmenRa

updated a dataset 5 months ago

TrustAIRLab/in-the-wild-jailbreak-prompts

Viewer • Updated Nov 19, 2024 • 21.5k • 837 • 13

updated a dataset 7 months ago

TrustAIRLab/forbidden_question_set

Viewer • Updated Oct 10, 2024 • 390 • 279 • 3

liked 2 datasets 7 months ago

TrustAIRLab/in-the-wild-jailbreak-prompts

Viewer • Updated Nov 19, 2024 • 21.5k • 837 • 13

TrustAIRLab/forbidden_question_set

Viewer • Updated Oct 10, 2024 • 390 • 279 • 3

updated a Space 7 months ago

README

💻

liked a dataset 11 months ago

vera365/lexica_dataset

Viewer • Updated May 16, 2024 • 61.5k • 303 • 4

updated a dataset 11 months ago

vera365/lexica_dataset

Viewer • Updated May 16, 2024 • 61.5k • 303 • 4

updated a model about 1 year ago

vera365/llama-7b-qlora-ultrachat

Updated Mar 10, 2024

upvoted a paper over 1 year ago

"Do Anything Now": Characterizing and Evaluating In-The-Wild Jailbreak Prompts on Large Language Models

Paper • 2308.03825 • Published Aug 7, 2023 • 2

updated 2 models over 1 year ago

vera365/git-base-pokemon

Image-Text-to-Text • Updated Oct 14, 2023 • 4

vera365/git-base-coco-pokemon

Updated Oct 14, 2023

liked 2 models over 1 year ago

facebook/roberta-hate-speech-dynabench-r4-target

Text Classification • Updated Mar 16, 2023 • 1.93M • 80

GroNLP/hateBERT

Fill-Mask • Updated Jun 2, 2023 • 5.46k • 31

liked 2 datasets almost 2 years ago

Anthropic/hh-rlhf

Viewer • Updated May 26, 2023 • 169k • 15.6k • 1.32k

lighteval/civil_comments_helm

Viewer • Updated May 4, 2023 • 623k • 167 • 1